Blogs | Mohammed Alshehri

How to Live When You Feel Like an Outlier

April 03, 2026

On calibration, compromise, and the art of remaining fully yourself while becoming easier to trust, understand, and include — through the lens of machine learning and life.

Thoughts on Jensen Huang on Lex Fridman

March 27, 2026

Reflections on Jensen Huang’s interview with Lex Fridman — on systems thinking, CUDA, organizational architecture, and why NVIDIA’s deepest moat is institutional.

Teaching an LLM to Coach Itself: Multi-Agent Math Tutoring with Reinforcement Learning

March 07, 2026 · Mohammed Alshehri

Training a Solver-Coach-Reviser system on Hendrycks MATH using Tinker RL and Qwen3-8B.

Recursive Language Models as Procedural Scaling

February 08, 2026 · Mohammed Alshehri

Why long-context is not one problem, and how chaining bounded model calls through a REPL offers a cleaner alternative to context-window maximalism.

How RL Changed My Taste in AI Systems

February 04, 2026 · Mohammed Alshehri

A narrative learning path through reinforcement learning — from vague fascination to implementing, diagnosing, and iterating on RL systems with confidence.

Reinforcement Learning vs Regular Training: The Real Difference Is Not the Math, It Is the Loop

January 09, 2026 · Mohammed Alshehri

Why RL feels qualitatively different from supervised learning, and how the feedback loop creates systems challenges that go far beyond the algorithm.

GPT-OSS Safeguard as Policy-Executable Safety, and the Cabinet Briefing Risk Scanner Built on Top of It

January 03, 2026 · Mohammed Alshehri

A systems-focused account of how GPT-OSS Safeguard can be used as a policy-executable safety component, operationalized into a Cabinet Briefing Risk Scanner for high-stakes government communications.

2025: The Year I Bet on Myself

January 01, 2026 · Mohammed Alshehri

A personal reflection on leaving IBM, building startups from zero, and the lessons learned from a full year of betting on myself as a founder.

Research Imperatives and the Struggle for Algorithmic Dominance

December 14, 2025 · Mohammed Alshehri

An in-depth analysis of the AI race between Google DeepMind, OpenAI, and Anthropic — their origins, strategies, and the rivalry shaping the future of artificial intelligence.

My Thoughts on Sora 2

October 15, 2025

Sora 2 isn’t just another AI milestone — it’s a revolution that changes how we define creativity, truth, and perception itself.

RL: Why It Matters More Than Ever

October 10, 2025 · Mohammed Alshehri

A comprehensive guide to Reinforcement Learning — from Q-Learning and DQN to A3C, PPO, and real-world projects. Covers the math, the code, the key papers, and why RL is more important than ever.

Evolutionary Algorithms: A Practical Introduction in Python

October 03, 2025

A practical introduction to programming evolutionary algorithms in Python for optimization tasks, covering hill climbers, selection methods, and TPOT.

Are Open Source Models Actually Free?

September 30, 2025

A look at the hidden costs behind open source AI models — compute, infrastructure, fine-tuning, and operational overhead that don’t show up in the license.

Context Engineering Guide

July 08, 2025

A concrete step-by-step guide to context engineering in practice — from system prompts and structured outputs to RAG, memory, and tool calling in AI agent workflows.

What is Model Context Protocol (MCP)? How it simplifies AI integrations compared to APIs

June 30, 2025

MCP is a new open protocol that standardizes how AI agents connect to tools and data sources — like a USB-C port for AI. Here’s how it works and why it matters.

Let’s Talk AI Agents

June 16, 2025

Concurrency Models in Python

April 27, 2025

A deep dive into Python’s concurrency models — from the infamous GIL and multi-threading to multiprocessing, concurrent.futures, and Futures — inspired by Fluent Python.
