Blogs
How to Live When You Feel Like an Outlier
April 03, 2026
On calibration, compromise, and the art of remaining fully yourself while becoming easier to trust, understand, and include — through the lens of machine learning and life.
Thought on Jensen Huang on Lex Fridman
March 27, 2026
Reflections on Jensen Huang’s interview with Lex Fridman — on systems thinking, CUDA, organizational architecture, and why NVIDIA’s deepest moat is institutional.
Teaching an LLM to Coach Itself: Multi-Agent Math Tutoring with Reinforcement Learning
March 07, 2026 · Mohammed Alshehri
Training a Solver-Coach-Reviser system on Hendrycks MATH using Tinker RL and Qwen3-8B.
Recursive Language Models as Procedural Scaling
February 08, 2026 · Mohammed Alshehri
Why long-context is not one problem, and how chaining bounded model calls through a REPL offers a cleaner alternative to context-window maximalism.
How RL Changed My Taste in AI Systems
February 04, 2026 · Mohammed Alshehri
A narrative learning path through reinforcement learning — from vague fascination to implementing, diagnosing, and iterating on RL systems with confidence.
Reinforcement Learning vs Regular Training: The Real Difference Is Not the Math, It Is the Loop
January 09, 2026 · Mohammed Alshehri
Why RL feels qualitatively different from supervised learning, and how the feedback loop creates systems challenges that go far beyond the algorithm.
GPT-OSS Safeguard as Policy-Executable Safety, and the Cabinet Briefing Risk Scanner Built on Top of It
January 03, 2026 · Mohammed Alshehri
A systems-focused account of how GPT-OSS Safeguard can be used as a policy-executable safety component, operationalized into a Cabinet Briefing Risk Scanner for high-stakes government communications.
2025: The Year I Bet on Myself
January 01, 2026 · Mohammed Alshehri
A personal reflection on leaving IBM, building startups from zero, and the lessons learned from a full year of betting on yourself as a founder.
Research Imperatives and the Struggle for Algorithmic Dominance
December 14, 2025 · Mohammed Alshehri
An in-depth analysis of the AI race between Google DeepMind, OpenAI, and Anthropic — their origins, strategies, and the rivalry shaping the future of artificial intelligence.
My Thoughts on Sora 2
October 15, 2025
Sora 2 isn’t just another AI milestone — it’s a revolution that changes how we define creativity, truth, and perception itself.
RL: Why It Matters More Than Ever
October 10, 2025 · Mohammed Alshehri
A comprehensive guide to Reinforcement Learning — from Q-Learning and DQN to A3C, PPO, and real-world projects. Covers the math, the code, the key papers, and why RL is more important than ever.
Evolutionary Algorithms: A Practical Introduction in Python
October 03, 2025
A practical introduction to programming Evolutionary Algorithms in Python for optimization tasks, covering Hill Climbers, selection methods, and TPOT.
Are Open Source Models Actually Free?
September 30, 2025
A look at the hidden costs behind open source AI models — compute, infrastructure, fine-tuning, and operational overhead that don’t show up in the license.
Context Engineering Guide
July 08, 2025
A concrete step-by-step guide to context engineering in practice — from system prompts and structured outputs to RAG, memory, and tool calling in AI agent workflows.
What is Model Context Protocol (MCP)? How it simplifies AI integrations compared to APIs
June 30, 2025
MCP is a new open protocol that standardizes how AI agents connect to tools and data sources — like a USB-C port for AI. Here’s how it works and why it matters.
Let’s Talk AI Agents
June 16, 2025
Concurrency Models in Python
April 27, 2025
A deep dive into Python’s concurrency models — from the infamous GIL and multi-threading to multiprocessing, concurrent.futures, and Futures — inspired by Fluent Python.