Claude Code CLI: The Tools, Tricks, and MCP Servers That Actually Matter
A practical guide to Claude Code's most useful features — CLAUDE.md memory, Plan Mode, MCP servers, custom slash commands, hooks, and Git worktrees for parallel sessions — with real commands and config examples.
Claude AI: The Features You're Probably Not Using (But Should Be)
A practical tour of Claude's most powerful and overlooked capabilities — Artifacts, Projects, MCP, prompt engineering patterns, and the tricks that actually change how you work.
Speculative Decoding: Making LLMs 2–3x Faster Without Breaking Anything
Technical diagram comparing standard autoregressive decoding (one token per forward pass) vs. speculative decoding (draft + parallel verify for 2–3x speedup)
GLM-5 vs Qwen 3.5 vs MiniMax M2.5: The Open-Weight LLM Showdown (2026 Edition)
A deep dive into benchmark scores, real-world performance, and architectural strategies for deploying GLM-5, Qwen 3.5, and MiniMax M2.5 open-source LLMs in production.
China’s LLM Face-off: GLM-5 vs Qwen3.5 vs MiniMax-M2 — Who Wins in Code, Reasoning, and AI Agents?
Conceptual diagram of a hybrid AI architecture using GLM-5 for reasoning and planning, Qwen 3.5 for multimodal UI and tools, and MiniMax M2.5 for high-speed code execution agents.