All Posts

Welcome to My Blog

January 15, 2025•1 min read

An introduction to this blog and what you can expect to find here.

Managing State for Streaming AI Responses

November 22, 2025•4 min read

LLM responses arrive as chunks, not all at once. Handle loading, streaming, completion, and errors without breaking the user experience.

ai react typescript streaming state-management

Designing UI for Streaming AI Responses

November 22, 2025•7 min read

AI systems stream responses with variable length and timing. Here's how to design interfaces that show progress immediately and handle uncertainty gracefully.

ai design ui engineering llm

Prompt Caching: Design for Reuse

November 21, 2025•3 min read

Structure prompts to maximize Anthropic's prompt caching, reducing costs by 90% and latency by 85% for repeated context.

ai claude anthropic optimization engineering

LLM Evals: Testing AI Outputs Systematically

November 21, 2025•6 min read

How to test LLM outputs with code-based grading, human evaluation, and LLM-as-judge. When to use each method and why statistical rigor matters.

ai testing llm engineering evaluation

Designing Error Messages for LLMs

November 21, 2025•3 min read

Error messages consume context and affect LLM decision-making. Structure errors as data, use reference IDs for details, and return actionable recovery paths.

ai tools context-management engineering

Understanding MCP Resources

November 19, 2025•3 min read

Resources represent data or files that an MCP client can read. A case study of the SQLite MCP server shows how resources and tools work together.

mcp ai resources engineering

Tool Output Design for Context Efficiency

November 16, 2025•2 min read

How to design tool responses that preserve context space for what matters. Filter early, return minimal data, and structure outputs for LLM consumption.

ai tools context-management engineering

Give AI Agents the Map First

November 16, 2025•3 min read

AI agents work better when they see the full structure upfront, then make targeted requests. How to use progressive disclosure for efficient context management.

ai context-management claude mcp

Context Escape Velocity

November 16, 2025•3 min read

How to recognize when your conversation has grown too large to be effective, and what to do about it.

ai context-management debugging claude

Anatomy of a Context Window

November 16, 2025•2 min read

Understanding what fills an LLM's context window and how it affects model behavior.

reference context-window llm ai

Getting Your Next.js Site Indexed on Google

October 23, 2025•4 min read

Set up Google Analytics, verify your domain with DNS, and get your Next.js site appearing in search results.

nextjs seo vercel google-search-console

Deploying Next.js to Vercel with Git Integration

October 23, 2025•4 min read

Connect your GitHub repository to Vercel for automatic deployments every time you push code.

nextjs vercel deployment github git

What is a Token?

January 24, 2025•2 min read

Definition and explanation of tokens in large language models.

reference token llm ai

Model Context Protocol: Connecting AI to Your Tools

January 24, 2025•3 min read

MCP provides a standardized way for AIs to interact with tools, from Figma to your calendar to custom workflows you build yourself.

ai mcp claude anthropic tools

Prompting Techniques That Actually Work

January 23, 2025•5 min read

Five prompting techniques that improve LLM outputs: few-shot learning, chain-of-thought reasoning, XML structure, output constraints, and prompt chaining.

ai llm prompting claude engineering

How Prompt Priming Shapes LLM Responses

January 23, 2025•3 min read

Your prompt's opening sets the context for the entire response.

ai llm prompting vectors embeddings

How LLMs Think and Respond

January 22, 2025•3 min read

LLMs generate text one token at a time. Understanding how they convert text to vectors, use attention to weigh context, and predict probabilities explains their behavior.

ai llm transformers machine-learning

Debugging LLMs: Understanding Attention, Tokens, and Context

January 22, 2025•3 min read

When models fail or behave unexpectedly, you need to understand why. Practical debugging techniques for tokenization, attention patterns, and context limits.

ai debugging llm engineering

Progressive Disclosure in Agent Skills

January 21, 2025•1 min read

The architectural pattern that makes Agent Skills scalable: load only what's needed, when it's needed.

ai architecture agent-skills claude

Building Agent Skills: A Practical Guide

January 21, 2025•4 min read

Anthropic's Agent Skills let you equip Claude with specialized capabilities through reusable skill packages. Here's how to build them.

ai anthropic agent-skills claude mcp