SmartDuke Technologies
Blog · Field notes

Long-form
shipping notes.

Essays, teardowns, and patterns from real client engagements. Written for the engineers and product builders shipping AI products.

All essays · 10

Latest from the lab.

Network of connected nodes representing multiple AI techniquesPatterns
Patterns·11 min

RAG vs agents vs fine-tuning: when each one wins.

Three techniques. Three different problems. Most teams reach for the wrong one because they're picking based on hype, not problem shape. Here's the honest decision framework.

Read essay
Abstract financial data and budget planning visualization$Opinion
Opinion·8 min

How much does it cost to build an AI agent in 2026?

Pricing for AI work is opaque. Here's the honest breakdown — what a prototype costs, what production costs, what operations costs, and what makes the numbers move.

Read essay
Dashboard with multiple analytics charts and graphsField notes
Field notes·13 min

LangSmith vs Langfuse vs Arize vs Braintrust: comparing AI observability platforms.

Four platforms, four philosophies. We've shipped on all of them. Here's the honest comparison — what each does well, what each doesn't, and how to pick.

Read essay
Code editor showing structured test cases on a dark screenEngineering
Engineering·10 min

How to write your first AI eval suite without a framework.

You don't need LangSmith, Braintrust, or any platform to ship your first eval suite. Most production-grade evals start as 100 prompts in a JSON file and a script. Here's the playbook.

Read essay
Circuit board representing a hardened production AI systemOperations
Operations·9 min

AI safety in production: a checklist that actually ships.

Safety isn't a content filter you add at the end. It's an architecture. These six layers are non-negotiable before any AI product touches real users.

Read essay
Closeup of structured code with HTML-like markup on a dark screenEngineering
Engineering·10 min

Schema.org markup for AI engines: what actually works in 2026.

Most schema markup is wasted effort. The four types that actually move citation rates on ChatGPT, Perplexity, and Google AI Overviews — and what to skip.

Read essay
Abstract colorful pattern representing eval suite scoringEngineering
Engineering·12 min

Evals that actually catch regressions before users do.

The eval suite most teams ship with is a confidence-builder, not a regression detector. Here's the structure we use to catch real failures earlier.

Read essay
Black laptop with code representing agent loop architecturesPatterns
Patterns·18 min

Five agent-loop patterns we keep reaching for.

Planner-executor, ReAct, supervisor-worker, hierarchical, pure tool-calling. When each one fits, when it doesn't, and when to mix them.

Read essay
Cascading binary code representing input streams flowing into a systemOperations
Operations·9 min

Guardrails that survive contact with real users.

Why bolt-on safety layers fail and what production-grade guardrail architecture actually looks like in 2026.

Read essay
Binary code background representing AI search engines parsing structured contentField notes
Field notes·14 min

GEO and AEO: the new search stack for AI-native brands.

Citation-rate has replaced rank-position. Here's how we instrument content for ChatGPT, Perplexity, and Google AI Overviews — and what we measure.

Read essay
Field notes · Monthly

One email a month.
No fluff, no funnel.

The teardowns, eval recipes, and architecture notes that helped us ship in the last 30 days. Written by the people who actually build the work.

Unsubscribe anytime. Never sold, never shared.