The OpenNash Field Notes
Working notes from the frontier of agent deployment.
Work with OpenNash
Need an agent that still works after launch?
We build production AI agents with the evals, routing, tools, review loops, and runbooks that keep them useful in the real world.
FIG 01 This week's feature
June 9, 2026
Security
11 min read
Mythos/Fable 5 ExploitBench: From Crash to Code Execution
A plain-English breakdown of Mythos/Fable 5, ExploitBench, and why crash-to-code-execution capability changes AI cyber risk.
FIG 02 Recently published
3 notes
Mythos/Fable 5 Evals: Awareness and Sandbagging
What Mythos/Fable 5 reveals about safety evals, evaluation awareness, sandbagging, and why models may behave differently under test.
Mythos/Fable 5 Bio Risk: Why Anthropic Stops Short of CB-2
Why Anthropic judged Mythos 5 below CB-2 despite strong biology results, and what that says about AI bio risk thresholds.
Claude Fable 5 and Mythos 5: Same Weights, Different Safeguards
How Claude Fable 5 and Mythos 5 use the same model with different safeguards for cyber, biology, chemistry, and trusted access.
FIG 03 The full index
All posts
News
Field guide
Decagon Alternatives in 2026: 7 Options Compared on Pricing, Deployment, and Ownership
AI Safety
Mythos/Fable 5 NLA: What Anthropic Found Inside
Engineering
Recursive Self-Improvement Needs a Testing Harness, Not Just a Smarter Agent
Field guide
AI Customer Service for Financial Services: Audit Trails, SOC 2, and What Regulators Will Actually Ask
Engineering
What Is AG-UI? The Protocol Layer Agentic Apps Were Missing
Compare
Agentforce vs Sierra vs Decagon vs OpenNash: AI Agent Fit Compared
Services
AI Agent Consulting Services: Build Production Agents You Own
Pricing
AI Agent Pricing 2026: Platforms, Retainers, and True Cost
Field guide
HIPAA-Compliant AI Customer Service: What Healthcare Buyers Actually Need to Verify
Field guide
Why You Need an AI Services Partner: AGI Ships, But It Does Not Self-Install
Field guide
Workflow Discovery for AI Automation: Map the Process Before You Build the Agent
Field guide
Decagon Alternatives in 2026: 7 Options Compared on Pricing, Deployment Time, and Ownership
Field guide
AI in Insurance: Where Deterministic Automation Saves Time First
Field guide
How to Eval AI Agents in 2026: From Pretty Demos to Production Evidence
Engineering
Macro Evals for Agentic Systems: Why One Failed Trace Is Not Enough
Market
Replacing Junior Jobs With AI Is a Bad Business Strategy
Engineering
Self-Improving AI Agents Are Not Magic: What OpenAI's Tax AI and Karpathy's Autoresearch Teach Builders
Engineering
What the Anthropic Cookbooks Reveal About Building Agents That Actually Work
Field guide
AI Agent Deployment Is Not the Finish Line: The Operating Model After Launch
Market
Open AI Strategy: Why Model Routing, Traces, and Memory Beat Vendor Lock-In
Market
Verifiable Outcomes: Why the Best AI Agents Start With Boring Proof
Field guide
Open Source Just Caught Up to Opus 4.5: Why the Moat Is the Harness, Not the Model
Market
AI Receptionist for Law Firms: Intake, Conflicts, Escalation, and Audit Trails
Market
Best AI Voice Agents for Customer Support: Retell, Bland, Synthflow, Sierra, and Custom Compared
Market
AI Customer Support Platform vs Custom AI Agent: Build, Buy, or Own in 2026
Market
Zendesk AI Agent Pricing: Licenses, Setup, and Resolution Fees
Market
Sierra AI Pricing: What Outcome-Based Really Costs and When to Walk Away
Market
Agentforce vs. Sierra vs. Custom AI Agents: Which Customer AI Platform Actually Fits?
Engineering
AI Coding Harnesses: Stop Writing Code, Start Designing the System Around the Agent
Engineering
Notion AI Agent Architecture: 5 Rebuilds, 100+ Tools, and What Finally Worked
Market
Why You Need a Professional Services Partner to Deploy AI Agents (And What That Actually Looks Like)
Market
Build vs Buy AI Customer Support: The 3-Year Cost Sierra, Ada, and Salesforce Hope You Skip
Market
Intercom Fin Pricing Reality: When $0.99 Per Resolution Becomes a $1.2M Problem
Market
Ada AI Review 2026: Pricing Reality, Platform Limits, and When Custom Wins
Market
Salesforce Agentforce vs. Custom AI Agents: When the CRM Giant Isn't the Right Fit
Market
Why 'Configurable' Isn't 'Custom': What Businesses Actually Get from AI Agent Platforms vs. Purpose-Built Software
Market
AI Agent Audit Trails: What Regulated Industries Need Before Deploying Agents
Engineering
LLM Routing in Production: How to Pick the Right Model for Each Task
Field guide
Open-Source LLMs for Business Automation: When They Beat GPT-4 and When They Don't
Engineering
Agent Reliability Engineering: SRE Patterns That Keep AI Agents Running in Production
Market
AI Agent UX: How to Design Interfaces That Users Actually Trust
Field guide
Human-in-the-Loop Agent Design: 5 Patterns for When AI Should Ask Permission
Market
Your Team's First AI Agent: 5 High-ROI Starting Points That Are Not Chatbots
Market
Build vs. Buy AI Automation in 2026: A Decision Framework for Technical Leaders
Engineering
Agent Memory Beyond RAG: Short-Term, Long-Term, and Episodic Memory Patterns That Work
Engineering
Evaluation-Driven Development: How to Ship AI Features That Actually Improve Over Time
Market
AI Labor Market Impact 2026: What to Automate Now and What Still Needs Humans
Engineering
The 8 Levels of Agentic Engineering: How Teams Move from Copilot to Background Agents
Field guide
AI Automation, SaaS Stock Drops, and the Future of Apps: What Actually Gets Repriced
Field guide
Why AI Agents Fail in Production: 7 Failure Modes and How to Prevent Them
Engineering
9 Agentic Workflow Patterns Ranked: Which Ones Actually Work in Production
Market
The $50-a-Day Agent: Cost Engineering for Production AI Workflows
Market
OpenClaw for Business: What It Is, How It Works, and When to Deploy It
Podcast
AI's Real Bottleneck Isn't Software. It's Materials.
Market
How to Pick an AI Agent Platform Without Getting Locked In
Engineering
6 Agentic Knowledge Base Patterns: How AI Agents Are Replacing Static Wikis
Market
From Pilot to Production: The Enterprise AI Agent Readiness Checklist
Engineering
The Lethal Trifecta: Securing AI Agents Against Data Exfiltration (Enterprise Checklist)
Market
5 AI Agent Use Cases That Work in 'Boring' Companies
Market
From Prototype to Production: The Agent Deployment Checklist
Engineering
LLM Evaluation Beyond ROUGE: Building Custom Evals for Enterprise Agents
Field guide
Agentic Workflows vs Traditional Automation: When to Choose Each (2026 Guide)
Market
The Lethal Trifecta: Why Your AI Agent Is a Data Leak Waiting to Happen
Market
AI Agent Evals for Non-Technical Leaders
Resource
CAISO Storage Daily Brief: Dispatch, Prices & Ancillaries (2026)
Resource
ERCOT Prices Today: Day-Ahead, Real-Time & Ancillary (2026)
Market
The Hidden Tax of Agents: Compound Error and Cost Explosions
Resource
Clinical Trial Sponsors List: Top 500 Recruiting (2026)
Resource
FDA 510(k) Cleared Companies: Complete List (2026)
Resource
Fortune 500 Full List (2026)
Resource
Y Combinator Companies: Full List (2026)
Podcast
Personal AI Infrastructure: How Scaffolding Changes Everything
Engineering
Five Production Agent Patterns from Anthropic's Playbook