AI Newsletter

AI Newsletter

🤖 AI Agents Weekly: Claude Haiku 4.5, Deep Agents, SWE-grep, nanochat, Agent Skills, Veo 3.1 Fast, n8n AI Workflow Builder

Claude Haiku 4.5, Deep Agents, SWE-grep, nanochat, Agent Skills, Veo 3.1 Fast, n8n AI Workflow Builder

Oct 18, 2025
∙ Paid
15
2
1
Share

In today’s issue:

  • Anthropic introduces Claude Haiku 4.5

  • Deep Agents: The future of AI Agents

  • Demystifying RL in Agentic Reasoning

  • Cognition introduces SWE-grep for faster context retrieval

  • Andrej Karpathy releases nanochat

  • Anthropic introduced Agent Skills

  • n8n introduced AI Workflow Builder

  • OpenAI released gpt-5-search-api

  • Google DeepMind and Yale launched Cell2Sentence-Scale 27B

  • Google introduced Veo 3.1 and Veo 3.1 Fast

  • Top AI dev news, tools, and papers.



Top Stories

Claude Haiku 4.5

Chart comparing frontier models on SWE-bench Verified which measures performance on real-world coding tasks

Anthropic’s new small model, Claude Haiku 4.5, delivers near–frontier performance at a fraction of the cost and latency. It matches or beats Claude Sonnet 4 on real-world coding and computer-use benchmarks, making it ideal for responsive AI tasks such as live chat, coding agents, and customer-service automation.

  • Speed and efficiency – Haiku 4.5 runs 2× faster and costs â…“ as much as Sonnet 4, while achieving comparable SWE-bench Verified scores and even outperforming it on computer-use tasks.

  • Ideal for multi-agent workflows – Anthropic highlights pairing Sonnet 4.5 (for planning) with teams of Haiku 4.5s (for execution) to parallelize subtasks efficiently.

  • Safety-first design – In alignment testing, Haiku 4.5 showed the lowest rate of misaligned behaviors among all Claude models, earning the AI Safety Level 2 (ASL-2) classification, less restrictive than Sonnet 4.5’s ASL-3.

  • Broad deployment – Available via the Claude API, Claude Code, Amazon Bedrock, and Google Vertex AI, it’s positioned as a drop-in replacement for Haiku 3.5 or Sonnet 4 for developers seeking cost-efficient, real-time reasoning.

  • Benchmarks – On SWE-bench Verified, Terminal-Bench, τ²-Bench, AIME, OSWorld, and MMMLU, Haiku 4.5 maintains strong accuracy using 128K thinking budgets with tool use enabled.

Blog

Keep reading with a 7-day free trial

Subscribe to AI Newsletter to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 elvis
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture