NLP Newsletter

NLP Newsletter

Share this post

NLP Newsletter
NLP Newsletter
🧠 AI Agents Weekly: Claude 3.7 Sonnet, GPT-4.5, VisionAgent, Phi-4, Thinking QwQ
Copy link
Facebook
Email
Notes
More

🧠 AI Agents Weekly: Claude 3.7 Sonnet, GPT-4.5, VisionAgent, Phi-4, Thinking QwQ

Claude 3.7 Sonnet, GPT-4.5, VisionAgent, Phi-4, Thinking QwQ

Mar 01, 2025
∙ Paid
15

Share this post

NLP Newsletter
NLP Newsletter
🧠 AI Agents Weekly: Claude 3.7 Sonnet, GPT-4.5, VisionAgent, Phi-4, Thinking QwQ
Copy link
Facebook
Email
Notes
More
2
Share

In today’s issue:

  • Anthropic announces Claude 3.7 Sonnet

  • OpenAI launched GPT-4.5

  • VisionAgent is an open-source Python library using agents for vision tasks

  • DeepSeek Open-source Week: FlashMLA, DeepEP, 3FS,…

  • Microsoft announced Phi-4

  • Qwen launches Thinking (QwQ)

  • Google’s new method for improving planning and reasoning with agents

  • Top AI dev news and much more.



Top Stories

Claude 3.7 Sonnet

Bar chart showing Claude 3.7 Sonnet as state-of-the-art for SWE-bench Verified

Anthropic has unveiled Claude 3.7 Sonnet, its most advanced AI model yet, alongside Claude Code, a new agentic coding tool.

Key highlights:

  • Hybrid reasoning model – Claude 3.7 Sonnet is the first AI model that dynamically switches between rapid responses and in-depth, step-by-step reasoning, controlled via API settings.

  • State-of-the-art coding performance – Benchmarks like SWE-bench Verified and TAU-bench show Claude 3.7 Sonnet leading in real-world coding tasks, outperforming previous models.

  • Extended thinking mode – Available on Pro, Team, and Enterprise plans, this feature allows deeper self-reflection for better accuracy in math, physics, and instruction-following.

  • Fine-tuned control over response depth – API users can limit the model’s "thinking budget" by setting a maximum number of reasoning tokens.

  • Launch of Claude Code – A command-line coding assistant that automates test-driven development, debugging, and large-scale refactoring directly from the terminal.

  • GitHub integration – Now available across all Claude plans, enabling seamless repository connections for debugging, documentation, and full-stack development.

  • Safer & more reliable – Reduces unnecessary refusals by 45% and includes enhanced defenses against prompt injection attacks.

Blog | Claude Code Overview | Extended Reasoning Report

Keep reading with a 7-day free trial

Subscribe to NLP Newsletter to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 elvis
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More