AI Newsletter

AI Newsletter

🤖AI Agents Weekly: Gemini 2.5 Updates, Claude 4, II-Agent, Gemma 3n, MCIP, Veo 3

Gemini 2.5 Updates, Claude 4, II-Agent, Gemma 3n, MCIP, Veo 3

May 24, 2025
∙ Paid
17
1
Share

In today’s issue:

  • Google I/O Gemini 2.5 Updates

  • Anthropic announced Claude 4

  • DeepMind’s new creative stack and Veo 3

  • Microsoft Build updates and agent news

  • OpenAI Response API updates

  • Model Contextual Integrity Protocol

  • Google announced Gemma 3n

  • Running Claude’s coding agent programmatically

  • GitHub has launched the Copilot coding agent

  • II-Agent is a fully open-source, generalist AI assistant

  • DeepMind is evolving Gemini 2.5 Pro into a world model

  • Top AI devs news, research, and more



Top Stories

Google I/O Gemini Updates

Chart demonstrating Gemini 2.5 Pro Deep think's advanced capabilities

Google’s latest 2.5-series release builds on the March launch with stronger reasoning, new multimodal abilities, and tighter security. Highlights:

  • Performance leadership – Updated 2.5 Pro now tops WebDev Arena for coding and every LMArena leaderboard for overall quality. Its 1 M-token context window keeps state-of-the-art accuracy on long-context and video tasks, while LearnLM integration makes it the highest-rated model for pedagogy.

  • Deep Think mode – An experimental setting for 2.5 Pro that tests multiple solution hypotheses before answering. Early scores: 2025 USAMO (math) SOTA, 84 % on MMMU (multimodal reasoning), and leadership on LiveCodeBench for competition-level coding.

  • Faster, cheaper 2.5 Flash – The efficient sibling gains 20–30 % token savings yet improves on reasoning, multimodality, code, and long-context benchmarks. Preview is live in Google AI Studio, Vertex AI, and the Gemini app; GA in early June.

  • New multimodal & agentic capabilities

    • Native audio output with tone/accent control, multi-speaker TTS, and emotion-aware dialogue in the Live API.

    • Project Mariner “computer use” skills (UI automation) exposed via Gemini API/Vertex AI for RPA partners this summer.

  • Security and developer controls – Stronger defenses against indirect prompt injection; opt-in “thought summaries” expose model reasoning traces; thinking budgets let devs trade latency vs. quality; SDK now natively parses Model Context Protocol (MCP) tool definitions.

Blog

Keep reading with a 7-day free trial

Subscribe to AI Newsletter to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 elvis
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture