top of page
Search

ASR weekly AI newsletter #2: 25/Aug

  • Aug 25, 2025
  • 3 min read

Industry News & Market Analysis

Major AI Model Releases

DeepSeek V3.1: The Efficiency Champion

DeepSeek released V3.1, a hybrid reasoning model that changes the cost-performance equation in enterprise AI.Key highlights:

  • 671B total parameters, ~37B active (MoE), context window 164K

  • "Thinking mode" tokens allow controllable reasoning costs

  • Beats Claude 4 Sonnet in API cost (11% of price)

  • Benchmarks: AIME 93.1, LiveCodeBench 74.8, GPQA Diamond 80.1, SWE-Bench Verified 66%

  • Supports Anthropic API format, lowering migration costs

Sources:


Cohere Command R (Reasoning Model)

Cohere launched a new enterprise-focused LLM, Command R, targeting controllable tool use and balanced safety.

  • Open weights for research & deployment

  • Agentic reasoning & function calling well supported

  • Day-0 integration with major inference providers

  • Benchmarks show superior performance to GPT-OSS-120B

📌 Sources:


Google DeepMind Genie 3: Interactive World Simulator

DeepMind unveiled Genie 3, a multimodal world model simulating interactive environments.

  • Inputs: text, images, video

  • Persistent states off-camera

  • Used to train SIMA agents

  • Strategic applications in AI safety, embodied learning, and simulation-based evaluation

📌 Sources:


Qwen-Image-Edit (Alibaba)

Alibaba open-sourced Qwen-Image-Edit (20B parameters, Apache 2.0).

  • Supports bilingual editing (CN/EN)

  • Capable of object transformation & semantic content insertion

  • Ranked #2 globally on Image Editing Arena (ELO 1098)

  • Adopted into ComfyUI and Anycoder ecosystems

📌 Sources:


ByteDance Seed-OSS-36B

ByteDance released Seed-OSS-36B, a long-context dense model.

  • 36B parameters, 512K token context

  • Trained on 12T tokens without synthetic data

  • Enables controllable reasoning token allocation

  • Outperforms Qwen3/Hunyuan for long outputs

📌 Sources:


💰 Market Dynamics & Funding

Databricks Achieves Centicorn Status

  • Raised $100B valuation in Series K (one of very few centicorns)

  • New products: Lakebase (formerly Neon), Agent Bricks framework

  • Reinforces enterprise AI infrastructure dominance

📌 Sources:


Google’s AI Efficiency Revolution

  • Gemini prompts now 33× more energy-efficient and 44× lower in CO₂ footprint (2024→2025)

  • Typical query: ~0.24 Wh + 0.26 mL water

  • Achieved via system-level optimizations and cleaner energy sourcing

  • Veo 3 milestone: 100M+ videos created, massive TPU rollout, Gemini app expansion

📌 Sources:


🔧 Developer Ecosystem Evolution

  • DeepSeek V3.1 Anthropic API compatibility lowers migration costs

  • OpenAI Responses API adds connectors (Gmail, Calendar, Dropbox)

  • AGENTS.md spec adopted by Cursor, Amp, Jules, Factory

  • vLLM & SGLang support hybrid reasoning

  • Cline’s Auto Compact for multi-million-token workflows

  • MLX optimizes Apple Silicon local deployment

📌 Sources:


📊 Enterprise Reality Check

  • MIT Sloan Management Review (2025): ~95% of enterprise AI deployments fail

  • Human-in-the-loop workflows outperform full automation

  • Bubble risk: overvalued app-layer startups consolidating

  • Infrastructure providers remain resilient

📌 Sources:


🌍 Global Competition Landscape

  • China: DeepSeek’s cost leadership, open-source momentum, abundant compute/energy

  • U.S.: OpenAI’s trillion-dollar datacenter plans, Google’s efficiency focus, enterprise workflow integration

📌 Sources:


🔮 Strategic Implications

  • Efficiency over pure scale: hybrid reasoning & token efficiency

  • API standardization reduces switching costs

  • Multi-tool workflows are becoming enterprise standard

  • Market segmentation: enterprise-grade vs developer-focused ecosystems

📌 Sources:

 
 
 

Recent Posts

See All
ASR's Weekly AI bites # 4: 2/Sep

Safety & Security Developments OpenAI × Anthropic Cross-Evaluations OpenAI Official: Joint Safety Evaluation Findings BankInfoSecurity:...

 
 
 

Comments


© 2024 by Anmol Shantha Ram

bottom of page