Friday, April 24, 2026
Today's AI NEWS
7 stories in this edition.
OpenAI announces GPT-5.1 with native multimodal video understanding
GPT-5.1's video tower lets it analyze hour-long footage without external chunking — a real step beyond prior frame-sampling hacks.
DeepSeek-V3.2 quietly drops with 30% cheaper API pricing
Continues DeepSeek's price-per-quality dominance — frontier reasoning at a fraction of OpenAI/Anthropic rates.
Vercel ships AI Gateway v2 with provider failover
Native cross-provider failover at Vercel's edge means apps survive provider outages without app-level retry code.
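For context, the app-level retry code that gateway-side failover is meant to replace tends to look something like the sketch below. This is a minimal illustration, not Vercel's API: the provider callables, the `complete_with_failover` helper, and the `AllProvidersFailed` exception are hypothetical names, and a real client would also need timeouts and backoff.

```python
from typing import Callable, List, Sequence, Tuple


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain has failed (hypothetical name)."""


def complete_with_failover(
    prompt: str,
    providers: Sequence[Tuple[str, Callable[[str], str]]],
) -> Tuple[str, str]:
    """Try each provider in order; return (provider_name, response) from the first success."""
    errors: List[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose: any failure moves to the next provider
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Stand-in providers for illustration only; real ones would wrap SDK or HTTP calls.
def flaky_primary(prompt: str) -> str:
    raise TimeoutError("primary provider timed out")


def healthy_backup(prompt: str) -> str:
    return f"echo: {prompt}"


used, text = complete_with_failover(
    "hello",
    [("primary", flaky_primary), ("backup", healthy_backup)],
)
```

Here the call falls through to the backup after the primary raises. Moving this loop into the gateway is what lets application code drop it.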
Replicate launches per-second billing for image models
Image-gen costs drop ~40% for short jobs that were previously rounded up to the next full minute.
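The size of the saving depends on how far a job falls short of a full minute. The arithmetic can be sketched as below, assuming the old scheme rounded each job up to whole minutes and the new one rounds up to whole seconds. The function names and the 36-second job are illustrative assumptions, not Replicate's published figures; the result does not depend on the per-second rate, since both schemes multiply billed seconds by the same rate.

```python
import math


def billed_seconds_per_minute(duration_s: float) -> int:
    """Seconds billed when each job is rounded up to the next full minute."""
    return math.ceil(duration_s / 60) * 60


def billed_seconds_per_second(duration_s: float) -> int:
    """Seconds billed when each job is rounded up to the next full second."""
    return math.ceil(duration_s)


def savings_fraction(duration_s: float) -> float:
    """Fraction of the old bill saved by switching to per-second billing."""
    old = billed_seconds_per_minute(duration_s)
    new = billed_seconds_per_second(duration_s)
    return (old - new) / old


# Hypothetical 36-second image job: billed as 60 s before, 36 s after.
old_billed = billed_seconds_per_minute(36)
new_billed = billed_seconds_per_second(36)
saved = savings_fraction(36)  # (60 - 36) / 60
```

Under these assumptions a 36-second job saves 40% of its old bill, while a job that runs exactly 60 seconds saves nothing, which is why the benefit is concentrated in short jobs.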
Meta open-sources Llama 3.3 405B Instruct refresh
Refresh focuses on code and tool-use; closes much of the gap to Llama 4's still-unreleased Instruct variants.
Pinecone announces sunset of Starter (free) tier
Indie projects lose another free vector store option — Qdrant Cloud and Supabase pgvector pick up the slack.
Together AI raises Series C, pushes inference price war
Fresh capital funds aggressive open-model hosting prices; Together AI cuts its published inference rates by ~25%.