TODAY
Modal Labs adds GPU snapshot/restore for sub-second cold starts
Serverless GPU just got real — 70B model cold start under 800ms makes per-request inference economical.
Published: 2026-04-27
Sources
Tags
modalinfrastructuregpuserverless
TODAY
Serverless GPU just got real — 70B model cold start under 800ms makes per-request inference economical.
Sources
Tags
We use cookies
Anonymous analytics help us improve the site. You can opt out anytime. Learn more