OpenAI launches GPT‑5.3‑Codex‑Spark for real-time coding with 128k context

OpenAI has released a research preview of GPT‑5.3‑Codex‑Spark, a smaller and faster version of GPT‑5.3‑Codex, designed for real-time coding tasks. The model is optimized for ultra-low latency hardware in partnership with Cerebras, delivering more than 1,000 tokens per second while maintaining full coding capability.