OpenAI rolls out GPT-5.5 with improved coding, research, and automation capabilities

OpenAI has introduced GPT-5.5, a new model designed for real-world computing tasks including coding, research, data analysis, document creation, and software interaction. It moves beyond single-response AI toward full workflow execution with planning, tool use, and task completion.

The model focuses on understanding intent faster and handling multi-step tasks with minimal guidance. It is designed for agent-based environments where AI systems can continue working across tools until a task is fully completed.

GPT-5.5

GPT-5.5 is an agentic AI model built to operate across software environments, codebases, documents, and data systems. It can interpret goals, break them into steps, and execute them using tools while maintaining long-context awareness.

It improves reasoning efficiency while maintaining GPT-5.4-level latency in real-world serving. The model is also more token-efficient, producing higher-quality outputs with fewer computational steps, especially in coding and structured workflows.

Key features

GPT-5.5 improves performance across coding, knowledge work, scientific research, and computer-based automation. It is designed to plan, execute, verify, and continue tasks across tools with reduced user intervention.

It is especially effective in long-horizon workflows that require reasoning, adaptation, and context tracking. This includes software engineering, research analysis, and enterprise operations involving multiple tools and systems.

Core capabilities:

Advanced agentic coding and autonomous software execution
82.7% accuracy on Terminal-Bench 2.0 command-line workflows
58.6% performance on SWE-Bench Pro real GitHub issue solving
Strong performance on Expert-SWE long-horizon engineering tasks
Improved understanding of full codebase structure and dependencies
Better debugging, refactoring, testing, and validation workflows

Stronger context retention across large systems
Reduced token usage for equivalent coding tasks
High efficiency on Artificial Analysis Coding Index (lower cost per output)
Improved computer-use actions (navigation, typing, tool interaction)
Strong document, spreadsheet, and presentation generation
Better structured business and operational planning support
Improved reasoning in ambiguous multi-step environments
GPT-5.5 Thinking mode for faster complex reasoning
GPT-5.5 Pro for higher accuracy and deeper analysis

Beyond coding, GPT-5.5 supports enterprise workflows such as reporting, financial analysis, communication planning, and structured data interpretation. It converts unstructured inputs into structured outputs across business environments.

Internal usage shows strong adoption, with over 85% of OpenAI employees using Codex weekly. It is used for dataset analysis, risk scoring systems, automated Slack workflows, large-scale document processing, and automated reporting that reduces manual workload.

In ChatGPT, GPT-5.5 Thinking improves speed and clarity in complex tasks such as coding, synthesis, research, and analysis. GPT-5.5 Pro improves structure, depth, and accuracy in business, legal, education, and technical workflows.

Benchmark performance:

GDPval: 84.9% across 44 occupations
OSWorld-Verified: 78.7% real computer environments
Tau2-bench Telecom: 98.0% customer service workflows
FinanceAgent: 60.0%
Investment banking modeling tasks: 88.5%
OfficeQA Pro: 54.1%

Scientific research capabilities

GPT-5.5 improves multi-step scientific workflows that require experimentation, reasoning, and iterative analysis over time. It supports hypothesis testing, data exploration, and result interpretation across long research cycles.

GeneBench: Improved performance in genetics and quantitative biology tasks involving complex datasets, uncertainty, and statistical modeling, often reflecting multi-day scientific workloads.
BixBench: Strong performance in bioinformatics and real-world biomedical data analysis tasks.
Internal testing: Contributed to a new proof in combinatorics related to Ramsey numbers, later verified using formal methods, demonstrating structured reasoning in mathematical research tasks.

Early testers used GPT-5.5 Pro as a research assistant for manuscript review, iterative analysis, hypothesis development, and multi-source reasoning across code and documents. It performs better in long research workflows requiring progressive refinement.

Efficiency and infrastructure

GPT-5.5 was designed to maintain GPT-5.4-level latency while improving intelligence and efficiency. It was co-developed with NVIDIA GB200 and GB300 NVL72 systems.

A major optimization involved dynamic workload balancing instead of fixed chunk processing. Codex analyzed real production traffic patterns and developed improved partitioning methods, increasing token generation speed by over 20%.

The model also contributed to optimizing its own serving infrastructure, improving overall system performance and efficiency.

Safety and cybersecurity

GPT-5.5 introduces stronger safety systems under OpenAI’s Preparedness Framework, especially for cybersecurity-related capabilities. It is classified as “High” in cybersecurity potential.

Safety measures include:

Stronger classifiers for sensitive cyber-related requests
Improved misuse detection systems
Controls for repeated abuse patterns
Authenticated access for verified users
Monitoring of impermissible usage

The model does not reach “Critical” cybersecurity level but shows improvement over GPT-5.4.

OpenAI is expanding Trusted Access for Cyber via Codex, allowing verified users and organizations to access advanced capabilities under stricter security controls. The company is also working with government partners to support protection of critical infrastructure such as energy systems, water networks, and public digital services.

Pricing and availability

GPT-5.5 is rolling out to ChatGPT Plus, Pro, Business, and Enterprise users, as well as Codex. GPT-5.5 Pro is available for Pro, Business, and Enterprise tiers.

In ChatGPT:

GPT-5.5 Thinking: Plus, Pro, Business, Enterprise
GPT-5.5 Pro: Pro, Business, Enterprise

In Codex:

Available across Plus, Pro, Business, Enterprise, Edu, Go
400K context window support
Fast mode: 1.5x speed at 2.5x cost

API (coming soon):

gpt-5.5: $5 input / $30 output per 1M tokens
Batch/Flex: 50% reduced cost
Priority: 2.5x cost
gpt-5.5-pro: $30 input / $180 output per 1M tokens
1M token context window

OpenAI noted that GPT-5.5 is priced higher than GPT-5.4 but delivers higher capability and improved token efficiency across workloads.