Published on 2026-02-10

OpenAI launches GPT-5.3-Codex, most capable agentic coding model, as Codex App hits 1M downloads


Tags: orchestration, safety, agents, MCP, A2A

OpenAI launches GPT-5.3-Codex, most capable agentic coding model, as Codex App hits 1M downloads

OpenAI dropped GPT-5.3-Codex on February 5, its most advanced agentic coding model yet. It builds on GPT-5.2 with 25% faster inference and posts top benchmark scores, including 56.8% on SWE-Bench Pro and 77.3% on Terminal-Bench 2.0.[1] It handles the full software lifecycle, from autonomous app and game building to cybersecurity hardening, earning the first "High capability" rating, accompanied by $10M in API credits for cyber defense.[1] The model even helped create itself, per OpenAI's cheeky quote.[1]

The new Codex App, launched February 2, racked up over 1M downloads in its first week with 60% week-over-week user growth, and is now available in CLI, Cursor, GitHub, and VS Code.[2][3] It's temporarily free for ChatGPT Free/Go users, with Sam Altman signaling plans to keep it accessible post-promo.[3] X is buzzing with viral demos of its agentic feats, fueling excitement for dev productivity boosts.

Perplexity upgrades Deep Research to Anthropic's Claude Opus 4.6, claims benchmark leadership

Anthropic unveiled Claude Opus 4.6 on February 5, packing upgrades in coding and agentic planning plus a 1M-token context window in beta. It leads benchmarks with a SOTA score on Terminal-Bench 2.0, 90.2% on BigLaw Bench, and a lead over GPT-5.2 on GDPval.[4] Perplexity wasted no time, upgrading Deep Research for Max users ($167/mo) immediately, with a rollout to Pro soon after. It touts SOTA results on Google's DSQA and on internal benchmarks, along with adaptive thinking and more output tokens.[5][6]

Perplexity's move promises deeper research capabilities, as they claim: "Perplexity Deep Research now runs on Opus 4.6, improving our existing state-of-the-art results."[6] Max users on X are already raving about instant access and enhanced outputs, sparking talks of Perplexity pulling ahead in the benchmark wars.

X rolls out viral Grok-themed animation for like button

X flipped the like button into a spectacle around February 9, triggering a giant Grok logo animation on taps, a temporary gimmick that has users spamming likes to show it off.[7][8][9] Echoing past hits like the SpaceX rocket effect, it drove massive engagement within hours, with videos flooding feeds.

X reactions are pure hype: folks yelling "Hit the ❤️ button and see!" and thanking the team before it vanishes, turning mundane likes into a viral party.

Elon Musk warns of 'woke virus' in AI, prioritizes maximally truth-seeking AI for safety

Elon Musk resurfaced in a viral clip, slamming the "woke virus" in AI as a misalignment risk worse than HAL from 2001: A Space Odyssey, pushing for "maximally truth-seeking AI" as the ultimate safety play.[10][11] "My top concern for AI safety is that we need a maximally truth-seeking AI... It is very important to have truth," he said, reviving his TruthGPT crusade against biased models.

X is lit with shares and debates on truth vs. political correctness in alignment, underscoring ongoing tensions in AI ethics.

What This Means For Your Business

Agentic models like GPT-5.3-Codex and Claude Opus 4.6 are supercharging coding and research, but raw power alone won't deliver—your workflows need smart orchestration to avoid chaos. At Up North AI, our multi-agent orchestration (MCP/A2A) and agent workforce design turn these tools into reliable teams, handling full lifecycles from dev to cyber defense without the hype fatigue.

Elon's truth-seeking call and X's Grok fun highlight trust gaps: biased or flashy AI erodes outcomes. We specialize in AI quality & trust review to audit for alignment, plus outcome engineering to ensure business ROI. Code is free. Judgment isn't.

Key takeaway: Prioritize orchestrated agents and rigorous trust reviews now to harness these leaps without the pitfalls—before competitors do.


Sources

[1] https://openai.com/index/introducing-gpt-5-3-codex
[2] https://openai.com/index/introducing-the-codex-app
[3] https://venturebeat.com/technology/openais-new-codex-app-hits-1m-downloads-in-first-week-but-limits-may-be
[4] https://www.anthropic.com/news/claude-opus-4-6
[5] https://www.perplexity.ai/pro
[6] https://www.threads.com/@perplexity/post/DUWOU4dAT5E
[7] https://x.com/suresh_maurya_/status/2020712232712343724
[8] https://x.com/anandchokshi19/status/2020690240395256178
[9] https://x.com/RoRoFli/status/2020960792938451452
[10] https://x.com/XFreeze/status/2020738262432637398
[11] https://www.facebook.com/calfkickercom1/posts/elon-musk-discusses-the-potential-dangers-of-artificial-intelligence-development/1491016953027712
