This Month in Tech: January 2026 | Critical Thinking Officer

How Cursor Shipped its Coding Agent to Production
Cursor’s team shipped Composer, a coding agent that completes most tasks in under 30 seconds, by solving three gnarly engineering problems: the “diff problem” (models aren’t great at making clean code edits, so they trained specifically on edit trajectories and search-replace operations), compounded latency (each step in the agent loop adds delay, so they used MoE architecture, speculative decoding with draft models, and aggressive context compaction to stay fast), and sandboxing at scale (th…
The Adolescence of Technology
Dario Amodei, CEO of Anthropic, argues that humanity is entering a “technological adolescence” where AI could soon surpass Nobel Prize-level intelligence, creating existential risks ranging from autonomous “rogue” behavior to the democratic collapse of labor markets. The urgent threat of AI-enabled biological weapons and high-tech autocracy can be used to create a permanent global surveillance state.
The future of agentic coding: conductors to orchestrators
The future of agentic coding will transform software engineers from direct implementers into “conductors” guiding single AI assistants and increasingly into “orchestrators” managing autonomous fleets of agents to speed up software development.
2025: The year in LLMs
2025 was when LLMs learned to reason, allowing models to tackle complex, multi-step tasks, which in turn drove the widespread adoption of highly capable AI agents. “Coding agents” came to life, autonomously writing, executing, and debugging code across command-line interfaces and even mobile phones. The competitive landscape shifted a lot as Chinese open-weight models dominated capability rankings, and Google’s Gemini made large strides with new models and features like Nano Banana, challengi…
Software in a post-abundance world
AI is transforming software production by making it cheaper and faster, leading to smaller teams, reduced margins, and shifting value from code creation to decision-making and integration. This shift challenges old economic models where large teams thrived under abundant capital and growth incentives, pushing the industry towards leaner operations and disposable software. Capital now focuses on leveraging AI in areas like healthcare and manufacturing, where software integration with domain ex…
The next managerial revolution
The cost of doing is approaching zero. The benefit of deciding is higher than ever. We are the bottleneck now. AI is too fast, too good at building. The skill that matters is shifting from execution to orchestration. Founders don’t need to be operators anymore. They need to be systems architects.
Introducing Helix 02: Full-Body Autonomy
Helix 2 is a single neural system that can control a full humanoid robot body directly from pixels. It enables continuous, room-scale autonomy that seamlessly blends walking and manipulation. This article details what Helix 2 is capable of. It contains several clips of the Figure 03 robot performing tasks while running on the new system.
Claude’s new constitution
Anthropic has published a new constitution for Claude that details its vision for the AI’s values and behavior. This foundational document is central to Claude’s training process and outlines Claude’s core priorities: being broadly safe, ethical, compliant with Anthropic’s guidelines, and genuinely helpful, with safety prioritized in conflicts.
The Agentic AI Handbook: Production-Ready Patterns
This is a collection of 113 production-ready patterns derived from real-world systems for building and deploying reliable AI agents.
Agent-native Architectures
Claude Code demonstrated that large language models with access to bash and file tools can accomplish complex multi-step tasks autonomously by operating in a loop until an objective is achieved. Good coding agents are really good general-purpose agents. The Claude Code software development kit makes this accessible. It allows developers to build applications where features are achieved by agents with tools. This opens up a new field of software that works the way Claude Code works, but in cat…
AI Coding Assistants Are Getting Worse
AI coding assistants, after an initial period of improvement, have recently begun to decline in quality. The primary issue with newer models, such as GPT-5, is their tendency towards “silent but deadly” failures, where code runs without errors but produces incorrect or misleading results by avoiding safety checks or generating plausible fake data. This degradation is likely caused by current training methods that prioritize user acceptance, even for subtly incorrect code.
Claude Code and What Comes Next
AIs are now capable of real, sustained work that actually matters, and this is starting to change how developers approach tasks.
A field guide to sandboxes for AI
AI agents often need to execute untrusted code, so they use sandboxes. Effective sandboxing requires consideration of three distinct aspects: the isolation boundary, access policy, and execution lifecycle. There are various types of sandboxes, like containers (sharing the host kernel, insufficient for hostile code), gVisor (userspace kernel interposition), microVMs (guest kernel via hardware virtualization), and runtime sandboxes (no direct syscall access).
OpenAI bets big on audio as Silicon Valley declares war on screens
OpenAI has unified several engineering, product, and research teams over the past couple of months to overhaul its audio models. The company is reportedly preparing to launch an audio-first personal device in about a year. Its new audio model will sound more natural, be able to handle interruptions, and even speak when users are talking. The entire tech industry seems to be headed toward a future where screens become background noise and audio takes center stage.
LLMs vs. Marketplaces
Google has long tried to take profit away from lead gen marketplaces. OpenAI is now going for that pool with ChatGPT. Its technology makes search much better by understanding intent, and it can transact on users’ behalf, allowing the company to push further into the marketplace stack than Google and do everything a transactional marketplace can. Many marketplaces will come out of this transition just fine.
Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation!
Qwen3-TTS is a series of speech generation models that supports voice cloning, voice design, ultra-high-quality human-like speech generation, and natural language-based voice control. The models support 10 mainstream languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian) along with various dialects. They exhibit strong contextual understanding, allowing them to adapt tone, rhythm, and emotional expression based on instructions and text semant…
Salesforce Adopts Cursor at Scale
Over 90% of Salesforce’s 20,000 engineers now use Cursor in daily workflows, improving development speed and code quality. The shift has helped power product releases like Agentforce and signals broader industry trends in AI-assisted software engineering.
Global AI computing capacity is doubling every 7 months
Total available capacity from AI chips across all major designers has grown approximately 3.3 times per year since 2022. This has enabled larger-scale model development and consumer adoption. Nvidia AI chips currently account for over 60% of total compute. Google and Amazon make up much of the remainder.
Evaluating AI Agents in Production
Anthropic’s practical approaches to agent evaluation emphasize pre-deployment tests that simulate real-world conditions and reduce failures in complex, multi-turn agent systems.

Need help with strategic technology?