Researchers at MIT CSAIL have developed a novel inference technique known as **recursive language models (RLMs)**, designed to let large language models (LLMs) process long prompts without being bound by traditional context windows. The framework allows an LLM to programmatically analyze, decompose, and recursively call upon itself to handle text inputs far longer than the context lengths it was trained on. By treating the long prompt as a manipulable external environment, RLMs point toward more effective solutions for tasks such as codebase analysis, legal review, and multi-step reasoning.
The MIT team’s approach reframes long-context reasoning as a systems problem rather than a matter of expanding context windows or summarizing data. Current models often suffer from “context rot,” in which performance degrades as more information piles up in the context and the relevance of earlier content fades. Alex Zhang, a co-author of the study, emphasized the need to significantly extend the effective context size of general-purpose LLMs, particularly as enterprises increasingly adopt these models for complex, long-horizon tasks.
The RLM framework is built on principles derived from “out-of-core” algorithms, a classical computing method that enables the processing of datasets too large for a computer’s main memory by fetching only necessary chunks from a hard drive. In the case of RLMs, instead of inputting a lengthy prompt into the neural network, the framework stores the text as a variable within a Python environment. Once the text is stored, the LLM operates as a programmer, writing code to interact with this variable. For instance, it may utilize regular expressions to identify specific keywords within large texts, allowing it to retrieve only pertinent information for further analysis.
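To make that pattern concrete, here is a minimal sketch of the "prompt as a Python variable" idea under stated assumptions: the variable names, the example keyword, and the `call_llm` placeholder are illustrative stand-ins, not the researchers' actual implementation.

```python
import re

# Minimal sketch: the long prompt lives as a plain Python variable, and the
# model-as-programmer writes code (here, a regex search) to pull out only the
# slices it needs. `long_prompt`, the keyword, and `call_llm` are placeholders.

long_prompt = "... imagine millions of tokens of contract text here ..."

def find_snippets(text: str, keyword: str, window: int = 400) -> list[str]:
    """Return short excerpts surrounding each occurrence of `keyword`."""
    snippets = []
    for match in re.finditer(re.escape(keyword), text, flags=re.IGNORECASE):
        start = max(0, match.start() - window)
        end = min(len(text), match.end() + window)
        snippets.append(text[start:end])
    return snippets

def call_llm(prompt: str) -> str:
    """Placeholder for a call to the underlying language model."""
    raise NotImplementedError

# Only the relevant excerpts, never the whole document, are handed to the model.
excerpts = find_snippets(long_prompt, "termination clause")
# answer = call_llm("What do these excerpts say about termination?\n" + "\n---\n".join(excerpts))
```

The point of the pattern is that the model only ever reads the slices it asked for, so the raw input never has to fit inside its context window.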
The architecture of RLMs typically involves two distinct agents: a **root language model**, often a powerful variant like **GPT-5**, which orchestrates the process, and a **recursive language model**, generally a faster and more cost-effective model that executes the actual text processing. This design allows RLMs to manage inputs that far exceed the typical context limits of existing models, while appearing seamless to end-users who interact with the system through standard API calls.
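A simplified sketch of that division of labor might look like the following; the `chat` helper and the model names are assumptions standing in for whatever API is actually used, and a real RLM lets the root model write arbitrary code rather than follow a fixed chunk-and-summarize loop.

```python
# Simplified sketch of the root/recursive split. `chat(model, prompt)` stands in
# for a generic chat-completion client; the model names are illustrative.

CHUNK_CHARS = 8_000  # keep each sub-call well inside the smaller model's window

def chat(model: str, prompt: str) -> str:
    """Placeholder for an LLM API call."""
    raise NotImplementedError

def rlm_answer(question: str, text: str) -> str:
    # The cheaper "recursive" model reads the raw chunks...
    chunks = [text[i:i + CHUNK_CHARS] for i in range(0, len(text), CHUNK_CHARS)]
    notes = [
        chat("fast-cheap-model",
             f"Question: {question}\nExtract anything relevant from this chunk:\n{chunk}")
        for chunk in chunks
    ]
    # ...while the powerful "root" model only ever sees the distilled notes.
    return chat("root-model",
                f"Question: {question}\nAnswer using these notes:\n" + "\n".join(notes))
```

To the caller this still looks like a single request; the chunking and sub-calls happen behind the scenes.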
The researchers validated the RLM framework against traditional models and alternative agentic approaches like **CodeAct** and summary agents across various long-context tasks. Notably, the RLM powered by GPT-5 achieved a remarkable score of 91.33% on the **BrowseComp-Plus** benchmark, which involves inputs ranging from 6 to 11 million tokens. In contrast, standard LLMs failed to score any points in the same test. Furthermore, on the **OOLONG-Pairs** benchmark, which grows quadratically in difficulty with input length, the RLM significantly outperformed base models, achieving an F1 score of 58% compared to just 0.04% for the base GPT-5 model.
The findings indicate that while traditional models see performance decline as context complexity grows, RLMs maintain consistently robust performance, particularly on tasks requiring extensive reasoning over dense data. Despite the extra orchestration involved, RLMs also proved cheaper in some cases, coming in at up to three times less than summarization baselines on certain benchmarks. However, the researchers cautioned that deploying RLMs may require custom guardrails to prevent excessive sub-calls or redundant calculations that could inflate costs, one plausible form of which is sketched below.
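The paper does not prescribe a specific guardrail, but a simple version could cap the number of sub-calls and memoize duplicates; the budget of 50 calls and the `_sub_model` placeholder below are assumptions for illustration only.

```python
from functools import lru_cache

# Illustrative guardrail of the kind the researchers say may be needed: cap the
# number of sub-calls and reuse answers to identical sub-prompts so runaway
# recursion cannot inflate cost. Budget and `_sub_model` are assumed values.

MAX_SUB_CALLS = 50
_calls_made = 0

def _sub_model(prompt: str) -> str:
    """Placeholder for a call to the cheaper recursive model."""
    raise NotImplementedError

@lru_cache(maxsize=None)  # identical sub-prompts are answered once, then reused
def guarded_sub_call(prompt: str) -> str:
    global _calls_made
    if _calls_made >= MAX_SUB_CALLS:
        raise RuntimeError("Sub-call budget exhausted; answer from notes gathered so far.")
    _calls_made += 1
    return _sub_model(prompt)
```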
Zhang noted the potential for future models to better manage computational budgets, suggesting that companies like **Prime Intellect** are already looking to incorporate RLM techniques into their training processes. This could mitigate the issues posed by outlier scenarios where models may engage in inefficient behaviors. Looking ahead, RLMs could prove beneficial not only for tasks involving complex contextual data but also for enhancing chatbot interactions by managing long chat histories effectively.
Ultimately, the development of recursive language models represents a promising advancement in the field of AI, offering a new framework that complements existing retrieval methods while addressing the limitations of current LLMs. As enterprise architects evaluate the implications of RLMs, the technology stands to reshape the landscape of information processing and reasoning in artificial intelligence.


















































