
DeepSeek’s Engram Breakthrough Enhances AI Performance by 3.4-5 Points, Reduces HBM Dependency

DeepSeek’s Engram boosts AI performance by 3.4-5 points while reducing reliance on high-bandwidth memory, revolutionizing efficiency in long-context tasks.

DeepSeek has unveiled a new technical methodology named Engram, which offers a novel approach for artificial intelligence models to utilize a queryable database of information stored in system memory. Released on the company’s GitHub page, the paper outlines how Engram improves performance on long-context queries by enabling AI models to commit data sequences to static memory. This reduces the computational load on graphics processing units (GPUs), freeing them for more complex tasks and decreasing reliance on high-bandwidth memory (HBM), which is increasingly under supply pressure.

The research describes how N-grams, statistical sequences of words, are integrated into the models’ neural networks, forming a queryable memory bank. Engram allows AI models to access facts directly instead of reasoning them out, which is computationally expensive. By alleviating the need for GPUs to handle basic memory tasks, DeepSeek aims to address the ongoing demand for HBM, particularly as the supply remains constrained.
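At a conceptual level, an n-gram memory bank can be sketched as a hash-addressed embedding table: a short token sequence is hashed to a slot, and the stored vector is fetched in constant time instead of being recomputed. The sketch below is illustrative only; the slot count, embedding width, and function names are assumptions, not DeepSeek's actual implementation.

```python
import numpy as np

# Illustrative n-gram memory bank (NOT DeepSeek's code): table size,
# embedding width, and hashing scheme are all assumptions for the sketch.
rng = np.random.default_rng(0)

NUM_SLOTS = 2**16   # size of the static memory table (assumption)
DIM = 64            # embedding width (assumption)
memory_bank = rng.standard_normal((NUM_SLOTS, DIM)).astype(np.float32)

def ngram_key(tokens, n=2):
    """Hash the trailing n-gram of token IDs to a slot index."""
    return hash(tuple(tokens[-n:])) % NUM_SLOTS

def lookup(tokens):
    """O(1) retrieval: fetch the stored vector instead of recomputing it."""
    return memory_bank[ngram_key(tokens)]

vec = lookup([101, 2054, 2003])  # the last two token IDs form the 2-gram key
```

Because the key depends only on the trailing n-gram, repeated sequences hit the same slot, which is what lets the model retrieve a memorized pattern rather than reconstruct it through computation.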

According to the paper, an Engram-based model scaled to nearly 27 billion parameters demonstrated superior performance in long-context training compared to standard Mixture of Experts (MoE) architectures. Traditional MoE models often require extensive reasoning to reconstruct data with each query reference, leading to computational waste. Engram’s architecture permits the storage of facts externally, enhancing efficiency.

The Engram model allows AI systems to simply check, “Do I already have this data?” instead of engaging in extensive reasoning processes for each query. The paper emphasizes that this method minimizes unnecessary computations, freeing up resources for higher-level reasoning tasks.
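The "check memory first, compute only on a miss" pattern the paper describes can be illustrated with a plain lookup-before-compute gate. Everything below is a generic caching sketch, not Engram's mechanism; the cost function is a stand-in for an expensive forward pass.

```python
# Illustrative check-before-compute gate (not DeepSeek's code).
cache = {}

def expensive_reasoning(query):
    # Stand-in for a costly pass through expert layers.
    return sum(ord(c) for c in query)

def answer(query):
    if query in cache:                   # "Do I already have this data?"
        return cache[query]              # cheap memory fetch
    result = expensive_reasoning(query)  # fall back to computation
    cache[query] = result                # commit to static memory for next time
    return result
```

The first call pays the compute cost; every repeat of the same query is a memory fetch, which is the resource-freeing behavior the paper attributes to Engram.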

In a comparative analysis, DeepSeek found that reallocating around 20%–25% of the sparse parameter budget to Engram optimized performance, achieving results comparable to pure MoE models. This suggests that balancing memory and computational resources could be key in designing efficient AI systems moving forward.
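As a back-of-the-envelope illustration of that reallocation, splitting a sparse parameter budget at the upper end of the reported 20%–25% range looks like this (the figures are derived from the article's numbers, not from the paper's actual configuration):

```python
# Illustrative budget split per the article's reported 20-25% reallocation.
total_sparse_params = 27e9   # ~27B-parameter scale cited in the article
engram_fraction = 0.25       # upper end of the reported range

engram_params = total_sparse_params * engram_fraction
moe_params = total_sparse_params - engram_params
print(f"Engram: {engram_params/1e9:.2f}B, MoE experts: {moe_params/1e9:.2f}B")
```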

DeepSeek’s exploration extended to what they termed the “Infinite Memory Regime,” where they maintained a fixed computational budget while attaching a near-infinite number of conditional memory parameters. This led to a linear performance increase with memory size, indicating that as memory expands, performance can improve without necessitating higher computational expenses.

These findings could have substantial implications for the AI industry, as reliance on HBM may lessen if AI models can efficiently leverage system memory through methodologies like Engram. A 27B-parameter Engram model outperformed a standard 27B MoE model on knowledge-intensive tasks by 3.4 to 4 points, with a 3.7 to 5 point improvement on reasoning tasks. Notably, on long-context benchmarks, the Engram model’s accuracy reached 97%, a significant leap from the MoE model’s 84.2%.

As DeepSeek prepares to announce a new AI model in the coming weeks, the implementation of Engram may redefine efficiency standards in AI applications. However, the broader market implications could also raise concerns about the existing DRAM supply crisis, as the shift toward system DRAM might exacerbate ongoing shortages. With DeepSeek suggesting that conditional memory functions will be essential for next-generation models, the future direction of AI development could hinge on the successful deployment of these methodologies.

In summary, if Engram delivers as intended in real-world applications, it could signify a pivotal moment for AI technology, moving away from traditional memory constraints and paving the way for more robust and efficient models.

Written By: AiPressa Staff

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.


© 2025 AIPressa · Part of Buzzora Media · All rights reserved.