Connect with us

Hi, what are you looking for?

AI Technology

ZTE Unveils AI Factory Solution, Achieving Major Breakthroughs in Compute Efficiency

ZTE’s “AI Factory” achieves up to 20x token throughput improvement, optimizing AI infrastructure with innovative compute-network co-design for global deployment.

At the recent MWC Barcelona 2026, Chen Xinyu, Vice President of ZTE, unveiled the company’s ambitious “AI Factory” full-stack solution, designed to tackle the increasing demands for computing power posed by large AI models. This innovative approach employs end-to-end co-design techniques to reshape traditional hardware infrastructures, aiming to provide global customers with an AI infrastructure that optimizes total cost of ownership (TCO) over its lifecycle.

As artificial intelligence applications continue to expand, the infrastructure demands associated with large models have reached unprecedented levels. Chen highlighted that conventional methods of hardware stacking are no longer sufficient to harmonize scale, efficiency, and cost. “We need comprehensive architectural reconstruction to maximize resource compute efficiency and accelerate the large-scale deployment of AI,” he stated. The “AI Factory” employs a full-stack co-design strategy, integrating advanced AI servers, hyper-nodes, a lossless network, the AI Booster operating system, and development platforms like the AI Agent Studio, all augmented by customized IDC infrastructure.

Central to this initiative is the concept of compute-network co-design, aimed at breaking the physical limits of computing density and scale. Chen explained two main breakthroughs: vertical scaling, which enhances density, and horizontal scaling, which expands the overall system capacity. The vertical scaling utilizes an OEX (Orthogonal Electrical eXchange) architecture within hyper-nodes, allowing for physical connections between compute and switching trays through vertical crosslinking. This design minimizes the reliance on high-speed cables, significantly boosting compute density while ensuring stable communication and eliminating risks associated with cable loosening. Enhanced by ZTE’s proprietary high-capacity switching chip, this system is capable of supporting terabyte-level bandwidth with latency under 100 nanoseconds, compatible with global standards.

On the horizontal scaling front, the “AI Factory” constructs clusters within single data centers through Scale-Out networks and further aggregates compute power across multiple data centers via Scale-Across networks. This approach facilitates the establishment of scalable AI factory foundations, allowing for extensive deployment of AI solutions.

The software-hardware co-design component of the “AI Factory” is equally critical in enhancing energy efficiency. Chen emphasized that to fully exploit hardware capabilities, a deeply integrated software system is essential. ZTE’s software stack acts as the operating system for intelligent computing resource pools, transforming disparate physical resources into cohesive compute services. Collaborating with major GPU manufacturers, ZTE incorporates techniques such as framework optimization and intelligent scheduling to significantly elevate performance metrics. Chen noted that these optimizations can lead to an increase in token throughput by five to twenty times, a substantial improvement in efficiency.

To address the complexities of large-scale cluster engineering, ZTE has introduced the “AI Factory Twin Platform.” This platform leverages digital twin technology to simulate various aspects of AI infrastructure, such as hardware selection and thermal management, enabling optimized performance and cost-effective design throughout the lifecycle of AI factories.

As a multifaceted engineering undertaking, the “AI Factory” encompasses various components, including chips, algorithms, hardware, clusters, software, and data centers. Chen emphasized ZTE’s commitment to applying its vast experience in communication systems and large-scale networking to advance AI infrastructure. “Extreme co-design is our core philosophy,” he concluded, underscoring ZTE’s intention to maintain an open and collaborative approach with global partners to foster a future-oriented intelligent computing ecosystem. This initiative aims not only to democratize AI technology but also to drive significant advances across various sectors of the economy.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

AI in medicine is set to skyrocket from $29.27 billion in 2026 to $3.36 trillion by 2040, driven by a 40.3% CAGR and innovations...

AI Regulation

As Congress stalls on AI regulation, 97% of Americans support state-level protections against rising threats, including AI-enabled fraud and unsafe technologies.

AI Technology

OpenAI plans a transformative $20 billion investment in Cerebras chips, aiming to enhance AI capabilities and secure a significant equity stake in the startup.

AI Research

Norm Ai launches the Legal AGI Lab to develop essential legal frameworks for AI integration in high-stakes sectors like healthcare and finance.

Top Stories

Anthropic's Mythos model boosts software engineering performance, prompting a potential reevaluation of IT services growth projections and escalating disruption risks.

AI Government

Japan's Justice Ministry launches a study panel to assess civil liability for unauthorized AI-generated content, meeting five times from April to July.

AI Finance

OpenAI enhances ChatGPT for Excel with new AI tools to streamline finance workflows, reducing manual effort and increasing productivity for enterprise teams.

AI Cybersecurity

Rubrik Zero Labs reveals 86% of organizations fear AI agents will surpass their security measures, highlighting urgent oversight challenges in an evolving landscape.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.