Connect with us

Hi, what are you looking for?

AI Technology

ZTE Unveils AI Factory Solution, Achieving Major Breakthroughs in Compute Efficiency

ZTE’s “AI Factory” achieves up to 20x token throughput improvement, optimizing AI infrastructure with innovative compute-network co-design for global deployment.

At the recent MWC Barcelona 2026, Chen Xinyu, Vice President of ZTE, unveiled the company’s ambitious “AI Factory” full-stack solution, designed to tackle the increasing demands for computing power posed by large AI models. This innovative approach employs end-to-end co-design techniques to reshape traditional hardware infrastructures, aiming to provide global customers with an AI infrastructure that optimizes total cost of ownership (TCO) over its lifecycle.

As artificial intelligence applications continue to expand, the infrastructure demands associated with large models have reached unprecedented levels. Chen highlighted that conventional methods of hardware stacking are no longer sufficient to harmonize scale, efficiency, and cost. “We need comprehensive architectural reconstruction to maximize resource compute efficiency and accelerate the large-scale deployment of AI,” he stated. The “AI Factory” employs a full-stack co-design strategy, integrating advanced AI servers, hyper-nodes, a lossless network, the AI Booster operating system, and development platforms like the AI Agent Studio, all augmented by customized IDC infrastructure.

Central to this initiative is the concept of compute-network co-design, aimed at breaking the physical limits of computing density and scale. Chen explained two main breakthroughs: vertical scaling, which enhances density, and horizontal scaling, which expands the overall system capacity. The vertical scaling utilizes an OEX (Orthogonal Electrical eXchange) architecture within hyper-nodes, allowing for physical connections between compute and switching trays through vertical crosslinking. This design minimizes the reliance on high-speed cables, significantly boosting compute density while ensuring stable communication and eliminating risks associated with cable loosening. Enhanced by ZTE’s proprietary high-capacity switching chip, this system is capable of supporting terabyte-level bandwidth with latency under 100 nanoseconds, compatible with global standards.

On the horizontal scaling front, the “AI Factory” constructs clusters within single data centers through Scale-Out networks and further aggregates compute power across multiple data centers via Scale-Across networks. This approach facilitates the establishment of scalable AI factory foundations, allowing for extensive deployment of AI solutions.

The software-hardware co-design component of the “AI Factory” is equally critical in enhancing energy efficiency. Chen emphasized that to fully exploit hardware capabilities, a deeply integrated software system is essential. ZTE’s software stack acts as the operating system for intelligent computing resource pools, transforming disparate physical resources into cohesive compute services. Collaborating with major GPU manufacturers, ZTE incorporates techniques such as framework optimization and intelligent scheduling to significantly elevate performance metrics. Chen noted that these optimizations can lead to an increase in token throughput by five to twenty times, a substantial improvement in efficiency.

To address the complexities of large-scale cluster engineering, ZTE has introduced the “AI Factory Twin Platform.” This platform leverages digital twin technology to simulate various aspects of AI infrastructure, such as hardware selection and thermal management, enabling optimized performance and cost-effective design throughout the lifecycle of AI factories.

As a multifaceted engineering undertaking, the “AI Factory” encompasses various components, including chips, algorithms, hardware, clusters, software, and data centers. Chen emphasized ZTE’s commitment to applying its vast experience in communication systems and large-scale networking to advance AI infrastructure. “Extreme co-design is our core philosophy,” he concluded, underscoring ZTE’s intention to maintain an open and collaborative approach with global partners to foster a future-oriented intelligent computing ecosystem. This initiative aims not only to democratize AI technology but also to drive significant advances across various sectors of the economy.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Cohere Health launches a Global Capability Center in Hyderabad to enhance AI-driven clinical intelligence and improve global healthcare outcomes.

AI Generative

The global AI market is projected to soar from $638.2 billion in 2024 to $3.6 trillion by 2033, driven by automation and generative AI...

AI Regulation

DOJ's September 2024 update mandates companies to integrate AI risk management into compliance programs, emphasizing accountability amid rising enforcement actions.

AI Business

Cushman & Wakefield reveals AI is redefining retail, transforming physical stores into adaptive environments that enhance customer engagement and profitability.

AI Generative

MIT is now offering seven free AI courses through its OpenCourseWare, catering to all skill levels, to meet the surging demand for AI literacy...

AI Regulation

Starting March 1, 2026, the 2025 AI Law mandates clear labeling for all AI-generated images and videos to combat misinformation and enhance transparency.

AI Cybersecurity

Cydome's Maritime Cyber Trends Report reveals a shocking 60% of software vulnerabilities are weaponized within 48 hours, urging shipping firms to enhance AI-driven cybersecurity.

AI Tools

UK police forces face criticism over AI tools like Microsoft's Copilot and predictive analytics, as £4M investment raises concerns about bias and accountability.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.