

AMD Advocates Integrated AI Compute with EPYC CPUs for 50% Cost Savings and Enhanced Performance

AMD’s EPYC processors enable firms like Kakao Enterprise to cut AI infrastructure costs by 50% while boosting performance by 30%, redefining compute strategies for AI.

Graphics processing units (GPUs) have emerged as the primary upgrade for companies enhancing their AI systems, particularly in the inferencing stage, where trained models produce outputs from new data. However, semiconductor firm AMD warns that solely depending on GPUs can hinder performance and escalate costs.

In a recent interview with Newsbytes.PH, AMD’s Asia Pacific general manager, Alexey Navolokin, emphasized the growing need for effective coordination among CPUs, GPUs, memory, and networking as AI workloads expand and agentic AI systems shift towards real-world applications.

“Today’s large models operate across clusters of GPUs that must work in parallel and exchange data constantly,” Navolokin explained. He noted that overall performance hinges not only on GPU speed but also on the efficiency with which data is transferred and computation is coordinated across the entire system architecture.

Navolokin pointed out a prevalent misconception that GPUs serve as the singular powerhouse for AI inferencing. He highlighted that modern AI models typically exceed the capacity of a single device, necessitating substantial support from host CPUs to facilitate data movement, synchronization, and latency-sensitive tasks. “A fast CPU keeps the GPU fully utilized, reduces overhead in the inference pipeline, and cuts end-to-end latency,” he stated, adding that even minor reductions in CPU delays can significantly enhance application responsiveness.

Tokenization, the process of converting inputs into numerical units, is heavily reliant on the interaction between CPU and GPU. “Inference runs token by token, and tasks such as tokenization, batching, and synchronization sit directly on the critical path,” Navolokin said. “Delays on the host CPU can slow the entire response.”
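The point about the critical path can be made concrete with a minimal sketch. The `tokenize` and `gpu_forward` functions below are hypothetical stand-ins, not any real framework's API: the sketch only illustrates how token-by-token generation alternates CPU-side work (tokenization, bookkeeping) with device compute, so any host-CPU delay stalls every step of the loop.

```python
def tokenize(text: str) -> list[int]:
    # CPU-side: convert raw input into token IDs (toy whitespace scheme)
    return [len(word) for word in text.split()]

def gpu_forward(tokens: list[int]) -> int:
    # Stand-in for the GPU forward pass that predicts the next token
    return sum(tokens) % 1000

def generate(prompt: str, max_new_tokens: int = 4) -> list[int]:
    tokens = tokenize(prompt)           # CPU work before the GPU can start
    for _ in range(max_new_tokens):     # inference runs token by token
        next_tok = gpu_forward(tokens)  # device compute
        tokens.append(next_tok)         # CPU-side bookkeeping between steps
    return tokens

print(generate("a fast CPU keeps the GPU busy"))
```

Because every iteration passes through the CPU-side append and re-dispatch, shaving even microseconds off those host steps compounds across the hundreds of tokens in a typical response.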

Beyond performance, Navolokin argued that optimizing CPU-GPU balance can lead to lower infrastructure costs by increasing GPU utilization and decreasing hardware requirements. “Higher efficiency enables teams to meet demand with fewer CPU cores or GPU instances,” he noted.
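That efficiency claim reduces to back-of-the-envelope capacity math. The throughput figures below are invented for illustration, not AMD benchmarks: the number of GPU instances needed to serve a fixed demand falls as utilization rises.

```python
import math

def instances_needed(demand_tok_per_s: float, peak_tok_per_s: float,
                     utilization: float) -> int:
    # Effective throughput per GPU is its peak rate scaled by how
    # consistently the host CPU can keep it fed with work.
    effective = peak_tok_per_s * utilization
    return math.ceil(demand_tok_per_s / effective)

# Hypothetical fleet: 100,000 tokens/s of demand, 10,000 tokens/s peak per GPU
print(instances_needed(100_000, 10_000, 0.50))  # 20 GPUs at 50% utilization
print(instances_needed(100_000, 10_000, 0.80))  # 13 GPUs at 80% utilization
```

In this toy example, lifting utilization from 50% to 80% serves the same demand with roughly a third fewer accelerators.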

He cited a case study involving South Korean IT firm Kakao Enterprise, which reportedly reduced its total cost of ownership by 50% and its server count by 60%, while improving AI and cloud performance by 30% after deploying AMD’s EPYC processors.

The fifth-generation EPYC processors, according to Navolokin, can deliver comparable integer performance to earlier systems while using up to 86% fewer racks, effectively lowering both power consumption and software licensing requirements. He added that demand for capable CPUs is amplified by the rise of agentic AI systems, which are designed to plan, reason, and act autonomously.
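To see how a rack-count figure like that translates into a power footprint, here is a toy consolidation calculation; all input numbers (fleet size, per-rack power) are hypothetical, not the article's measurements.

```python
def consolidated(old_racks: int, reduction: float,
                 kw_per_rack: float) -> tuple[int, float]:
    # An 86% rack reduction means the new footprint is 14% of the old one.
    new_racks = round(old_racks * (1 - reduction))
    power_saved_kw = (old_racks - new_racks) * kw_per_rack
    return new_racks, power_saved_kw

racks, saved = consolidated(old_racks=100, reduction=0.86, kw_per_rack=12.0)
print(racks, saved)  # 14 racks remain; 1032.0 kW of rack power freed
```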

“These systems generate significantly more CPU-side work than traditional inference,” Navolokin explained. “Tasks such as retrieval, prompt preparation, multi-model routing, and synchronization are CPU-driven.” In these scenarios, the CPU functions as a control node across distributed resources that span data centers, cloud platforms, and edge systems.

AMD is positioning its EPYC processors as host CPUs for these demanding workloads. The latest EPYC 9005 Series boasts up to 192 cores, expanded AVX-512 execution, DDR5-6400 memory support, and PCIe Gen 5 I/O—features designed to support large-scale inferencing and GPU-accelerated systems. Navolokin mentioned that this latest generation shows a 37% improvement in instructions per cycle for machine learning and high-performance computing workloads compared to previous EPYC processors.

He also referenced Malaysian reinsurance firm Labuan Re, which anticipates reducing its insurance assessment turnaround time from weeks to less than a day after migrating to an EPYC-powered AI platform.

As AI deployments extend beyond centralized data centers, Navolokin urged organizations to rethink their infrastructure design. “The priority should not be the performance of a single compute resource, but the ability to deploy AI consistently across heterogeneous environments,” he advised. He underscored the importance of open platforms and distributed compute strategies, noting that real-time inference often runs more efficiently on edge devices or AI PCs closer to data sources.

“Success in inferencing is no longer defined solely by raw compute power,” Navolokin concluded. “It depends on latency, efficiency, and the ability to operate across data center, cloud, and edge environments.”

Written by the AiPressa Staff



© 2025 AIPressa · Part of Buzzora Media · All rights reserved.