AMD Advocates Integrated AI Compute with EPYC CPUs for 50% Cost Savings and Enhanced Performance

AMD’s EPYC processors enable firms like Kakao Enterprise to cut AI infrastructure costs by 50% while boosting performance by 30%, redefining compute strategies for AI.

Graphics processing units (GPUs) have emerged as the primary upgrade for companies enhancing their AI systems, particularly in the inferencing stage, where trained models produce outputs from new data. However, semiconductor firm AMD warns that solely depending on GPUs can hinder performance and escalate costs.

In a recent interview with Newsbytes.PH, AMD’s Asia Pacific general manager, Alexey Navolokin, emphasized the growing need for effective coordination among CPUs, GPUs, memory, and networking as AI workloads expand and agentic AI systems shift towards real-world applications.

“Today’s large models operate across clusters of GPUs that must work in parallel and exchange data constantly,” Navolokin explained. He noted that overall performance hinges not only on GPU speed but also on the efficiency with which data is transferred and computation is coordinated across the entire system architecture.

Navolokin pointed out a prevalent misconception that GPUs serve as the singular powerhouse for AI inferencing. He highlighted that modern AI models typically exceed the capacity of a single device, necessitating substantial support from host CPUs to facilitate data movement, synchronization, and latency-sensitive tasks. “A fast CPU keeps the GPU fully utilized, reduces overhead in the inference pipeline, and cuts end-to-end latency,” he stated, adding that even minor reductions in CPU delays can significantly enhance application responsiveness.
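The idea that "a fast CPU keeps the GPU fully utilized" can be sketched as a producer/consumer pipeline, where host-side preprocessing runs ahead of device compute through a bounded queue. This is a minimal illustration with simulated stages, not real device code; the functions, sleep durations, and queue depth are arbitrary stand-ins, not anything AMD describes:

```python
import queue
import threading
import time

def preprocess(sample):
    # Host-side (CPU) work: tokenization, batching, staging into device buffers.
    time.sleep(0.001)  # stand-in for real preprocessing cost
    return sample

def gpu_compute(batch):
    # Stand-in for a GPU kernel; the goal is to keep this stage continuously busy.
    time.sleep(0.002)
    return batch * 2

def pipelined_inference(samples, depth=4):
    """Overlap CPU preprocessing with compute via a bounded queue."""
    q = queue.Queue(maxsize=depth)

    def producer():
        for s in samples:
            q.put(preprocess(s))   # CPU stage runs ahead of the compute stage
        q.put(None)                # sentinel: no more work

    threading.Thread(target=producer, daemon=True).start()
    results = []
    while (item := q.get()) is not None:
        results.append(gpu_compute(item))  # compute stage rarely waits on the host
    return results

print(pipelined_inference(range(8)))
```

If the producer (CPU) stage is slower than the compute stage, the queue drains and the compute stage idles, which is exactly the utilization loss the article describes.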

Tokenization, the process of converting inputs into numerical units, is heavily reliant on the interaction between CPU and GPU. “Inference runs token by token, and tasks such as tokenization, batching, and synchronization sit directly on the critical path,” Navolokin said. “Delays on the host CPU can slow the entire response.”
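Because autoregressive inference emits one token per step, any fixed host-side cost is paid on every step. A toy latency model makes the point; the per-token timings below are hypothetical, chosen only to show how small host delays compound over a long response:

```python
def end_to_end_latency_ms(tokens, gpu_ms_per_token, host_ms_per_token):
    # Decoding runs token by token, so host work (tokenization, batching,
    # synchronization) sits on the critical path of every single step.
    return tokens * (gpu_ms_per_token + host_ms_per_token)

# Hypothetical numbers: a 500-token response, 10 ms of GPU time per token.
slow_host = end_to_end_latency_ms(500, 10.0, 2.0)  # 2 ms host overhead per token
fast_host = end_to_end_latency_ms(500, 10.0, 0.5)  # 0.5 ms host overhead per token
print(slow_host, fast_host)
```

Under these assumed figures, shaving 1.5 ms of host overhead per token removes 750 ms from the end-to-end response, without touching the GPU at all.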

Beyond performance, Navolokin argued that optimizing CPU-GPU balance can lead to lower infrastructure costs by increasing GPU utilization and decreasing hardware requirements. “Higher efficiency enables teams to meet demand with fewer CPU cores or GPU instances,” he noted.
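The "fewer instances through higher efficiency" argument reduces to a capacity-planning calculation. The throughput and utilization figures below are invented for illustration, not taken from AMD:

```python
import math

def instances_needed(target_qps, peak_qps_per_instance, utilization):
    # Effective throughput is peak throughput scaled by achieved utilization,
    # so raising utilization directly shrinks the fleet needed to meet demand.
    return math.ceil(target_qps / (peak_qps_per_instance * utilization))

# Hypothetical fleet: 10,000 queries/s target, 120 qps peak per GPU instance.
before = instances_needed(10_000, 120, 0.55)  # underutilized GPUs
after = instances_needed(10_000, 120, 0.85)   # better CPU-GPU balance
print(before, after)
```

With these assumed numbers, lifting utilization from 55% to 85% cuts the required instance count by roughly a third, which is the mechanism behind the cost claims that follow.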

He cited a case study involving South Korean IT firm Kakao Enterprise, which reportedly reduced its total cost of ownership by 50% and its server count by 60%, while improving AI and cloud performance by 30% after deploying AMD’s EPYC processors.

The fifth-generation EPYC processors, according to Navolokin, can deliver integer performance comparable to earlier systems while using up to 86% fewer racks, lowering both power consumption and software licensing costs. He added that demand for CPUs is amplified by the rise of agentic AI systems, which are designed to plan, reason, and act autonomously.
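The 86% figure implies a consolidation ratio of roughly 7:1. A quick check, using a hypothetical legacy footprint of 100 racks (the starting count is illustrative; only the reduction percentage comes from the article):

```python
def consolidated_racks(old_racks, reduction):
    # A reduction of 0.86 means the new deployment needs 14% of the old racks.
    return round(old_racks * (1 - reduction))

old = 100                       # hypothetical legacy footprint
new = consolidated_racks(old, 0.86)
print(new, round(old / new, 1))  # 14 racks, about a 7.1x consolidation
```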

“These systems generate significantly more CPU-side work than traditional inference,” Navolokin explained. “Tasks such as retrieval, prompt preparation, multi-model routing, and synchronization are CPU-driven.” In these scenarios, the CPU functions as a control node across distributed resources that span data centers, cloud platforms, and edge systems.

AMD is positioning its EPYC processors as host CPUs for these demanding workloads. The latest EPYC 9005 Series boasts up to 192 cores, expanded AVX-512 execution, DDR5-6400 memory support, and PCIe Gen 5 I/O—features designed to support large-scale inferencing and GPU-accelerated systems. Navolokin mentioned that this latest generation shows a 37% improvement in instructions per cycle for machine learning and high-performance computing workloads compared to previous EPYC processors.

He also referenced Malaysian reinsurance firm Labuan Re, which anticipates reducing its insurance assessment turnaround time from weeks to less than a day after migrating to an EPYC-powered AI platform.

As AI deployments extend beyond centralized data centers, Navolokin urged organizations to rethink their infrastructure design. “The priority should not be the performance of a single compute resource, but the ability to deploy AI consistently across heterogeneous environments,” he advised. He underscored the importance of open platforms and distributed compute strategies, noting that real-time inference often runs more efficiently on edge devices or AI PCs closer to data sources.

“Success in inferencing is no longer defined solely by raw compute power,” Navolokin concluded. “It depends on latency, efficiency, and the ability to operate across data center, cloud, and edge environments.”

Written By: The AiPressa Staff

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.


© 2025 AIPressa · Part of Buzzora Media · All rights reserved.