KPMG’s Fabiana Clemente Reveals Key Insights on Synthetic Data’s Role in AI Systems

KPMG’s Fabiana Clemente reveals that synthetic data is revolutionizing AI training and fraud detection, enabling compliance with privacy laws while enhancing analytical capabilities.

Staff

Published 12 February, 2026

Synthetic data, despite being a concept that’s been around for decades, continues to be surrounded by misconceptions, according to Fabiana Clemente, a senior director at KPMG. In a recent discussion with tech expert Ben Lorica, Clemente explored the growing applications of synthetic data and its evolving role in areas such as privacy, fraud detection, and artificial intelligence.

Clemente emphasized that synthetic data is defined as data generated independently of real-world events. It is increasingly being utilized for diverse applications, ranging from straightforward test data management to complex AI training processes. “Understanding the nuances of synthetic data is crucial for successful implementation,” she remarked, pointing out that its effectiveness varies greatly depending on the specific use case.

Among the most prominent applications she mentioned were sharing data with offshore teams under strict privacy controls and improving the training of AI agents. “When you can’t share a real dataset, synthetic replicas offer a viable alternative,” Clemente noted. Fraud detection was another application, and a surprising success story, showcasing synthetic data’s potential to enhance analytical capabilities.

However, Clemente highlighted common pitfalls for organizations new to synthetic data. One significant mistake is underestimating its complexity. “People often expect that generating synthetic data is as simple as clicking a button,” she cautioned. Understanding the requirements and methodologies behind synthetic data generation is essential for achieving the desired outcomes.
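
To illustrate why a one-click approach can fall short, here is a minimal Python sketch. It is not a method Clemente described; the dataset, column names, and the "naive" generator are all hypothetical, chosen only to show one well-known failure mode: sampling each column in isolation reproduces per-column statistics but loses the relationships between columns.

```python
import random
import statistics


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def make_real_data(n, rng):
    # Hypothetical "real" dataset: a transaction amount and a risk score
    # that depends on it, so the two columns are strongly correlated.
    amounts = [rng.gauss(100.0, 20.0) for _ in range(n)]
    scores = [0.5 * a + rng.gauss(0.0, 5.0) for a in amounts]
    return amounts, scores


def naive_synthetic(column, n, rng):
    # Naive generator (an assumption for illustration): fit a normal
    # distribution to ONE column and sample from it, ignoring all others.
    mu = statistics.fmean(column)
    sigma = statistics.stdev(column)
    return [rng.gauss(mu, sigma) for _ in range(n)]


rng = random.Random(0)  # fixed seed so the run is repeatable
real_amounts, real_scores = make_real_data(5000, rng)
syn_amounts = naive_synthetic(real_amounts, 5000, rng)
syn_scores = naive_synthetic(real_scores, 5000, rng)

real_corr = pearson(real_amounts, real_scores)
syn_corr = pearson(syn_amounts, syn_scores)

print(f"mean amount  real={statistics.fmean(real_amounts):.1f}  "
      f"synthetic={statistics.fmean(syn_amounts):.1f}")
print(f"correlation  real={real_corr:.2f}  synthetic={syn_corr:.2f}")
```

In this sketch the synthetic columns match the real ones in mean and spread, yet the strong correlation between amount and score in the real data is close to zero in the synthetic copy. A model trained on such data would miss exactly the kind of relationship a fraud-detection use case depends on, which is one reason the requirements have to be understood before generating anything.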

Historically, synthetic data applications centered mainly on structured data, but their reach has expanded significantly. “Text has become the dominant form of synthetic data today,” Clemente explained. This trend reflects the broader adoption of generative AI technologies in recent years. While synthetic data generated by language models can be useful, it still raises concerns about quality and structure.

As organizations increasingly incorporate synthetic data into their workflows, they must also address potential technical challenges. Clemente underscored that issues like data drift and model bias remain relevant even in synthetic scenarios. “The processes around building data solutions are critical,” she stated, emphasizing the necessity for governance and training to avoid propagating errors in model training.

With advancements in generative AI, the landscape of synthetic data is evolving. Major tech companies such as Meta and OpenAI are progressively integrating synthetic data into their AI frameworks. “These companies are leveraging synthetic data to optimize knowledge spaces and enhance multi-agent systems,” Clemente noted. This transition reflects a broader shift in AI development, where testing and validation of models are increasingly conducted through synthetic environments rather than solely relying on historical data.

Clemente also mentioned the interplay between synthetic data and emerging technologies, such as robotics, where the incorporation of simulations can bridge gaps in real-world data acquisition. “Synthetic data can help cover scenarios that may be difficult to capture using traditional methods,” she explained, promoting a pragmatic approach to data collection.

Looking ahead, the conversation highlighted the importance of adapting synthetic data practices to meet the growing complexity of AI systems. As the industry continues to grapple with data scarcity, the strategic use of synthetic data could be vital for maintaining model accuracy and efficacy. “Synthetic data serves as a necessary accelerator in the evolution of AI,” Clemente concluded, underscoring its potential to redefine how organizations approach data in a rapidly changing technological landscape.

AIPRESSA.COM
