Connect with us

Hi, what are you looking for?

AI Technology

FlashLabs Launches Chroma 1.0, First Open-Source Real-Time Voice AI with 135ms Latency

FlashLabs unveils Chroma 1.0, the world’s first open-source real-time voice AI with an impressive 135ms latency and advanced personalized voice cloning capabilities.

SAN FRANCISCO, Jan. 22, 2026 /PRNewswire/ — FlashLabs, an applied AI research and engineering lab, has launched Chroma 1.0, touted as the world’s first open-source, end-to-end, real-time speech-to-speech AI model capable of personalized voice cloning. This innovative system aims to eliminate latency issues that have long hampered human-AI interaction, enabling more fluid and immediate conversations.

Chroma operates natively in voice, bypassing the traditional pipeline of automatic speech recognition (ASR), large language models (LLM), and text-to-speech (TTS) technologies. This architecture allows for natural conversational exchanges that feel more human-like and responsive, according to the company.

“Voice is the most universal interface in the world, yet it has remained closed, fragmented, and delayed,” stated Yi Shi, Founder and Chief Research & Engineering at FlashLabs. “With Chroma, we’re open-sourcing real-time voice intelligence so builders, researchers, and companies can create AI systems that truly work at human speed.”

Chroma is designed specifically for real-time applications, achieving an impressive end-to-end time-to-first-token (TTFT) of under 150 milliseconds. This capability supports features such as natural conversational turn-taking, low-latency emotional and prosodic control, and stable real-time inference free from cascading delays. With the introduction of Day-0 SGLang support, the model further reduces latency, achieving approximately 135ms TTFT, making it suitable for live deployment.

One of the standout features of Chroma is its ability to perform few-second reference voice cloning. This allows users to create highly realistic, personalized voices from minimal audio input. Internal evaluations report a speaker similarity score of 0.817, which is over 10.96% above the human baseline and demonstrates best-in-class performance among both open and closed systems.

Despite using a compact architecture of approximately 4 billion parameters, Chroma offers robust reasoning and dialogue capabilities. The model leverages modern multimodal backbones and optimized real-time inference, making it suitable for various applications, including edge deployment, virtual agents, call centers, and interactive systems where latency and cost are critical factors.

Chroma enables a new range of real-time voice applications, from autonomous voice agents and AI call centers to real-time translators and conversational assistants. Its potential also extends to interactive characters and non-player characters (NPCs) in gaming, as well as multimodal AI systems that require seamless communication across different modes.

Chroma 1.0 is available today, representing a significant advancement in voice-based AI. FlashLabs continues to focus on building open and production-grade systems that enhance agency across voice, text, and actions in various domains.

For more information, media inquiries can be directed to Koki Kobayashi at 650-609-7501 or via email at [email protected].

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Enhans launches CommerceOS, an AI-driven commerce platform that optimizes over 1,000 marketplaces globally, while establishing a new San Francisco HQ for North American growth.

AI Marketing

Higgsfield secures $80M in funding, boosting its valuation to $1.3B as demand for AI-driven video content surges, targeting social media marketers.

AI Technology

MIT engineers unveil a stacked memory transistor technology that enhances AI chip energy efficiency by 130%, addressing soaring data center demands.

AI Research

Microsoft acquires AI startup Bonsai to enhance Azure’s capabilities in deep reinforcement learning and machine teaching for autonomous systems.

AI Generative

Owkin unveils a groundbreaking agentic infrastructure for biomedical research, enhancing drug discovery with AI agents that improve analysis accuracy by 23.7% and streamline workflows.

AI Regulation

GC AI's ROI study reveals in-house legal teams save an average of 14 hours weekly and cut outside counsel costs by 14%, translating to...

Top Stories

Industry leaders at a San Francisco conference emphasized that without standardized ethical guidelines, the deployment of AI in healthcare could jeopardize patient trust and...

Top Stories

Meta's AI lab suffers a leadership crisis as VP Jitendra Malik departs to lead Amazon's robotics research amid ongoing internal turmoil and budget cuts.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.