FlashLabs Launches Chroma 1.0, First Open-Source Real-Time Voice AI with 135ms Latency

FlashLabs unveils Chroma 1.0, which it bills as the world’s first open-source real-time voice AI, with a reported latency of about 135ms and personalized voice cloning.

SAN FRANCISCO, Jan. 22, 2026 /PRNewswire/ — FlashLabs, an applied AI research and engineering lab, has launched Chroma 1.0, touted as the world’s first open-source, end-to-end, real-time speech-to-speech AI model capable of personalized voice cloning. This innovative system aims to eliminate latency issues that have long hampered human-AI interaction, enabling more fluid and immediate conversations.

Chroma operates natively in voice, bypassing the traditional pipeline of automatic speech recognition (ASR), large language models (LLM), and text-to-speech (TTS) technologies. This architecture allows for natural conversational exchanges that feel more human-like and responsive, according to the company.

“Voice is the most universal interface in the world, yet it has remained closed, fragmented, and delayed,” stated Yi Shi, Founder and Chief Research & Engineering at FlashLabs. “With Chroma, we’re open-sourcing real-time voice intelligence so builders, researchers, and companies can create AI systems that truly work at human speed.”

Chroma is designed specifically for real-time applications, achieving an impressive end-to-end time-to-first-token (TTFT) of under 150 milliseconds. This capability supports features such as natural conversational turn-taking, low-latency emotional and prosodic control, and stable real-time inference free from cascading delays. With the introduction of Day-0 SGLang support, the model further reduces latency, achieving approximately 135ms TTFT, making it suitable for live deployment.
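For readers who want to check a latency figure like this on their own setup, the following is a minimal sketch of how time-to-first-token could be measured for any streaming model. It does not use Chroma's or SGLang's actual API: `measure_ttft` and `fake_model_stream` are hypothetical names introduced here for illustration, and the stream is assumed to be an ordinary Python iterable that yields output chunks as they are produced.

```python
import time
from typing import Callable, Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def measure_ttft(
    stream: Iterable[T],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, T, Iterator[T]]:
    """Return (ttft_ms, first_chunk, remaining_chunks) for a streaming response.

    The timer starts just before the first chunk is requested and stops as
    soon as that chunk arrives. `clock` is injectable so the function can be
    tested without real waiting.
    """
    start = clock()
    iterator = iter(stream)
    first_chunk = next(iterator)  # blocks until the first chunk is available
    ttft_ms = (clock() - start) * 1000.0
    return ttft_ms, first_chunk, iterator


def fake_model_stream(first_delay_s: float = 0.05, chunks: int = 3) -> Iterator[str]:
    """Hypothetical stand-in for a model: waits, then yields audio chunk labels."""
    time.sleep(first_delay_s)  # simulated delay before the first output
    for i in range(chunks):
        yield f"chunk-{i}"


if __name__ == "__main__":
    ttft_ms, first, rest = measure_ttft(fake_model_stream())
    print(f"TTFT: {ttft_ms:.1f} ms, first chunk: {first}, remaining: {list(rest)}")
```

Because the generator body only runs when the first chunk is requested, the simulated delay falls inside the timed window. In a real measurement the clock would need to start when the user's audio ends, which this sketch does not model.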

One of the standout features of Chroma is voice cloning from a reference clip only a few seconds long. This allows users to create highly realistic, personalized voices from minimal audio input. Internal evaluations report a speaker similarity score of 0.817, which the company says is 10.96% above the human baseline and best-in-class among both open and closed systems.
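The announcement does not say how the similarity score is computed. A common approach, assumed here purely for illustration, is the cosine similarity between speaker embeddings of the reference and generated audio. The sketch below shows that calculation along with the relative-gain arithmetic. The embeddings and the function names are hypothetical, and the baseline of roughly 0.736 is not stated in the source: it is back-calculated on the assumption that "10.96% above" means a relative improvement.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def relative_gain_percent(score: float, baseline: float) -> float:
    """Percentage by which `score` exceeds `baseline`, relative to `baseline`."""
    return (score - baseline) / baseline * 100.0


if __name__ == "__main__":
    # Toy 3-dimensional embeddings; real speaker embeddings are much larger.
    reference = [0.2, 0.9, 0.4]
    cloned = [0.25, 0.85, 0.45]
    print(f"similarity: {cosine_similarity(reference, cloned):.3f}")

    # Assumed reading of the reported figures: 0.817 is 10.96% above a
    # baseline of about 0.817 / 1.1096 (an inference, not a reported number).
    implied_baseline = 0.817 / 1.1096
    print(f"implied baseline: {implied_baseline:.4f}")
    print(f"gain: {relative_gain_percent(0.817, implied_baseline):.2f}%")
```

If the 10.96% figure were instead an absolute difference in score, the implied baseline would be different, so the back-calculated value should be treated as one possible reading only.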

Despite using a compact architecture of approximately 4 billion parameters, Chroma offers robust reasoning and dialogue capabilities. The model leverages modern multimodal backbones and optimized real-time inference, making it suitable for various applications, including edge deployment, virtual agents, call centers, and interactive systems where latency and cost are critical factors.

Chroma enables a new range of real-time voice applications, from autonomous voice agents and AI call centers to real-time translators and conversational assistants. Its potential also extends to interactive characters and non-player characters (NPCs) in gaming, as well as multimodal AI systems that require seamless communication across different modes.

Chroma 1.0 is available today, representing a significant advancement in voice-based AI. FlashLabs continues to focus on building open and production-grade systems that enhance agency across voice, text, and actions in various domains.

Media inquiries can be directed to Koki Kobayashi at 650-609-7501 or by email at [email protected].

Written by AiPressa Staff

