Connect with us

Hi, what are you looking for?

Top Stories

VIDRAFT Launches MARL Middleware to Cut LLM Hallucinations, Now on Hugging Face and GitHub

VIDRAFT launches MARL, a groundbreaking middleware now on Hugging Face and GitHub, enhancing LLM reasoning and reducing hallucinations significantly.

SEOUL, South Korea – VIDRAFT, an AI startup based at Seoul AI Hub, has announced the global release of MARL, or Model-Agnostic Runtime Middleware for LLMs. This innovative reasoning middleware aims to significantly reduce hallucinations and enhance self-correction in large language models (LLMs). MARL is now accessible on platforms including Hugging Face, GitHub, PyPI, and ClawHub, the skill marketplace associated with the AI agent platform OpenClaw.

MARL serves as a runtime layer that enhances the reasoning capabilities of language models without necessitating fine-tuning or retraining. Developers and enterprises can implement MARL with minimal code alterations across a variety of models that align with the OpenAI API format. This includes widely recognized models such as GPT, Claude, Gemini, DeepSeek, Grok, and Llama.

According to VIDRAFT, many LLMs frequently deliver incorrect responses with high confidence, often lacking robust mechanisms for error detection and correction. MARL addresses this challenge by restructuring a model’s response into a multi-stage reasoning process. This method first involves planning an approach, followed by drafting an answer, conducting independent verification of that draft, and ultimately generating a revised final response. The outcome is a more deliberate and consistent output compared to the traditional single-pass generation process.

The company’s internal evaluation framework, known as FINAL Bench, highlighted a noticeable gap among current frontier models in recognizing and rectifying potential errors. VIDRAFT reported that MARL demonstrated substantial improvements in more complex tasks, with a significant portion of the progress attributed to its self-correction phase.

In addition to general reasoning enhancements, MARL incorporates specialized reasoning engines tailored for specific domains such as drug discovery, legal analysis, and creative work. This feature enables organizations to increase response reliability without altering model weights or committing to a single model vendor.

Moreover, MARL has been listed on ClawHub, which serves as a marketplace for OpenClaw. VIDRAFT emphasized that while agent platforms are intended to execute tasks, MARL enhances the reasoning quality underpinning those tasks by adding a structured layer of reflection and verification prior to delivering a final answer.

“We started with a simple observation: even the most advanced AI models still struggle to reliably assess the limits of their own answers,” remarked Min-Sik Kim, CEO of VIDRAFT. “MARL does not replace the model. It changes how the model thinks at runtime. We believe trustworthy AI begins with systems that can question, review, and improve their own outputs before presenting them to users.”

VIDRAFT is in the process of preparing an enterprise edition of MARL, which is expected to launch in the first half of 2026. The company plans to submit its validation results for academic publication and has already conducted proof-of-concept engagements in the U.S. market, with ongoing efforts in localization.

About VIDRAFT

Founded in 2024, VIDRAFT is an AI startup focused on advancing toward True AGI by 2030. The company developed FINAL Bench, an evaluation benchmark for AI metacognition, and has achieved a series of results across global AI platforms and leaderboards.

Media Contact
Company Name: VIDRAFT
Contact Person: CHUNG HOON LEE
Email: Send Email
Country: South Korea
Website: https://vidraft.net/

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Cybersecurity

Alibaba unveils the JVS Claw app to streamline OpenClaw's adoption, amid rising security concerns as AI tools rapidly infiltrate daily tasks.

AI Research

AI transforms research workflows by enhancing efficiency, but human oversight is essential to ensure accountability and maintain innovation integrity.

AI Government

Chinese cybersecurity officials warn that improper use of OpenClaw, the AI assistant adopted by firms like Tencent and Alibaba Cloud, poses severe data security...

AI Technology

AMD Ryzen AI Max+ enables local execution of advanced LLMs like Qwen 3.5 122B, revolutionizing AI performance and enhancing user privacy.

AI Research

New research reveals that generative AI models may unintentionally lead to cultural homogenization, risking the loss of unique human expression and thought diversity.

AI Government

Hong Kong bans the installation of AI agent OpenClaw amid rising security concerns, prompting financial institutions on the mainland to restrict employee access.

AI Regulation

China's rapid adoption of OpenClaw, an AI tool embraced by tech giants like Tencent and Alibaba, sparks urgent data security concerns as youth unemployment...

AI Government

OpenClaw surges in popularity among Chinese tech professionals, despite government warnings, as users seek innovative AI solutions to enhance productivity and workflow efficiency.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.