Connect with us

Hi, what are you looking for?

Top Stories

VIDRAFT Launches MARL Middleware to Cut LLM Hallucinations, Now on Hugging Face and GitHub

VIDRAFT launches MARL, a groundbreaking middleware now on Hugging Face and GitHub, enhancing LLM reasoning and reducing hallucinations significantly.

SEOUL, South Korea – VIDRAFT, an AI startup based at Seoul AI Hub, has announced the global release of MARL, or Model-Agnostic Runtime Middleware for LLMs. This innovative reasoning middleware aims to significantly reduce hallucinations and enhance self-correction in large language models (LLMs). MARL is now accessible on platforms including Hugging Face, GitHub, PyPI, and ClawHub, the skill marketplace associated with the AI agent platform OpenClaw.

MARL serves as a runtime layer that enhances the reasoning capabilities of language models without necessitating fine-tuning or retraining. Developers and enterprises can implement MARL with minimal code alterations across a variety of models that align with the OpenAI API format. This includes widely recognized models such as GPT, Claude, Gemini, DeepSeek, Grok, and Llama.

According to VIDRAFT, many LLMs frequently deliver incorrect responses with high confidence, often lacking robust mechanisms for error detection and correction. MARL addresses this challenge by restructuring a model’s response into a multi-stage reasoning process. This method first involves planning an approach, followed by drafting an answer, conducting independent verification of that draft, and ultimately generating a revised final response. The outcome is a more deliberate and consistent output compared to the traditional single-pass generation process.

The company’s internal evaluation framework, known as FINAL Bench, highlighted a noticeable gap among current frontier models in recognizing and rectifying potential errors. VIDRAFT reported that MARL demonstrated substantial improvements in more complex tasks, with a significant portion of the progress attributed to its self-correction phase.

In addition to general reasoning enhancements, MARL incorporates specialized reasoning engines tailored for specific domains such as drug discovery, legal analysis, and creative work. This feature enables organizations to increase response reliability without altering model weights or committing to a single model vendor.

Moreover, MARL has been listed on ClawHub, which serves as a marketplace for OpenClaw. VIDRAFT emphasized that while agent platforms are intended to execute tasks, MARL enhances the reasoning quality underpinning those tasks by adding a structured layer of reflection and verification prior to delivering a final answer.

“We started with a simple observation: even the most advanced AI models still struggle to reliably assess the limits of their own answers,” remarked Min-Sik Kim, CEO of VIDRAFT. “MARL does not replace the model. It changes how the model thinks at runtime. We believe trustworthy AI begins with systems that can question, review, and improve their own outputs before presenting them to users.”

VIDRAFT is in the process of preparing an enterprise edition of MARL, which is expected to launch in the first half of 2026. The company plans to submit its validation results for academic publication and has already conducted proof-of-concept engagements in the U.S. market, with ongoing efforts in localization.

About VIDRAFT

Founded in 2024, VIDRAFT is an AI startup focused on advancing toward True AGI by 2030. The company developed FINAL Bench, an evaluation benchmark for AI metacognition, and has achieved a series of results across global AI platforms and leaderboards.

Media Contact
Company Name: VIDRAFT
Contact Person: CHUNG HOON LEE
Email: Send Email
Country: South Korea
Website: https://vidraft.net/

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Business

Red Hat advances enterprise AI with Small Language Models that achieve over 98% validity in structured tasks, prioritizing reliability and data sovereignty.

AI Technology

Apple CEO Tim Cook warns of several-month supply shortages for the Mac mini and Mac Studio as demand surges, pushing Mac revenue to $8.4...

AI Generative

Apple's new LaDiR framework enhances large language model accuracy by 20% in math reasoning and code generation, revolutionizing AI problem-solving.

Top Stories

Mistral AI launches its 128-billion-parameter Medium 3.5 model, scoring 77.6% on key benchmarks, yet faces criticism for high pricing and mixed performance.

Top Stories

Nvidia enters South Korea's AI market by launching 7 million Korean-language personas and the multimodal Nemotron3 Nano, aiming to establish market dominance.

Top Stories

Multiverse Computing unveils the LittleLamb AI model family on Hugging Face, reducing model size by 50% while enhancing performance for edge and mobile applications.

Top Stories

Google DeepMind's Alexander Lerchner claims AI can't achieve consciousness, challenging AGI narratives and revealing it as mere advanced simulation.

AI Technology

Lumai unveils the Iris inference server, the world's first optical system enabling real-time execution of billion-parameter AI models with 90% lower energy consumption.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.