Connect with us

Hi, what are you looking for?

AI Generative

HELIOS Achieves 85%+ Compilability for LLM Binary Decompilation Using Control Flow Graphs

UC Irvine’s HELIOS framework boosts binary decompilation compilability from 45% to over 85% using control flow graphs, transforming security analysis methods.

Researchers at the University of California, Irvine, have unveiled a pioneering framework named HELIOS that significantly enhances the decompilation of binary code through large language models (LLMs). This innovative approach addresses the limitations of existing methods, which often treat code simply as text and fail to account for essential control flow graphs. The research team, comprising Yonatan Gizachew Achamyeleh, Harsh Thomare, and Mohammad Abdullah Al Faruque, has reframed binary decompilation as a structured reasoning process, enabling a more accurate understanding of complex, optimized binaries.

The core advantage of HELIOS lies in its ability to summarize a binary’s control flow into a hierarchical text representation. This representation details basic blocks, their connections, and high-level constructs like loops and conditionals, thus providing critical structural context that traditional LLM approaches overlook. The research demonstrated significant improvements in the compilability of object files, with performance metrics soaring from 45.0% to 85.2% when using Gemini 2.0, and from 71.4% to 89.6% with GPT-4.1 Mini on the HumanEval-Decompile benchmark.

Incorporating compiler feedback further boosted compilability rates beyond 94%, while also improving functional correctness by up to 5.6 percentage points compared to text-only methods. This represents a substantial leap in the reliability and usability of LLM-driven decompilation, particularly for security analysts who often grapple with complex binary analysis. The framework has shown adaptability across six architectures, including x86, ARM, and MIPS, effectively reducing variations in functional correctness while maintaining high syntactic accuracy.

The method employed by the team includes a static analysis backend to derive both control flow and call graphs, which form the basis for the hierarchical textual representation. This representation is then used in conjunction with raw decompiler output and optional compiler feedback to guide the LLM’s interpretation. By crafting prompts that summarize each function’s role and detailing the control flow, researchers have created a system that mirrors the analytical approach of human experts, facilitating a fine-tuning-free and architecture-agnostic pipeline.

In their experiments, the research team utilized the HumanEval-Decompile benchmark focused on the x86_64 architecture, where they achieved impressive results. The framework’s ability to translate intricate control flow data into a format that LLMs can effectively process has proven essential for improving the accuracy and consistency of decompiled code. The results reveal that HELIOS not only enhances the rate at which code can be successfully compiled but also strengthens the logical consistency of the output, positioning it as a transformative tool for reverse engineering and security analysis.

As software security evolves and the demand for skilled reverse engineers increases, HELIOS addresses a pressing need within the industry by automating challenging tasks like reverse engineering, malware analysis, and vulnerability assessment. With this framework, analysts can obtain recompilable, semantically faithful code across various hardware platforms, making it a practical asset for security settings. The researchers highlight the potential of HELIOS to reshape the landscape of binary analysis and software security, paving the way for more efficient and effective security research methodologies.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Technology

UC Riverside's Test-Time Matching method enhances AI reasoning by 89.4%, surpassing GPT-4 with a groundbreaking self-improvement approach.

AI Generative

UC Berkeley researchers unveil diffusion language models that achieve optimal parallel text generation, outperforming autoregressive models in speed and efficiency.

AI Finance

AMD launches its Helios AI data center rack to rival Nvidia, promising a groundbreaking 1,000x performance boost with the upcoming MI500 series GPUs.

AI Generative

Chinese researchers unveil TurboDiffusion, slashing AI video generation times by 200x, enabling a five-second HD clip in just 24 seconds.

AI Technology

Alphabet acquires Intersect for $4.75B to enhance data center energy solutions, aiming for advanced renewable initiatives and optimized infrastructure by 2026

AI Research

AI adoption in research boosts productivity by up to 89% but raises concerns over quality, with complex language masking weak scientific contributions.

AI Research

New research from UCLA and UC Berkeley introduces a protocol that halves communication complexity in distributed estimation, improving efficiency in data-driven decision-making.

AI Research

AMD targets a $280 price with the upcoming Helios AI system set to transform data centers, bolstering its long-term growth strategy in AI.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.