Salesforce Research has made significant strides in open-source code intelligence with CodeT5, which reached 22,172 monthly downloads on Hugging Face as of December 2025. That figure underscores its standing as a leading tool among developers, backed by a versatile family of checkpoints, spanning the original CodeT5 and the newer CodeT5+, that ranges from 60 million to 16 billion parameters. Notably, the instruction-tuned InstructCodeT5+ 16B variant achieved a 35.0% pass rate on the HumanEval benchmark, at the time a state-of-the-art result among open-source code models.
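For developers trying the model out, the checkpoints are available through the Hugging Face transformers library. The following is a minimal sketch adapted from the usage pattern on the published CodeT5 model card, using the base checkpoint and its span-masking pretraining objective:

```python
# Minimal sketch: loading CodeT5-base via Hugging Face transformers.
# Checkpoint name and usage follow the published Salesforce model card.
from transformers import RobertaTokenizer, T5ForConditionalGeneration

tokenizer = RobertaTokenizer.from_pretrained("Salesforce/codet5-base")
model = T5ForConditionalGeneration.from_pretrained("Salesforce/codet5-base")

# CodeT5 was pretrained with span masking, so it can fill <extra_id_0> slots.
text = "def greet(user): print(f'hello <extra_id_0>!')"
input_ids = tokenizer(text, return_tensors="pt").input_ids
generated = model.generate(input_ids, max_length=10)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```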
The CodeT5 model family has garnered over 3,100 stars and 487 forks on GitHub, indicating robust engagement from the developer community. Built on the T5 encoder-decoder framework, the architecture supports both code understanding tasks (such as defect and clone detection) and code generation tasks (such as summarization and translation). Community fine-tunes are also noteworthy: 86 specialized variants target tasks such as vulnerability detection and code review automation.
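As one illustration of the encoder-decoder design in practice, a Salesforce-published fine-tune maps a function body to a one-line natural-language summary. The sketch below assumes the Salesforce/codet5-base-multi-sum checkpoint; the input function is an arbitrary example, not from the training data:

```python
# Hedged sketch: code summarization with a CodeT5 seq2seq fine-tune.
from transformers import RobertaTokenizer, T5ForConditionalGeneration

ckpt = "Salesforce/codet5-base-multi-sum"  # multilingual summarization fine-tune
tokenizer = RobertaTokenizer.from_pretrained(ckpt)
model = T5ForConditionalGeneration.from_pretrained(ckpt)

code = '''def count_lines(path):
    with open(path) as f:
        return sum(1 for _ in f)'''
input_ids = tokenizer(code, return_tensors="pt").input_ids
summary_ids = model.generate(input_ids, max_length=24)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```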
The family's training data has also expanded considerably: CodeT5+ was pretrained on 51.5 billion tokens, compared with the roughly 8.35 million training instances used for the original CodeT5. This shift reflects a commitment to improving multilingual code representation, with support for nine programming languages including the recently added C++. Training was conducted on permissively licensed repositories to support commercial use.
Performance benchmarks show that larger models yield clear gains on code generation. The InstructCodeT5+ 16B model exceeded OpenAI's code-cushman-001 on HumanEval pass rate, underscoring the benefit of greater parameter counts. When paired with the CodeT strategy, which generates test cases to rerank candidate solutions, its pass rate rose further to 42.9%.
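For context on how these numbers are computed, HumanEval results are conventionally reported with the unbiased pass@k estimator introduced alongside the benchmark, in which $n$ completions are sampled per problem and $c$ of them pass the unit tests:

$$\text{pass@}k \;=\; \mathbb{E}_{\text{problems}}\!\left[\,1 - \frac{\binom{n-c}{k}}{\binom{n}{k}}\,\right]$$

The 35.0% and 42.9% figures cited above are pass@1 scores, i.e., the expected probability that a single sampled solution passes all of a problem's tests.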
Notably, the environmental impact of training has also been documented: pretraining the CodeT5-base variant emitted 49.25 kg of CO2, a figure fully offset through carbon credits from Google Cloud Platform. This commitment to sustainability aligns with growing concerns over the ecological footprint of AI development.
CodeT5’s influence extends into the academic realm as well, with over 1,500 research citations noted as of late 2025. The underlying methodologies from Salesforce Research have contributed significantly to the advancement of techniques in code generation and understanding, positioning CodeT5 as a vital resource in the ongoing evolution of code intelligence.
As developers continue to explore its capabilities, the sustained interest shown in CodeT5, along with its community-driven enhancements, suggests that it will remain a pivotal tool in software engineering and natural language processing. The model’s ability to adapt to diverse programming tasks while maintaining high performance indicates a promising future for open-source initiatives in AI innovation.