AI Generative

Aniket Roy Reveals Resource-Constrained Image Generation Techniques in PhD Research

Aniket Roy, a PhD from Johns Hopkins, unveils FeLMi and DiffNat, enhancing image generation efficiency in low-resource environments for practical AI applications.

Staff

Published

7 April, 2026

In a recent interview, Aniket Roy, a newly minted PhD from Johns Hopkins University, shared insights into his groundbreaking research in generative models for computer vision tasks. Under the guidance of Bloomberg Distinguished Professor Rama Chellappa, Roy’s work focuses on enhancing efficiency and adaptability in image generation, especially in resource-constrained environments.

Roy’s PhD research traverses the realms of generative AI, multimodal learning, and few-shot learning. He has sought to create methodologies that enable models to learn new concepts or execute intricate visual tasks with minimal data and computational resources. His work addresses longstanding challenges such as data scarcity and personalized image synthesis, aiming to make advanced vision systems more practical for real-world applications.

One significant contribution from Roy is FeLMi, a few-shot learning framework that utilizes uncertainty-guided hard mixup strategies. This innovation improves robustness when working with a limited number of labeled samples. Another noteworthy project is Cap2Aug, which employs textual descriptions to guide synthetic image generation, effectively enhancing visual diversity and bridging the gap between real and generated data.

In addition to these frameworks, Roy developed DiffNat, a regularization method that improves the perceptual quality of images generated by diffusion models. By applying a kurtosis-concentration loss, DiffNat encourages generated images to exhibit more natural texture statistics, a crucial element in enhancing visual realism for downstream vision tasks.

Furthermore, Roy has made strides in personalizing generative models. He introduced DuoLoRA, a framework designed for efficient control over content and style, allowing for fine-tuning without necessitating a complete model retraining. This innovation extends to zero-shot settings, enabling users to customize objects during generation simply through textual input. His MultiLFG framework further refines this process by incorporating wavelet-domain representations to facilitate accurate and training-free fusion of various concepts within diffusion models.

Among the projects that Roy found particularly engaging is DiffNat, which he presented at the International Conference on Learning Representations (TMLR) in 2025. This project highlights the importance of improving the perceptual quality of images generated by diffusion models, addressing a challenge that has persisted despite significant advancements in generative AI. Roy’s method not only enhances the statistical consistency of generated images but also integrates a condition-agnostic perceptual guidance strategy that boosts image fidelity without needing additional training.

The transition from academic research to practical applications is a key focus for Roy as he embarks on a new chapter at NEC Laboratories America as a Research Scientist. He aims to develop new generative model methodologies while exploring their interactions with multimodal systems. His interests lie at the intersection of generative models, vision-language-action models, and embodied AI, with the broader goal of enhancing intelligent systems that can proficiently understand and generate visual information.

Reflecting on his journey, Roy’s fascination with computer vision and machine learning was ignited during his undergraduate studies. The immediate visual impact of signal and image processing algorithms captivated him, fostering a deep curiosity about how machines can emulate human visual perception. His intellectual curiosity was further nurtured by mentorship from Dr. Kuntal Ghosh, who inspired him to approach complex problems with scientific rigor.

Roy’s experience at the recent AAAI Doctoral Consortium, although marred by visa issues that prevented his attendance, was nonetheless fruitful. His colleague’s presentation of his research poster sparked insightful discussions with fellow researchers, yielding constructive feedback and potential collaborative opportunities. Roy expressed appreciation for the platform, recognizing it as a valuable avenue for sharing early-stage ideas and engaging with the academic community.

Beyond his research endeavors, Roy finds joy in music, stand-up comedy, and travel. He considers exploring diverse cultures a refreshing escape and is also a budding poet who combines humor and storytelling through his performances. This creative outlet contrasts with his rigorous analytical research, allowing him to maintain a well-rounded perspective on life and work.

As Roy moves forward, he remains committed to advancing the capabilities of generative models and their applications, striving to contribute to the scientific understanding of intelligent systems that can interact effectively with the visual world.

AI Generative

Nvidia Expands Partnerships with Asian Firms, Boosting AI Chip Demand by 90%

Nvidia's partnerships with Asian firms like LG and Nanya surge AI chip demand to 90% of production costs, reshaping the tech landscape in Asia.

Staff3 May, 2026

AI Finance

Alphabet Invests $40 Billion in Anthropic, Securing Key AI Infrastructure and Growth

Google invests $10 billion in Anthropic, boosting its valuation to $350 billion and securing critical AI infrastructure ahead of a potential IPO.

Marcus Chen27 April, 2026

Meta Cuts 8,000 Jobs as Microsoft Offers Voluntary Buyouts to 8,750 Employees

Meta cuts 8,000 jobs amid a strategic pivot to AI investment, while Microsoft offers buyouts to 8,750 employees as tech companies adapt to evolving...

Staff24 April, 2026

AI Technology

Intel Announces Robust AI-Driven Sales Forecast, Shares Surge 20% to Record High

Intel's robust sales forecast of up to $14.8 billion for June, driven by soaring AI demand, propelled shares 20% higher to record levels.

Staff23 April, 2026

Tencent and Alibaba Eye $40B AI Startup DeepSeek, Seek Major Stake in Funding Round

Tencent aims for a 20% stake in $40B AI startup DeepSeek as Alibaba joins funding talks, intensifying the competition in China's AI landscape

Staff23 April, 2026

AI Cybersecurity

Anthropic Cyberattack Exposes Vulnerabilities in AI Models, Highlights Security Risks

Anthropic’s Mythos AI model was breached through a simple exploit, raising alarms about the vulnerability of advanced AI systems in cybersecurity.

Rachel Torres22 April, 2026

AI Tools

Unauthorized Group Accesses Anthropic’s Mythos Cybersecurity Tool via Third-Party Vendor

Unauthorized users accessed Anthropic's Mythos cybersecurity tool through a third-party vendor, raising serious enterprise security concerns.

Staff22 April, 2026

AI Finance

Anthropic’s Mythos Raises Cybersecurity Concerns in U.S. Banking Sector

Treasury Secretary Scott Bessent and Fed Chair Jerome Powell convened banking leaders to address escalating cybersecurity threats from Anthropic's AI model, Mythos, highlighting urgent...

Marcus Chen12 April, 2026

AIPRESSA.COM

AI Generative

Aniket Roy Reveals Resource-Constrained Image Generation Techniques in PhD Research

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Generative

Nvidia Expands Partnerships with Asian Firms, Boosting AI Chip Demand by 90%

AI Finance

Alphabet Invests $40 Billion in Anthropic, Securing Key AI Infrastructure and Growth

Top Stories

Meta Cuts 8,000 Jobs as Microsoft Offers Voluntary Buyouts to 8,750 Employees

AI Technology

Intel Announces Robust AI-Driven Sales Forecast, Shares Surge 20% to Record High

Top Stories

Tencent and Alibaba Eye $40B AI Startup DeepSeek, Seek Major Stake in Funding Round

AI Cybersecurity

Anthropic Cyberattack Exposes Vulnerabilities in AI Models, Highlights Security Risks

AI Tools

Unauthorized Group Accesses Anthropic’s Mythos Cybersecurity Tool via Third-Party Vendor

AI Finance

Anthropic’s Mythos Raises Cybersecurity Concerns in U.S. Banking Sector