BigQuery Launches SQL-Native Inference for Open Models, Simplifying AI Deployments

Google’s BigQuery introduces SQL-native inference for open models, enabling users to deploy advanced AI with just two SQL statements, simplifying access to generative AI technologies.

Staff

Published

15 January, 2026

Google’s BigQuery has introduced a new feature that enhances accessibility to various large language models (LLMs) for text and embedding generation, including its own Gemini models and those managed in collaboration with partners like Anthropic and Mistral. This capability, which facilitates the use of these models directly within SQL queries, aims to simplify the deployment and management of generative AI models for users, regardless of their technical expertise. Alongside this, BigQuery is extending its support to models available on platforms such as Hugging Face and Vertex AI Model Garden, marking a significant advancement in database management and AI integration.

With the launch of managed third-party generative AI inference in BigQuery (currently in Preview), users can execute open models with just two SQL statements. This streamlined approach offers four primary advantages: simplified deployment, automated resource management, granular resource control, and a unified SQL interface. The deployment process is designed to be straightforward; users can create an open model by issuing a single CREATE MODEL statement that includes the model ID string, such as google/gemma-3-1b-it. BigQuery handles the provisioning of compute resources automatically, making it accessible even for those less familiar with AI model management.

One of the standout features is automated resource management. BigQuery actively releases idle compute resources, which helps prevent unexpected costs for users. This functionality can be customized through the endpoint_idle_ttl configuration, allowing users to define how long resources should remain active without use. Additionally, users have the option to customize backend computing resources, adjusting parameters like machine types and minimum or maximum replicas within the CREATE MODEL statement. This flexibility ensures that users can tailor the performance and expenses of their models to suit their needs.

To illustrate how the process works, one can create a BigQuery managed open model by first executing a CREATE MODEL statement with the appropriate open model ID. Depending on the size of the model and the chosen machine type, the query typically completes within a few minutes. For models sourced from Hugging Face, users must specify the hugging_face_model_id in a format that includes the provider name and model name—an example being sentence-transformers/all-MiniLM-L6-v2.

This initiative reflects a broader commitment by Google to democratize access to advanced AI capabilities while ensuring that organizations can leverage these technologies without extensive resources or expertise. As generative AI continues to evolve and become more integrated within various sectors, the implications of such developments are significant. The ability to utilize powerful models through a familiar SQL interface could redefine workflows across industries, enabling more users to harness the potential of AI-driven analytics and insights.

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

Anthropic's Mythos exposes thousands of critical vulnerabilities in major systems, prompting $100M in defensive action from tech giants and U.S. banks.

Rachel Torres3 May, 2026

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

Staff3 May, 2026

AI Marketing

BusySeed Launches Rankxa to Measure Brand Visibility in AI-Generated Search Results

BusySeed unveils Rankxa, a tool tracking brand visibility across AI-generated responses, revealing 90% of brands lack meaningful presence in this new landscape.

Sofía Méndez3 May, 2026

AI Generative

Google Prepares Omni Model for Gemini Video Generation Ahead of I/O 2026

Google is set to unveil its new video-generation tool, Omni, at I/O 2026, potentially integrating Gemini's capabilities and enhancing competition against ByteDance's Seedance 2.0.

Staff2 May, 2026

AI Technology

A1 Public Relations Enhances AI Visibility for Entertainment Brands in 2026

A1 Public Relations helps entertainment brands enhance AI visibility in 2026 by integrating structured content and fresh, authoritative media, ensuring they are recognized by...

Staff2 May, 2026

AI Business

Jensen Huang Critiques AI Doom Predictions, Calls for Fact-Based Discussions

Nvidia CEO Jensen Huang urges industry leaders to avoid alarmist claims about AI's future, citing concerns over inaccurate predictions like a 50% job displacement...

Marcus Chen2 May, 2026

AI Government

Anthropic Accuses Moonshot AI of 3.4M Unauthorized Claude Exchanges Amid US State Response

Anthropic accuses Moonshot AI of 3.4M unauthorized exchanges with its Claude chatbot, prompting a global U.S. State Department campaign against IP theft.

Staff2 May, 2026

AI Cybersecurity

Anthropic Launches Beta of Claude Security AI Tools to Combat Cyber Threats

Anthropic unveils Claude Security’s public beta, leveraging AI to automate vulnerability scanning and patch generation, poised to enhance enterprise cybersecurity.

Rachel Torres2 May, 2026

AIPRESSA.COM

Top Stories

BigQuery Launches SQL-Native Inference for Open Models, Simplifying AI Deployments

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

AI Marketing

BusySeed Launches Rankxa to Measure Brand Visibility in AI-Generated Search Results

AI Generative

Google Prepares Omni Model for Gemini Video Generation Ahead of I/O 2026

AI Technology

A1 Public Relations Enhances AI Visibility for Entertainment Brands in 2026

AI Business

Jensen Huang Critiques AI Doom Predictions, Calls for Fact-Based Discussions

AI Government

Anthropic Accuses Moonshot AI of 3.4M Unauthorized Claude Exchanges Amid US State Response

AI Cybersecurity

Anthropic Launches Beta of Claude Security AI Tools to Combat Cyber Threats