AI’s Limitations Exposed: Skyscraper and Trombone Illustrate Lack of Common Sense

AI models, including Google’s Gemini, struggle with scale and context, as their consistent errors in size comparisons show, even as roughly 800 million people use ChatGPT each week.

By Staff | Published 2 hours ago

Artificial Intelligence (AI) has become an integral part of daily life, affecting a wide array of industries and applications. Despite its impressive capabilities, experts caution that AI fundamentally relies on statistical patterns rather than genuine intelligence. This becomes evident when a request falls outside the patterns in the data a model was trained on. For instance, when prompted to create an image of a skyscraper and a slide trombone side by side, AI models can produce results in which the two objects appear nearly identical in size, raising questions about their understanding of scale and context.

This observation underscores a significant limitation in AI’s learning process. Although models like Google’s Gemini have made strides since the introduction of ChatGPT in November 2022, the technology remains in its infancy: consumer generative AI has been publicly available for only about three years, yet its adoption has been extraordinary. OpenAI reports that approximately 800 million users engage with ChatGPT weekly, which points to heavy reliance on AI for a range of tasks, especially among students, half of whom are frequent users.

The evolving role of AI raises important questions about its value and limitations. While some critics advocate for a pause in AI research due to concerns over potential superintelligent systems, others argue that AI could render traditional education obsolete. This dichotomy reflects the ongoing debate regarding AI’s societal impact and ethical implications.

To illustrate AI’s limitations, the author asked generative models to depict two objects of very different sizes and then analyzed the results. Prompted to compare the size of a banana and an aircraft carrier, the models consistently produced nonsensical images, highlighting their lack of common sense and poor grasp of spatial relationships. Such outcomes are particularly striking given that AI can perform complex tasks such as passing bar examinations and interpreting medical scans.

The root of these issues lies in the underlying mechanics of AI models. While the theoretical frameworks are established, models like Gemini and its counterparts, such as Mistral and Claude, are built on complex architectures that combine language modeling with diffusion processes. Large language models (LLMs) build statistical representations of text, while diffusion models generate images by adding noise to existing images and training a network to reverse that process. This complexity is compounded by the evolving nature of user prompts, which can lead to inconsistent outputs over time.
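The noising half of that process can be sketched in a few lines of Python. This is an illustration only, not the method used by Gemini or any other named model: the linear noise schedule, the 1,000 steps, the toy five-pixel "image" and all function names are assumptions taken from the common textbook formulation of diffusion models.

```python
import math
import random

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Per-step noise variances rising linearly (an assumed, textbook schedule)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]

def cumulative_signal(betas):
    """Running product of (1 - beta): the share of the original signal left at each step."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars

def add_noise(pixels, alpha_bar, rng):
    """Mix clean pixels with Gaussian noise for one chosen step.

    Returns the noisy pixels and the noise itself; the noise is what a
    denoising network would be trained to predict and remove.
    """
    noise = [rng.gauss(0.0, 1.0) for _ in pixels]
    keep, spread = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    noisy = [keep * p + spread * n for p, n in zip(pixels, noise)]
    return noisy, noise

rng = random.Random(0)                      # fixed seed so runs are repeatable
image = [0.0, 0.25, 0.5, 0.75, 1.0]         # hypothetical 5-pixel "image"
alpha_bars = cumulative_signal(linear_beta_schedule(1000))

slightly_noisy, _ = add_noise(image, alpha_bars[0], rng)    # almost the original
mostly_noise, target = add_noise(image, alpha_bars[-1], rng)  # almost pure noise
```

Nothing in this procedure encodes how large the pictured objects are. The network only learns to undo noise on pixel patterns it has seen, which is consistent with the scale errors described here.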

In practical terms, AI models are trained on vast datasets that include many images of skyscrapers and of aircraft carriers, but few that show such objects next to much smaller ones at true scale. Consequently, the models cannot accurately depict relative dimensions, which becomes evident when they are prompted to illustrate contrasting objects. This limitation is not just a technical glitch; it reflects a deeper truth about AI: models lack an internal representation or understanding of the world.
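A rough calculation shows how far apart the objects in the skyscraper and trombone prompt should be. The figures below are illustrative assumptions, not measurements from the article: a 300-meter tower, a trombone about 1.2 meters long, and a 1,024-pixel-tall image in which the tower fills the frame.

```python
SKYSCRAPER_M = 300.0   # assumed height of a tall skyscraper, in meters
TROMBONE_M = 1.2       # assumed length of a trombone, in meters
IMAGE_PX = 1024        # assumed image height, with the skyscraper filling it

def height_in_pixels(object_m, reference_m, reference_px):
    """Scale an object's real size to pixels, using a reference object of known pixel height."""
    return object_m / reference_m * reference_px

trombone_px = height_in_pixels(TROMBONE_M, SKYSCRAPER_M, IMAGE_PX)
size_ratio = SKYSCRAPER_M / TROMBONE_M

print(f"Skyscraper is about {size_ratio:.0f} times taller than the trombone")
print(f"At true scale the trombone would span about {trombone_px:.1f} pixels")
```

Under these assumptions the trombone would occupy roughly four pixels of the frame, so an image that draws the two at similar size is off by a factor of a few hundred.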

For example, in a recent interaction, Gemini was asked whether the year the United States was established was a leap year. While the model correctly applied the leap year rules, it ultimately arrived at an incorrect conclusion, illustrating that these systems lack logical reasoning and rely solely on statistical correlations rather than genuine understanding.
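The leap year rule itself is simple enough to check mechanically. The sketch below assumes the year in question is 1776, which the article does not state; the function name is illustrative.

```python
def is_leap_year(year: int) -> bool:
    """Gregorian rule: divisible by 4, except century years not divisible by 400."""
    return year % 4 == 0 and (year % 100 != 0 or year % 400 == 0)

FOUNDING_YEAR = 1776  # assumption: the article does not name the year

# 1776 is divisible by 4 (4 * 444) and is not a century year, so the rule says "leap year".
print(FOUNDING_YEAR, "leap year" if is_leap_year(FOUNDING_YEAR) else "not a leap year")
```

Python's standard library offers the same check as `calendar.isleap`, which makes the correct answer easy to confirm independently of any model.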

As AI continues to permeate various sectors, it raises critical considerations for both developers and users. The growing prevalence of AI-generated content, which now rivals human-produced articles on the internet, should prompt a careful evaluation of its reliability and implications. The technology holds immense potential for innovation, but the discrepancies in AI outputs highlight the necessity of ongoing scrutiny and oversight.

The conversation around AI’s future remains dynamic, as society grapples with the balance between embracing its capabilities and understanding its limitations. As the technology matures, stakeholders must remain vigilant to ensure AI aligns with ethical standards and supports meaningful human advancement.



AIPRESSA.COM
