Connect with us

Hi, what are you looking for?

Top Stories

AI’s Limitations Exposed: Skyscraper and Trombone Illustrate Lack of Common Sense

AI models, including Google’s Gemini, struggle with understanding scale and context, as shown by consistent errors in size comparisons, despite 800M weekly ChatGPT users relying on them.

Artificial Intelligence (AI) has become an integral part of daily life, impacting a wide array of industries and applications. Despite its impressive capabilities, experts caution that AI fundamentally relies on statistical patterns rather than genuine intelligence. This becomes evident when AI-generated outputs deviate from the data it has been trained on. For instance, when prompted to create an image of a skyscraper and a sliding trombone side-by-side, AI models can produce results where the two objects appear nearly identical in size, raising questions about their understanding of scale and context.

This observation underscores a significant limitation in AI’s learning process. Although models like Google’s Gemini have made strides since the introduction of ChatGPT in November 2022, the technology remains in its infancy, with only three years of development and an extraordinary adoption rate. OpenAI reports that approximately 800 million users engage with ChatGPT weekly, showcasing a profound reliance on AI for various tasks, especially among students, half of whom are frequent users.

The evolving role of AI raises important questions about its value and limitations. While some critics advocate for a pause in AI research due to concerns over potential superintelligent systems, others argue that AI could render traditional education obsolete. This dichotomy reflects the ongoing debate regarding AI’s societal impact and ethical implications.

To illustrate AI’s limitations, the author conducted an experiment by asking generative models to depict two disparate objects and analyze the results. Using a prompt to compare the size of a banana and an aircraft carrier, the AI consistently produced nonsensical images, highlighting its lack of common sense and understanding of spatial relationships. Such outcomes are particularly concerning in light of AI’s ability to perform complex tasks, like passing bar examinations and interpreting medical scans.

The root of these issues lies in the underlying mechanics of AI models. While the theoretical frameworks are established, models like Gemini and its counterparts—such as Mistral and Claude—are built on complex architectures that blend machine learning and diffusion processes. Machine Learning Lifecycles (MLL) enable AI to generate statistical representations of text, while diffusion models generate images by introducing noise to existing images and teaching the network to reverse that process. This complexity is compounded by the evolving nature of user prompts, which can lead to inconsistent outputs over time.

In practical terms, AI models are trained on vast datasets, including numerous images of skyscrapers and aircraft carriers, but often lack comparative representations of the two. Consequently, the models cannot accurately depict relative dimensions, which becomes evident when prompted to illustrate contrasting objects. This limitation is not just a technical glitch; it reflects a deeper truth about AI—models lack an internal representation or understanding of the world.

For example, a recent interaction with Gemini involved a question about the leap year status of the year the United States was established. While the model correctly applied the leap year rules, it ultimately arrived at an incorrect conclusion, illustrating that these systems lack logical reasoning and rely solely on statistical correlations rather than genuine understanding.

As AI continues to permeate various sectors, it raises critical considerations for both developers and users. The growing prevalence of AI-generated content—now rivaling human-produced articles on the internet—should prompt a careful evaluation of its reliability and implications. The technology holds immense potential for innovation, but the discrepancies in AI outputs highlight the necessity of ongoing scrutiny and oversight.

The conversation around AI’s future remains dynamic, as society grapples with the balance between embracing its capabilities while understanding its limitations. As the technology matures, stakeholders must remain vigilant to ensure AI aligns with ethical standards and supports meaningful human advancement.

OpenAI | Google DeepMind | Microsoft | Nvidia

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Generative

Google unveils TurboQuant, achieving a 6x reduction in memory usage and 8x performance boost for large language models, streamlining AI applications.

Top Stories

OpenAI shuts down Sora, its AI video app, just nine months after launch, amid rising concerns over copyright violations and user trust erosion.

AI Marketing

Clickout Media's £40 million revenue strategy transforms reputable news sites into AI-driven casino content hubs, raising serious ethical concerns in journalism.

Top Stories

CrowdStrike's stock dropped 4% to $396.45 as Wedbush forecasts 2026 as a pivotal year for AI, raising concerns over growth versus valuation sustainability.

AI Tools

1min.AI offers a limited-time $85 lifetime subscription, slashing its Advanced Business Plan from $540, unifying writing, visuals, and audio tools for streamlined content creation.

AI Cybersecurity

Intel unveils Core Ultra Series 3 vPro processors featuring AI-driven DTECT security, promising 59% lower CPU usage and 30% faster performance for business PCs.

AI Marketing

Revieve launches AI Skin Advisor for ChatGPT, enabling beauty brands to deliver personalized skincare insights and transform consumer product discovery.

Top Stories

Google's AI tool misleads users by altering headlines without consent, risking credibility and prompting users to consider alternatives like DuckDuckGo and Brave.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.