Connect with us

Hi, what are you looking for?

Top Stories

AI’s Limitations Exposed: Skyscraper and Trombone Illustrate Lack of Common Sense

AI models, including Google’s Gemini, struggle with understanding scale and context, as shown by consistent errors in size comparisons, despite 800M weekly ChatGPT users relying on them.

Artificial Intelligence (AI) has become an integral part of daily life, impacting a wide array of industries and applications. Despite its impressive capabilities, experts caution that AI fundamentally relies on statistical patterns rather than genuine intelligence. This becomes evident when AI-generated outputs deviate from the data it has been trained on. For instance, when prompted to create an image of a skyscraper and a sliding trombone side-by-side, AI models can produce results where the two objects appear nearly identical in size, raising questions about their understanding of scale and context.

This observation underscores a significant limitation in AI’s learning process. Although models like Google’s Gemini have made strides since the introduction of ChatGPT in November 2022, the technology remains in its infancy, with only three years of development and an extraordinary adoption rate. OpenAI reports that approximately 800 million users engage with ChatGPT weekly, showcasing a profound reliance on AI for various tasks, especially among students, half of whom are frequent users.

The evolving role of AI raises important questions about its value and limitations. While some critics advocate for a pause in AI research due to concerns over potential superintelligent systems, others argue that AI could render traditional education obsolete. This dichotomy reflects the ongoing debate regarding AI’s societal impact and ethical implications.

To illustrate AI’s limitations, the author conducted an experiment by asking generative models to depict two disparate objects and analyze the results. Using a prompt to compare the size of a banana and an aircraft carrier, the AI consistently produced nonsensical images, highlighting its lack of common sense and understanding of spatial relationships. Such outcomes are particularly concerning in light of AI’s ability to perform complex tasks, like passing bar examinations and interpreting medical scans.

The root of these issues lies in the underlying mechanics of AI models. While the theoretical frameworks are established, models like Gemini and its counterparts—such as Mistral and Claude—are built on complex architectures that blend machine learning and diffusion processes. Machine Learning Lifecycles (MLL) enable AI to generate statistical representations of text, while diffusion models generate images by introducing noise to existing images and teaching the network to reverse that process. This complexity is compounded by the evolving nature of user prompts, which can lead to inconsistent outputs over time.

In practical terms, AI models are trained on vast datasets, including numerous images of skyscrapers and aircraft carriers, but often lack comparative representations of the two. Consequently, the models cannot accurately depict relative dimensions, which becomes evident when prompted to illustrate contrasting objects. This limitation is not just a technical glitch; it reflects a deeper truth about AI—models lack an internal representation or understanding of the world.

For example, a recent interaction with Gemini involved a question about the leap year status of the year the United States was established. While the model correctly applied the leap year rules, it ultimately arrived at an incorrect conclusion, illustrating that these systems lack logical reasoning and rely solely on statistical correlations rather than genuine understanding.

As AI continues to permeate various sectors, it raises critical considerations for both developers and users. The growing prevalence of AI-generated content—now rivaling human-produced articles on the internet—should prompt a careful evaluation of its reliability and implications. The technology holds immense potential for innovation, but the discrepancies in AI outputs highlight the necessity of ongoing scrutiny and oversight.

The conversation around AI’s future remains dynamic, as society grapples with the balance between embracing its capabilities while understanding its limitations. As the technology matures, stakeholders must remain vigilant to ensure AI aligns with ethical standards and supports meaningful human advancement.

OpenAI | Google DeepMind | Microsoft | Nvidia

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Business

Kilkenny's Tech Thursday event features AI expert John Cleere discussing market shifts and organizational transformation as AI adoption accelerates.

Top Stories

Astera Labs shares surged 18.9% to $170 as analysts project a $202 target, driven by soaring demand for AI connectivity solutions.

AI Regulation

IFR's latest report reveals that by 2030-2035, AI will be integral to most robotic systems, enhancing efficiency as safety and regulatory challenges persist.

Top Stories

Anthropic unveils its ad-free Claude chatbot amid OpenAI's ad rollout for ChatGPT, emphasizing user experience while exploring innovative monetization strategies.

Top Stories

Anthropic's new AI model Claude triggers a $1 trillion decline in major tech stocks, reshaping energy sector investments and AI infrastructure dynamics.

AI Marketing

Destination NSW launches the fully funded "Digital Skills Future Ready" program, offering eight free webinars to elevate AI-driven marketing for tourism operators.

AI Tools

Svedka unveils the first AI-generated Super Bowl ad, leveraging four months of AI training, amid rising tensions with Anthropic over advertising strategies.

AI Finance

Agentic AI transforms finance systems with real-time monitoring and error detection, enabling companies to proactively mitigate risks and enhance operational efficiency.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.