A recent study has revealed that AI chatbots can match or even surpass human performance on certain creative tasks, while simultaneously producing responses that are markedly similar to one another. The findings, reported by “PsyPost,” raise concerns about the potential for these systems to limit the diversity of ideas generated in creative contexts.
Conducted by researchers Emily Wenger and Yoed N. Kenneth, the study aimed to analyze the influence of large language models—foundational technology behind popular AI chatbots—on human creativity. These models generate text by predicting the next word based on extensive training datasets consisting of billions of sentences harvested from books, articles, and web pages.
Wenger, an assistant professor at Duke University, noted that the algorithms powering these chatbots are typically trained on similar datasets sourced from the internet, which raises questions about the uniformity of their outputs. “This suggests common characteristics in their results,” she stated, pondering the broader implications of such uniformity.
The research involved 102 participants recruited via the online platform Prolific, alongside 22 distinct language models developed by companies including Google, Meta, and OpenAI. Each participant completed three standardized tasks designed to assess verbal creativity. The individual performance of the language models was found to be at or slightly above the average human level, creating the illusion of originality when considered in isolation.
However, a stark contrast emerged when comparing results across different models, revealing a significant degree of similarity in their outputs. In all tasks, the models generated responses that were markedly more uniform than those from human participants. This phenomenon was particularly prominent among models developed by the same company, which often shared overlapping vocabulary.
Wenger expressed surprise at the extent of this similarity, stating that while some degree was anticipated, the scale of it was unexpected. To address this issue, the researchers experimented with adjusting the “temperature”—a variable that controls the randomness in text generation. Higher temperature settings resulted in more diverse responses but often led to outputs that were nonsensical or inadequate for creative tasks.
Another strategy involved modifying the instructions given to the models, encouraging them to adopt a more unconventional approach to problem-solving. This slight adjustment yielded a marginal increase in originality for individual responses, yet it failed to fundamentally alter the overarching issue of uniformity.
The implications of these findings suggest that the increasingly popular use of generative AI for ideation and problem-solving may inadvertently constrain the scope of human creativity. Wenger cautioned, “If you use AI chatbots for creative tasks, the results will likely be very similar to those another user would get, even with a different tool. For truly unique content, it is better to avoid relying on such systems.”
The researchers acknowledged certain limitations in their analysis, noting that their evaluation focused solely on specific verbal tasks. Consequently, the results may not be representative of all forms of creativity, such as in visual arts or music. Additionally, the study exclusively examined commercial models, which are designed to adhere to stringent safety and communication guidelines, potentially influencing their behavior.
Future research will investigate other dimensions of creativity, such as fluency and flexibility, as well as explore engineering solutions to mitigate the observed uniformity in AI-generated outputs. As the technology continues to evolve, understanding its limitations will be crucial in ensuring that it enhances rather than restricts human creativity.
See also
AI Study Reveals Generated Faces Indistinguishable from Real Photos, Erodes Trust in Visual Media
Gen AI Revolutionizes Market Research, Transforming $140B Industry Dynamics
Researchers Unlock Light-Based AI Operations for Significant Energy Efficiency Gains
Tempus AI Reports $334M Earnings Surge, Unveils Lymphoma Research Partnership
Iaroslav Argunov Reveals Big Data Methodology Boosting Construction Profits by Billions



















































