AI Model Collapse: Threat to Generative AI

A recent study published in the journal Nature explored "model collapse," showing that artificial intelligence (AI) models can degrade and produce gibberish when trained on data generated by other AI models. This phenomenon poses a serious challenge to the long-term sustainability and reliability of generative AI. The researchers characterized the causes of this degradation and explored potential mitigations.

Study: AI Model Collapse: Threat to Generative AI. Image Credit: chayanuphol/Shutterstock.com

Background

AI models are powerful tools for generating realistic and diverse content, such as text, images, and audio, from large-scale data sources. Generative AI models use statistical learning methods to capture the underlying patterns and distributions of their training data. For example, large language models (LLMs) such as the Generative Pre-trained Transformer series (GPT-2, GPT-3, and GPT-4) can produce coherent and diverse text after training on massive amounts of human-written text from the web. Similarly, diffusion models such as Stable Diffusion create realistic images from descriptive text after training on large collections of images and captions.

However, as generative AI models become more accessible and widely used, AI-generated content on the web is increasing. For example, AI-generated blogs, images, and other content are now common and can be easily created by anyone using online platforms or tools. This raises the question of what happens to generative AI models when trained on data contaminated by their outputs or those of their predecessors.

About the Research

In this paper, the authors investigated the effects of training generative AI models on recursively generated data, meaning data produced by previous generations of the same or similar models. They considered three families of generative models, LLMs, variational autoencoders (VAEs), and Gaussian mixture models (GMMs), and applied them to different domains, including text, images, and synthetic data. They simulated model collapse by training each model on data generated by its predecessor and repeating this cycle over several generations. They also analyzed the sources of error and the theoretical mechanisms behind model collapse.
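The recursive training loop described above can be illustrated with a deliberately simple stand-in for the paper's GMM experiments: fit a Gaussian to a dataset, sample a fresh dataset from the fit, and refit on that sample. This is a minimal sketch, not the authors' actual setup; the function names and parameters (sample size, generation count) are illustrative choices. Because each generation estimates its parameters from a finite sample of the previous model's output, small estimation errors accumulate and the fitted distribution drifts away from the original one.

```python
import random
import statistics

def fit_gaussian(samples):
    """'Train' the model: estimate mean and standard deviation from data."""
    return statistics.mean(samples), statistics.stdev(samples)

def generate(mean, std, n, rng):
    """Sample a synthetic dataset from the fitted model."""
    return [rng.gauss(mean, std) for _ in range(n)]

rng = random.Random(0)
# Generation 0: "human" data drawn from a standard normal distribution.
data = generate(0.0, 1.0, 500, rng)

for gen in range(1, 11):
    mean, std = fit_gaussian(data)        # train on the current dataset
    data = generate(mean, std, 500, rng)  # replace it entirely with model output
    print(f"generation {gen:2d}: mean={mean:+.3f}, std={std:.3f}")
```

Running this shows the estimated mean and standard deviation wandering away from 0 and 1; with smaller sample sizes the drift is faster, which is exactly the statistical approximation error the paper identifies.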

Research Findings

The researchers found that model collapse is a universal phenomenon affecting all the types of generative models they tested. Over successive generations, the models lose information about the original data distribution and become biased toward the most common events. For example, an LLM trained on text generated by another LLM overproduced common words and phrases while forgetting rarer ones. This degradation reduces the model's performance and output quality, as well as its diversity and creativity.
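The loss of rare events can be sketched with a toy unigram "language model" that is repeatedly retrained on its own samples. This is an illustrative construction, not the study's experiment; the vocabulary, probabilities, and sample size are invented for the demonstration. The key mechanism is that a low-probability token which fails to appear even once in one generation's finite sample receives probability zero in the next model and can never reappear.

```python
import random
from collections import Counter

def fit(tokens):
    """'Train' a unigram model: map each token to its observed frequency."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}

def sample(model, n, rng):
    """Generate n tokens from the model's distribution."""
    toks = list(model)
    return rng.choices(toks, weights=[model[t] for t in toks], k=n)

rng = random.Random(42)
# Generation 0: one common token and several increasingly rare ones.
true_probs = {"the": 0.90, "cat": 0.04, "sat": 0.03, "zebra": 0.02, "quark": 0.01}
data = rng.choices(list(true_probs), weights=list(true_probs.values()), k=200)

model = fit(data)
for gen in range(15):
    data = sample(model, 200, rng)  # each generation sees only the last one's output
    model = fit(data)

print("surviving vocabulary:", sorted(model))
# A rare token that misses one generation's sample entirely is gone for good:
# the distribution's tail erodes while the most common tokens dominate.
```

Over enough generations the surviving vocabulary tends to shrink toward the most frequent tokens, mirroring the LLM behavior the researchers observed.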

The study showed that model collapse is inevitable even under otherwise ideal conditions, such as infinite data, perfect expressivity, and no function estimation error. Model collapse was caused by three main sources of error that compound over generations: statistical approximation error, functional expressivity error, and functional approximation error.

These errors arise, respectively, from the finite number of samples, the limited expressiveness of the function approximator, and the limitations of the learning procedure. The researchers demonstrated that model collapse negatively impacted not only the quality and diversity of generated content but also the fairness and robustness of the models.

Applications

This study has important implications for the future of online content and generative AI. It showed that model collapse poses a serious threat to the sustainability and reliability of these models, as they may become corrupted by their own generated content. The authors suggest that access to the original data distribution is crucial for preserving the ability of generative AI models to model low-probability events, which are often relevant to marginalized groups and complex systems.

They also highlight the need to track the provenance of online content and distinguish between human-generated and AI-generated data. Additionally, community-wide coordination and information sharing are essential to prevent or mitigate model collapse.

Conclusion

In summary, this research systematically investigated model collapse in generative AI models, revealing its causes and consequences. The researchers found that recursively training generative AI models on data generated by other models leads to a loss of information and diversity. They provided a theoretical framework and empirical evidence to support their findings. Their research opens a new direction for future studies on the long-term dynamics and stability of generative AI models, as well as the ethical and social implications of their widespread use.

Journal reference:
Shumailov, I., Shumaylov, Z., Zhao, Y., Papernot, N., Anderson, R., & Gal, Y. (2024). AI models collapse when trained on recursively generated data. Nature, 631, 755-759. https://doi.org/10.1038/s41586-024-07566-y

Written by

Muhammad Osama

Muhammad Osama is a full-time data analytics consultant and freelance technical writer based in Delhi, India. He specializes in transforming complex technical concepts into accessible content. He has a Bachelor of Technology in Mechanical Engineering with specialization in AI & Robotics from Galgotias University, India, and he has extensive experience in technical content writing, data science and analytics, and artificial intelligence.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Osama, Muhammad. (2024, July 31). AI Model Collapse: Threat to Generative AI. AZoAi. Retrieved on September 16, 2024 from https://www.azoai.com/news/20240731/AI-Model-Collapse-Threat-to-Generative-AI.aspx.

  • MLA

    Osama, Muhammad. "AI Model Collapse: Threat to Generative AI". AZoAi. 16 September 2024. <https://www.azoai.com/news/20240731/AI-Model-Collapse-Threat-to-Generative-AI.aspx>.

  • Chicago

    Osama, Muhammad. "AI Model Collapse: Threat to Generative AI". AZoAi. https://www.azoai.com/news/20240731/AI-Model-Collapse-Threat-to-Generative-AI.aspx. (accessed September 16, 2024).

  • Harvard

    Osama, Muhammad. 2024. AI Model Collapse: Threat to Generative AI. AZoAi, viewed 16 September 2024, https://www.azoai.com/news/20240731/AI-Model-Collapse-Threat-to-Generative-AI.aspx.


