Exploring ChatGPT’s Performance as an AI-Generated Text Detector

Large language models (LLMs) such as ChatGPT have become ubiquitous for generating text at scale. In a recent paper submitted to the arXiv* server, researchers explored ChatGPT’s performance as a detector for artificial intelligence (AI)-generated text, inspired by its role as a data labeler. The study aimed to evaluate ChatGPT’s zero-shot performance in distinguishing human-written text from AI-generated text, with findings suggesting potential applications in automated detection pipelines.

Study: Exploring ChatGPT’s Performance as an AI-Generated Text Detector. Image credit: Mr.Cheangchai Noojuntuk/Shutterstock
Study: Exploring ChatGPT’s Performance as an AI-Generated Text Detector. Image credit: Mr.Cheangchai Noojuntuk/Shutterstock

*Important notice: arXiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, guide clinical practice/health-related behavior, or treated as established information.

Background

LLMs are deep neural networks that model natural language using transformer-based architectures. They are of two types: autoregressive language models predict the next word based on previous tokens, while masked language models are bidirectional and predict masked tokens in a sequence. Autoregressive models excel at language generation, while masked models are better at language understanding due to their bi-directional attention mechanisms. LLMs are trained on extensive internet-scale data and use various sampling strategies, such as greedy, top-k, and nucleus sampling, to generate text one token at a time.

The advancement of LLMs capable of producing high-quality, human-like text has revolutionized various tasks by assisting humans. However, the widespread accessibility of such models has raised concerns about potential misuse by malicious actors. There is a risk of these actors using LLMs to spread misinformation, create fake websites, and generate misleading content. Additionally, inexperienced users may overestimate the capabilities of these models, leading to flawed outputs and serious consequences in critical situations.

Related work

Recent language models, including ChatGPT and Generative Pre-trained Transformer 4 (GPT-4), have displayed impressive performance across various natural language processing (NLP) tasks, such as natural language inference, sentiment analysis, and fact-checking. They have also been evaluated as annotators and controllers for AI tasks. However, evidence suggests that ChatGPT might not perform as well in subjective NLP tasks. The increasing prevalence of LLMs and conversational AI assistants has normalized the use of AI-generated text, leading to growing concerns about potential misuse.

Consequently, research on detecting AI-generated text has garnered significant attention, with various computational methods explored, including feature-based, statistical, and fine-tuned language models. Commercial AI content detectors, such as OpenAI and ZeroGPT detectors, have also been introduced. Additionally, watermarking techniques for embedding indistinguishable artifacts have been studied for detection purposes.

ChatGPT as an AI text-detector

Using ChatGPT as an AI-text detector to determine if ChatGPT can distinguish between human-written and AI-generated text. The TuringBench dataset provides AI-generated text from 19 different generators and human-written news articles from Cable News Network (CNN) and The Washington Post. The experiments involve ChatGPT and GPT-4 as detectors with specific prompts for classification. The temperature parameter is set to zero for stable output. The results classify each article into ‘human-written,’ ‘AI-generated,’ or ‘unclear’ categories. The experiments are conducted on the test split of the datasets, with GPT-4 limited to 500 samples due to rate constraints.

Study Results

The study focused on ChatGPT's ability to detect AI-generated text and yielded interesting findings. It struggled to identify AI-generated text from various generators, performing satisfactorily only with GPT-1 and a few others. As model size increased, false negatives also grew, indicating larger models produced more ‘human-like’ text. Conversely, GPT-4 excelled at detecting AI-generated text but faced challenges differentiating it from human-written text, leading to misclassifications and raising concerns about its reliability.

These results shed light on the complexities of AI text detection and highlight important considerations about the reliability of state-of-the-art language models. While ChatGPT struggles with AI-generated text, GPT-4’s overconfidence in labeling everything as ‘AI-generated’ can lead to misleading outcomes. Ongoing research and vigilance are necessary to address the limitations and challenges posed by advanced language models in distinguishing human and AI-generated text effectively.

Additional experiments assessed ChatGPT and GPT-4's sensitivity to human-written text styles from various datasets. ChatGPT consistently identified human-written text across different sources, with only a small fraction misclassified as AI-generated. However, GPT-4 exhibited varying performance, performing well on some datasets such as NeuralNews (The New York Times) and the Internet Movie Database (IMDb) texts but struggling with human texts from TuringBench, possibly due to dataset-specific characteristics and noise. This indicated that GPT-4 is less reliable in identifying human-written text than ChatGPT, confirming previous studies' findings.

Performance on ChatGPT-generated text: To evaluate ChatGPT-generated text, a new dataset called ChatNews was created, mimicking TuringBench. The results showed ChatGPT misclassified over 50% of ChatNews articles as human-written, whereas GPT-4 correctly identified around 38%. This suggests GPT-4’s potential for detecting text from older models such as ChatGPT.

Conclusions

In summary, researchers explored ChatGPT’s ability to detect AI-generated text and found it struggles with this task while performing well on human-written text. This asymmetry in performance could be utilized to build detectors that focus on identifying human-written text, indirectly solving the AI-generated text detection problem. The experiments showed that ChatGPT outperformed GPT-4 in identifying AI-generated vs. human-written text and GPT-4’s sensitivity to noise and dataset artifacts, affecting its reliability. Future work will investigate the reasons behind this performance difference, exploring the impact of the training data and leveraging ChatGPT’s capability for automated detection pipelines using few-shot prompting and ensemble methods.

*Important notice: arXiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, guide clinical practice/health-related behavior, or treated as established information.

Journal reference:
Dr. Sampath Lonka

Written by

Dr. Sampath Lonka

Dr. Sampath Lonka is a scientific writer based in Bangalore, India, with a strong academic background in Mathematics and extensive experience in content writing. He has a Ph.D. in Mathematics from the University of Hyderabad and is deeply passionate about teaching, writing, and research. Sampath enjoys teaching Mathematics, Statistics, and AI to both undergraduate and postgraduate students. What sets him apart is his unique approach to teaching Mathematics through programming, making the subject more engaging and practical for students.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Lonka, Sampath. (2023, August 06). Exploring ChatGPT’s Performance as an AI-Generated Text Detector. AZoAi. Retrieved on July 06, 2024 from https://www.azoai.com/news/20230806/Exploring-ChatGPTe28099s-Performance-as-an-AI-Generated-Text-Detector.aspx.

  • MLA

    Lonka, Sampath. "Exploring ChatGPT’s Performance as an AI-Generated Text Detector". AZoAi. 06 July 2024. <https://www.azoai.com/news/20230806/Exploring-ChatGPTe28099s-Performance-as-an-AI-Generated-Text-Detector.aspx>.

  • Chicago

    Lonka, Sampath. "Exploring ChatGPT’s Performance as an AI-Generated Text Detector". AZoAi. https://www.azoai.com/news/20230806/Exploring-ChatGPTe28099s-Performance-as-an-AI-Generated-Text-Detector.aspx. (accessed July 06, 2024).

  • Harvard

    Lonka, Sampath. 2023. Exploring ChatGPT’s Performance as an AI-Generated Text Detector. AZoAi, viewed 06 July 2024, https://www.azoai.com/news/20230806/Exploring-ChatGPTe28099s-Performance-as-an-AI-Generated-Text-Detector.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoAi.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
JASCO: Advancing Music Generation from Text Descriptions