Skip to main content Skip to navigation

AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics

Project Overview

The document explores the role of generative AI, specifically ChatGPT, in educational settings by evaluating its performance in discussing controversial topics on platforms such as Kialo. It emphasizes the advancements in AI models that enhance their ability to moderate responses, aiming to provide balanced perspectives while minimizing biases. The study compares the quality of AI-generated answers with those produced by humans, revealing notable improvements in the AI's capacity to engage with complex issues. Despite these advancements, the findings indicate persistent concerns regarding biases, especially in socio-political discussions and philosophical contexts. Overall, the document underscores the potential of generative AI as a tool for facilitating dialogue in education, while also highlighting the need for continued scrutiny of its limitations and biases in order to ensure fair and equitable learning environments.

Key Applications

ChatGPT

Context: Educational context focusing on controversial topics for debate among users on platforms like Kialo; target audience includes students and educators engaged in discussions about ethics, politics, and social issues.

Implementation: AI models were queried with controversial topics from Kialo to assess their responses and biases.

Outcomes: Improved moderation in responses, reduced explicit biases in newer models, and a general alignment with human knowledge levels on most topics.

Challenges: Persistent implicit biases in socio-political topics and lower performance in the philosophy domain.

Implementation Barriers

Bias in AI Responses

Generative AI models may exhibit biases towards certain political or social perspectives, leading to uneven representation of viewpoints.

Proposed Solutions: Implementing stricter moderation policies and continuous retraining of models to recognize and reduce implicit biases.

Complexity of Topics

Controversial topics require nuanced understanding and responses, which can be difficult for AI to achieve consistently.

Proposed Solutions: Utilizing prompt engineering to guide AI in providing balanced perspectives and encouraging critical thinking.

Project Team

Vahid Ghafouri

Researcher

Vibhor Agarwal

Researcher

Yong Zhang

Researcher

Nishanth Sastry

Researcher

Jose Such

Researcher

Guillermo Suarez-Tangil

Researcher

Contact Information

For information about the paper, please contact the authors.

Authors: Vahid Ghafouri, Vibhor Agarwal, Yong Zhang, Nishanth Sastry, Jose Such, Guillermo Suarez-Tangil

Source Publication: View Original PaperLink opens in a new window

Project Contact: Dr. Jianhua Yang

LLM Model Version: gpt-4o-mini-2024-07-18

Analysis Provider: Openai

Let us know you agree to cookies