Stochastic Parrots Case Study

Case study 3: Opening up Twitter conversations for controversy analysis
By Matias Valderrama Barragan with Greta Timaite and Iain Emsley
The Stochastic Parrots Twitter data set is a database of tweets that were collected by the Shaping AI research project in order to capture public controversy surrounding the research paper “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?” by Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret ShmitchellLink opens in a new window. The paper became the subject of extensive debate online, especially after one of its authors, Timnit Gebru, was ousted from her job as Co-lead of the Ethical AI Research Team at Google AI following internal disagreements about its publication.
The Stochastic Parrots Twitter dataset was captured by the CIM RSE team to support AI controversy analysis as part of the international social research project Shaping AILink opens in a new window. Based on an online consultation of UK-based experts in AI and society (Marres et al., 2024, p. 2)Link opens in a new window, the Shaping AI research team at the University of Warwick led by prof. Noortje Marres selected five notable research controversies around AI and society that had taken place between 2012 and 2022 for further study. The Stochastic Parrot controversy was one of the most frequently mentioned by UK experts during the expert consultation. Follow-up interviews with selected experts confirmed that the social media platform Twitter served as one of the main stages, or ‘primary settings,’ for this public controversy about AI, hence the relevance of studying its traces on this platform.