Skip to main content Skip to navigation

CIM Tools



LE-CAT is a Lexicon-based Categorization and Analysis Tool developed by the Centre for Interdisciplinary Methodologies in collaboration with the Media of Cooperation Group at the University of Siegen.

The tool allows you to apply a set of word queries associated with a category (a lexicon) to a data set of textual sources (the corpus). LE-CAT determines the frequency of occurrence for each query and category in the corpus, as well as the relations between categories (co-occurrence) by source.

The tool also allows you to quickly generate the data for lexicon-based analysis, by extracting descriptions from the Youtube API for URLs provided by the user.

The purpose of this technique is to automate and scale up user-led data analysis as it allows the application of a custom-built Lexicon to large data sets. The quick iteration of analysis allows the user to refine a corpus and deeply analyse a given phenomenon.

LE-CAT was coded by James Tripp. It has been used to support the workshop Youtube as Test Society (University of Siegen) and the Digital Test of the News (University of Warwick) and will soon be tested by students on the MA Module Digital Objects, Digital Methods.

Academic correspondence should be sent to Noortje Marres.