
Digital Humanities

This guide provides an introduction to digital humanities (DH) theory and practice and an overview of DH methods, tools, and resources.


What is Text Mining?

Also see the UC Berkeley Library Research Guide on text mining, which provides helpful resources on where to find datasets to use, legal and ethical considerations, and citation help. 

Text Mining and Data Tools Available Online

Text and Data Mining Tools (Free)

"Constellate enables you to easily and confidently incorporate text analysis into your curriculum. Whether you're new to the practice or a seasoned pro, our user-friendly software and pedagogical approach will meet your educational needs."

Text analysis, sometimes referred to as text mining, is the automated process of understanding and sorting unstructured text, making it easier to manage. Text analysis tools are often used to gain valuable insights from social media comments, survey responses, and online reviews.

Screengrab from MonkeyLearn
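
To make "understanding and sorting unstructured text" more concrete, here is a minimal sketch of one common first step: splitting text into words and counting them. It uses only the Python standard library. The sample reviews, the short stopword list, and the `tokenize` helper are all hypothetical and chosen for illustration; they are not taken from any of the tools listed in this guide.

```python
import re
from collections import Counter

# Hypothetical sample of unstructured text (e.g. online reviews).
reviews = [
    "Great reading room, great staff!",
    "The catalog search is confusing.",
    "Staff were helpful; the search was slow.",
]

# A tiny illustrative stopword list; real projects use much longer ones.
STOPWORDS = {"the", "is", "was", "were", "a", "an", "and"}


def tokenize(text):
    """Lowercase the text and return its words, minus stopwords."""
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w not in STOPWORDS]


# Count word frequencies across all reviews.
counts = Counter(word for review in reviews for word in tokenize(review))

# Show the most frequent words.
print(counts.most_common(3))
```

Word counts like these are usually only a starting point; dedicated tools add steps such as stemming, phrase detection, or sentiment scoring.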


Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.
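
As a rough illustration of how Pandas might be used for a small text-mining task, the sketch below counts words across a handful of survey responses. The sample data and the column names (`respondent`, `text`, `word`) are assumptions made for this example; it requires the `pandas` package to be installed.

```python
import pandas as pd

# Hypothetical sample data: three short survey responses.
responses = pd.DataFrame({
    "respondent": ["a", "b", "c"],
    "text": [
        "The archive was easy to search",
        "Search was slow",
        "Easy to use and easy to search",
    ],
})

# Lowercase each response, split it into words, and put one word per row.
words = responses.assign(
    word=responses["text"].str.lower().str.split()
).explode("word")

# Count how often each word occurs across all responses.
word_counts = words["word"].value_counts()
print(word_counts.head())
```

Keeping one word per row makes it straightforward to group the counts by respondent or to join them against other tables later.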

Free open source software to analyze and process your texts visually.

Tools for Data Cleaning and Processing (Free)

Free integrated workflow of pre-processing, analysis, and visualization tools for finding and exploring patterns in texts.

OpenRefine is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
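
One kind of cleaning that helps with messy data is grouping values that differ only in case, accents, punctuation, or word order. The sketch below is loosely modelled on the key-collision ("fingerprint") style of clustering that OpenRefine offers. It is a simplified illustration in standard-library Python, not OpenRefine's own code, and the `fingerprint` function and sample names are hypothetical.

```python
import re
import unicodedata
from collections import defaultdict


def fingerprint(value):
    """Build a normalized key so that near-duplicate spellings collide."""
    # Strip accents by decomposing characters and dropping non-ASCII marks.
    decomposed = unicodedata.normalize("NFKD", value)
    ascii_text = decomposed.encode("ascii", "ignore").decode("ascii")
    # Lowercase, then replace punctuation with spaces.
    cleaned = re.sub(r"[^\w\s]", " ", ascii_text.lower())
    # Sort the unique tokens so that word order does not matter.
    return " ".join(sorted(set(cleaned.split())))


# Hypothetical messy column of author names.
names = [
    "Brontë, Charlotte",
    "charlotte bronte",
    "Charlotte  Bronte.",
    "Emily Brontë",
]

# Group the original values by their fingerprint key.
clusters = defaultdict(list)
for name in names:
    clusters[fingerprint(name)].append(name)

for key, members in clusters.items():
    print(key, "->", members)
```

Values that share a key are candidates for merging; a person should still review each group, since different entities can occasionally produce the same key.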