Skip to Main Content
Go to Penn Libraries homepage   Go to Guides homepage
Banner: RDDS; Research Data & Digital Scholarship displayed between 3D mesh surfaces

Text Analysis

A guide to text mining tools and methods

Software for Text Analysis

Once you have built your corpus, you will need to use specialized software to analyze it. Different kinds of software are suited to different disciplines and research questions. The software listed below do not require programming language knowledge.

  • Google NGram Viewer: Google Ngram Viewer is a tool that allows you to explore language usage trends over time.
  • Google Pinpoint: Part of Google’s Journalist Studio, search keywords and identify entities in large amounts of text.
  • Voyant Tools: Voyant tool is an open-source, web-based text reading and analysis environment.
  • AntConc: (Tutorial) A freeware corpus analysis toolkit for concordancing and finding clusters (frequency patterns of word sequences) or n-grams (sequences of n words within your corpus or document).
  • MALLET:  MAchine Learning for LanguagE Toolkit is a Java programming language-based software for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. 
Penn Libraries Home Search the Catalog
(215) 898-7555