Skip to main content

Distant Reader Gateway

Licensed according to this deed.

Published on


The Distant Reader is a tool for reading.The Distant Reader empowers you to use & understand large amounts of textual information both quickly & easily. For example, the Distant Reader can consume the entire issue of a scholarly journal, the complete works of a given author, or the content found at the other end of an arbitrarily long list of URLs. Thus, the Distant Reader is akin to a book's table-of-contents or back-of-the-book index but at scale. It simplifies the process of identifying trends & anomalies in a corpus, and then it enables you to further investigate those trends & anomalies.The Distant Reader is a system which locally harvests/caches content you specify. It then transforms the content into plain text, performs sets of natural language processing & text mining against the text, saves the results in a number of formats, reduces the whole to a cross-platform database file, queries the database thus summarizing the collection, zips the results of the entire process into a single file, and makes the file available to you for further investigation -- "reading".


Morgan, Eric Lease. (2020, April 10). Distant Reader (Version Alpha). Zenodo.


Eric Lease Morgan Room 250-E, Hesburgh Libraries University of Notre Dame Notre Dame, IN 46556 +1 (574) 631-8604



Distant Reader Code:
List of Software used in Distant reader:



Cite this work

Researchers should cite this work as follows:

  • (2020), "Distant Reader Gateway,"

    BibTex | EndNote