Skip to Main Content
Purdue University Purdue Logo Purdue Libraries

Data Scholarship in Business and Economics

Get started with data scholarship in Business and Economics

Text and Data Mining

Overview

  • Text and Data Mining (TDM) "refers to the process of discovering useful patterns in very large databases. It uses methods from statistics, machine learning, and database management to restructure and analyze data in ways that permit knowledge or information to be extracted from the material" according to Sage Campus. They provide nearly 50 short videos to help you get started with TDM.

Caveats

  • Most Purdue Libraries license agreements with database vendors prohibit systematic downloading of content for TDM.
  • Purdue Libraries does not subscribe to the following TDM subscription services sometimes used by Business & Economics researchers:
    • LexisNexis REST API
    • ProQuest services including Congressional Record TDM, Historical Newspapers TDM, and TDM Studio
  • While researchers can bootstrap their own TDM solutions, systematic downloading of content can result in litigation even if done for academic purposes.

TDM Via Purdue