Answered By: James Adams
Last Updated: Jun 16, 2023     Views: 717

Several of Harvard Library's databases allow text and data mining for research purposes under certain conditions: 

ProQuest

A large amount of content that Harvard subscribes to through ProQuest is available through their Text Data Mining platform (contact a librarian to learn more). Some highlights include:

Gale NewsVault 

  • British Library Newspapers, parts I-V, 1800-1950
  • Daily Mail Historical Archive 
  • Economist Historical Archive, 1843-2012 
  • Times Digital Archive, 1785-2006

JSTOR

Some JSTOR text-analysis functionality is available using their Constellate service, though programmatic access to the data itself is not available to Harvard users.

HathiTrust

HathiTrust allows text mining, subject to certain conditions, via the HathiTrust Research Center (HTRC). The HathiTrust policies are posted at http://www.hathitrust.org/datasets.

ScienceDirect 

Elsevier allows some text mining of content in its ScienceDirect database that Harvard subscribes to. For details on Elsevier's policy, as well instructions for accessing and using their API, please consult the Elsevier website

Please contact an HKS Librarian for more information about text-mining the above resources.