
Notebooks for the NEH TAPI Workshop, "How to Do Things with Topic Models."


jingxu1983/TAPI_Topic_Models

 
 


TAPI Topic Models

Notebooks for the NEH TAPI Workshop, "How to Do Things with Topic Models."


Get Corpus Data

Corpus data files are CSV files in a specific format (machine learning corpus format). A collection of them has been prepared for this workshop. Download them, then upload them to the ./corpora folder in your Binder repository. Sorry that the process is not more direct!

Corpus data may be downloaded from the following shared Dropbox link:

Additionally, these files may be downloaded individually:

  • Wine Reviews — A collection of terse wine reviews.
  • JSTOR Hyperparameter — Abstracts from a JSTOR search for "hyperparameter."
  • Tamilnet — A sample of news stories from the website Tamilnet.
  • Anphoblacht — A sample of news stories from the website Anphoblacht.

Each link goes to a Dropbox item that has a download link. Download the file to your desktop and then upload it to the ./corpora directory.
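Once the upload is done, a quick check from a notebook cell can confirm that the files landed in the right place. The following is a minimal sketch, assuming the notebook runs from the repository root so that ./corpora resolves correctly. The helper name list_corpus_files is hypothetical and is not part of the workshop notebooks.

```python
from pathlib import Path


def list_corpus_files(corpora_dir="./corpora"):
    """Return the sorted names of the CSV files found in corpora_dir.

    Hypothetical helper for illustration only; the workshop notebooks
    may locate corpus files differently.
    """
    folder = Path(corpora_dir)
    if not folder.is_dir():
        # Folder missing: nothing uploaded yet, or the path is wrong.
        return []
    return sorted(p.name for p in folder.glob("*.csv"))


found = list_corpus_files()
if found:
    print("Corpus files found:", ", ".join(found))
else:
    print("No CSV files found in ./corpora; upload them before continuing.")
```

An empty result usually means the files went to a different folder or the notebook's working directory is not the repository root.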

Get Sample Output Data

The notebooks in this workshop generate a digital analytical edition from a given source corpus file. The results of the various analytical processes are put in the ./db directory. A demonstration edition is provided for one of the notebooks. To get the demo data, download the files with the prefix jstor_hyperparameter_demo and then upload them to your ./db directory.
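To confirm that the demo files arrived, the ./db directory can be scanned for the prefix mentioned above. This is a sketch under the same assumption that the notebook runs from the repository root. The function name find_demo_files is hypothetical, and no assumption is made about how many demo files there are or what extensions they carry.

```python
from pathlib import Path

# Prefix taken from the instructions above.
DEMO_PREFIX = "jstor_hyperparameter_demo"


def find_demo_files(db_dir="./db", prefix=DEMO_PREFIX):
    """Return the sorted names of files in db_dir whose names start with prefix.

    Hypothetical helper for illustration only.
    """
    folder = Path(db_dir)
    if not folder.is_dir():
        # No ./db directory yet, so no demo data can be present.
        return []
    return sorted(
        p.name
        for p in folder.iterdir()
        if p.is_file() and p.name.startswith(prefix)
    )


demo_files = find_demo_files()
print(f"{len(demo_files)} demo file(s) found in ./db")
for name in demo_files:
    print(" -", name)
```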

