Finding Data (including legacy versions)

srdee edited this page Aug 4, 2015 · 10 revisions

#Basic Overview

This page assumes you know the legacy Perseus document name and/or the standard common name of a particular primary source file and are trying to find its URN and/or its location in the CVS repo, or the other way around. For secondary sources the process is much the same, but please also see the wiki page dedicated to secondary sources.

Some resources that are most frequently helpful in finding files and document names include:

#Canonical repos

PerseusDL/canonical was the first public GitHub repository home for the TEI XML texts of the Perseus Digital Library.

As our strategy for working with the texts through GitHub evolved and the repository grew, we decided to move the texts into individual repositories, subdivided by the CTS namespace to which each text has been assigned. In general, the namespace corresponds to the language in which the work was originally transmitted.

Greek works are now in http://github.com/PerseusDL/canonical-greekLit

Latin works are now in http://github.com/PerseusDL/canonical-latinLit

Anglo-Saxon works are now in http://github.com/PerseusDL/canonical-angLit

Italian works are now in http://github.com/PerseusDL/canonical-itaLit

Norse works are now in http://github.com/PerseusDL/canonical-norseLit

Farsi works are now in http://github.com/PerseusDL/canonical-farsiLit
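
The list above can be captured as a small lookup table. This is only a sketch: the namespace keys are inferred from the repository names listed here, the function name `repo_for_urn` is hypothetical, and it assumes the usual CTS URN layout in which the namespace is the third colon-separated field.

```python
# Namespace -> repository, copied from the list above.
# The keys are inferred from the repository names (assumption).
REPOS = {
    "greekLit": "http://github.com/PerseusDL/canonical-greekLit",
    "latinLit": "http://github.com/PerseusDL/canonical-latinLit",
    "angLit": "http://github.com/PerseusDL/canonical-angLit",
    "itaLit": "http://github.com/PerseusDL/canonical-itaLit",
    "norseLit": "http://github.com/PerseusDL/canonical-norseLit",
    "farsiLit": "http://github.com/PerseusDL/canonical-farsiLit",
}


def repo_for_urn(urn: str) -> str:
    """Return the repository URL for the namespace of a CTS URN.

    Hypothetical helper; assumes a URN of the form urn:cts:NAMESPACE:...
    """
    parts = urn.split(":")
    if len(parts) < 3 or parts[:2] != ["urn", "cts"]:
        raise ValueError(f"not a CTS URN: {urn!r}")
    namespace = parts[2]
    if namespace not in REPOS:
        raise ValueError(f"no known repository for namespace {namespace!r}")
    return REPOS[namespace]
```

Since repository locations are subject to change, a table like this would need to be kept in step with the list on this page.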

For now, secondary sources and reference works can be found in http://github.com/PerseusDL/canonical-pdlrefwk, but this is subject to change. For more on this, see Secondary Sources.

If you are unsure where to find a work you are interested in, please use the Perseus Catalog. The Catalog interface prominently displays the CTS URN for each edition or translation of a work. The file structure in our GitHub text repositories currently adheres to the following pattern:

canonical-NAMESPACE/data/TEXTGROUP/WORK/TEXTGROUP-WORK-VERSION.xml

More information on the CTS identifier structure of the Perseus texts can be found in the Catalog documentation.
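
As a rough illustration of how a URN from the Catalog relates to the path pattern above, the following sketch builds the expected path from a CTS URN. It makes several assumptions: the function name `cts_urn_to_path` is hypothetical, the URN is taken to have the form `urn:cts:NAMESPACE:TEXTGROUP.WORK.VERSION`, and the filename parts are joined with hyphens exactly as the pattern is written. If a repository separates the filename parts differently, pass a different `sep`.

```python
def cts_urn_to_path(urn: str, sep: str = "-") -> str:
    """Build a repository-relative path for a CTS URN (hypothetical helper).

    Follows the pattern
    canonical-NAMESPACE/data/TEXTGROUP/WORK/TEXTGROUP-WORK-VERSION.xml
    `sep` is the separator used inside the filename (assumption: "-",
    as the pattern is written; check the repository and adjust).
    """
    parts = urn.split(":")
    if len(parts) < 4 or parts[:2] != ["urn", "cts"]:
        raise ValueError(f"not a CTS URN: {urn!r}")
    namespace = parts[2]
    # Any passage reference after the work component is ignored here.
    work_parts = parts[3].split(".")
    if len(work_parts) != 3:
        raise ValueError("expected TEXTGROUP.WORK.VERSION in the URN")
    textgroup, work, version = work_parts
    filename = sep.join([textgroup, work, version]) + ".xml"
    return f"canonical-{namespace}/data/{textgroup}/{work}/{filename}"


if __name__ == "__main__":
    print(cts_urn_to_path("urn:cts:greekLit:tlg0012.tlg001.perseus-grc1"))
```

A path built this way is only a convenience for locating a file in a checkout; it is not a stable identifier.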

Note that all GitHub file locations are subject to change, and that URLs to the GitHub files should NOT be used as permanent stable identifiers for the Perseus texts. Information on where and how to find stable identifiers for the Perseus texts is provided at http://sites.tufts.edu/perseusupdates/beta-features/perseus-stable-uris/ and http://sites.tufts.edu/perseuscatalog/documentation/user-guide/catalogdata-uris/