Skip to content

NLP application on Latin/Spanish 16th-17th century text corpora

Notifications You must be signed in to change notification settings

AlbaCili/School-of-Salamanca

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

School-of-Salamanca-word-frequencies

Word frequencies in the Digital Collection of Sources in the Works of the School of Salamanca.

This project aims to give a quantitive analysis of lemmas extracted from a digital comparable corpus. The content of this data collection is related to law, politics, religion, and ethics, which are the core of early modern discourse of the School of Salamanca. These early modern texts serve to analyze the history of the Salamanca School's origins and influence, as well as its internal discourse contexts within the context of the future.

This objective was achieved through the use of NLP applications, such as lemmatization and frequency distribution of lemmas, extracted from 16th-17th century Latin and Spanish works.

This repository displays the results, namely, 100 most common lemmas, obtained from the applied methodology, as long as the list of stop words and corrected Latin/Spanish names, which were used in the normalization process, the text format and python codes, through which the data collection was processed.

MIT License

This license is valid for the following folders:

  1. stopwords
  2. Works-spanish
  3. Works-latin
  4. Results
  5. Correct_Lemmas-EsLa_names

The NLP_codes folder is protectd under CC Copyright license. For more information check the pdf file Work_Copyrights.pdf in the NLP_codes folder, located in the word-frequency branch.

Copyright (c) 2023 Cindy Rico Carmona

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

About

NLP application on Latin/Spanish 16th-17th century text corpora

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published