All materials from our involvement in the FutureTDM project.
All materials in this repository were produced within FutureTDM (The Future of Text and Data Mining), an EU Horizon2020 research project with participation of Open Knowledge International and ContentMine.
The main outcomes are:
- three tutorials about specific use-cases of text data mining techniques
- one workshop
- one presentation of the outcomes at a conference
All content and data is licensed under the Creative Commons Attribution 4.0 International License. All code is under the MIT license.
To do text data mining with the ContentMine software you need two things:
- Install the ContentMine software. Find out more in installation.md.
- Learn about text data mining. As preparation, we recommend looking at the resources list in installation.md.
We worked out three different use-cases to show the power of text data mining with our software.
Use text data mining to get an overview of the research around the Zika virus. How did the research field evolve over the last decades? Which authors and journals contributed most, and how are they connected? Dive into the data and publications to get a better understanding of the state of the research, and have a look at the species mentioned.
Go to the Zika Tutorial.
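To give a feel for the questions above, here is a minimal Python sketch. It is not the tutorial's code and does not use the ContentMine software: the `papers` records, their field names, and the `summarize` helper are hypothetical stand-ins for whatever publication metadata you have collected.

```python
from collections import Counter
from itertools import combinations

# Hypothetical metadata records (made-up values, not real Zika data).
papers = [
    {"year": 2015, "journal": "Journal A", "authors": ["Ada", "Ben"]},
    {"year": 2016, "journal": "Journal B", "authors": ["Ada", "Cy"]},
    {"year": 2016, "journal": "Journal A", "authors": ["Ada"]},
]


def summarize(records):
    """Count publications per year, journal and author, plus co-author pairs."""
    per_year = Counter(r["year"] for r in records)
    per_journal = Counter(r["journal"] for r in records)
    per_author = Counter(a for r in records for a in r["authors"])
    # Each unordered pair of authors on the same paper is one connection.
    coauthors = Counter(
        tuple(sorted(pair))
        for r in records
        for pair in combinations(r["authors"], 2)
    )
    return per_year, per_journal, per_author, coauthors


per_year, per_journal, per_author, coauthors = summarize(papers)
print(sorted(per_year.items()))    # how the field evolved over time
print(per_journal.most_common(1))  # most active journal
print(per_author.most_common(1))   # most active author
print(dict(coauthors))             # who published with whom
```

The co-author counts can be fed into a graph library if you want to visualize the connections.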
(soon to come...)
Filter and find relevant publications to support a systematic review around your research question, in a fully open and reproducible way.
Go to the Systematic Literature Review Tutorial.
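As a rough illustration of the filtering idea, the Python sketch below screens records by inclusion and exclusion terms. It is an assumption-laden example, not the tutorial's method: the `records` data, the field names, and the `matches` helper are all hypothetical.

```python
# Hypothetical records (made-up titles and abstracts).
records = [
    {"id": "p1", "title": "Zika virus transmission by mosquitoes",
     "abstract": "We study Aedes vectors."},
    {"id": "p2", "title": "Editorial: Zika response",
     "abstract": "An opinion piece."},
    {"id": "p3", "title": "Dengue surveillance",
     "abstract": "No relation."},
]


def matches(record, include, exclude=()):
    """Return True if any include term occurs and no exclude term occurs.

    Matching is a simple case-insensitive substring search over
    title and abstract.
    """
    text = (record["title"] + " " + record["abstract"]).lower()
    has_included = any(term.lower() in text for term in include)
    has_excluded = any(term.lower() in text for term in exclude)
    return has_included and not has_excluded


# Keeping the terms in plain lists makes the screening step easy to
# publish alongside the results, so others can rerun it.
include_terms = ["zika"]
exclude_terms = ["editorial"]

selected = [r["id"] for r in records if matches(r, include_terms, exclude_terms)]
print(selected)
```

Substring matching is deliberately crude; a real review would likely need stemming or manual screening on top of it.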
Date: 29th of March 2017
Location: EU Parliament, Brussels
Go to the documentation.
Date: 6th of June 2017
Location: 21st ELPUB Conference at Limassol, Cyprus
Go to the documentation.
Date: 13th of June 2017
Location: University of Applied Sciences in Salzburg, Austria
Go to the documentation.
All content is openly licensed under the Creative Commons Attribution 4.0 license, unless otherwise stated.
All source code is free software: you can redistribute it and/or modify it under the terms of the MIT License. Visit http://opensource.org/licenses/MIT to learn more about the MIT License.
In the spirit of free software, everyone is encouraged to help improve the content created and curated here.
Here are some ways you can contribute:
- by reporting bugs
- by suggesting new sections
- by translating to a new language
- by writing or editing documentation
- by analyzing the data
- by visualizing the data
- by writing code (no pull request is too small: fix typos in the user interface, add code comments, clean up inconsistent whitespace)
- by refactoring code
- by closing issues
- by reviewing pull requests
- by enriching the data with other data sources
When you are ready, submit a pull request.
We use the GitHub issue tracker to track bugs and features. Before submitting a bug report or feature request, check to make sure it hasn't already been submitted. When submitting a bug report, please try to provide a screenshot that demonstrates the problem.
FutureTDM
- Tutorials
- Workshops
ContentMine
- Materials: Software tutorials, training guidelines and training modules for ContentMine.
- pyCProject: Python wrapper for CProject.
- Dictionaries
- Discourse