Physalia courses: Big Data Biogeography

Date & time: February 1st - February 5th

Times (CET):

9:00 - 12:00 Exercises (teachers available for questions via slack)
13:00 - 17:00 Lectures, demonstrations and group work (synchronous)

Location: on-line

Teachers: Alexander Zizka, German Center for Integrative Biodiversity research & Daniele Silvestro, University of Fribourg, Switzerland

Schedule

The course consists of five days with different topics. You can find a detailed schedule in the overview tab for each day.

Day 1 - Biodiversity databases. Introduction to different types of large-scale biodiversity data and methods for reproducible data retrieval.
Day 2 - Data quality and processing. Understanding common issues with species distribution data from large-scale biodiversity databases and introduction to methods to address them.
Day 3 - Historical biogeography
Day 4 - Big data conservation assessment
Day 5 - Fossil biogeography

Important

We will use Slack for most of the communication before and during the course. Please make sure to check the slack channel regularly.

How the course works

We will meet for the first time on Monday 13:00 CET on zoom. Please make sure to have a look at @before_start, so that we can start swiftly.
The course will be split into daily live sessions (13:00 - 17:00 CET) and related asynchronous exercise session on which you will work between the live sessions. You can chose the timing of the exercise sessions, and we will be available to answer questions via slack from 9:00 - 12:00 CET every day.
During the live session we will consist of lectures where we will present theoretical concepts and a broader context for the exercises of each day and demonstrations where we will briefly present the analysis work-flows for each day and you will then have time to explore them and ask questions. During the asynchronous sessions you will independently address specific exercises following tutorials.
Detailed information on the course are available on the course webpage (https://azizka.github.io/big_data_biogeography/) which we will update constantly during the course. On the webpage you will find for each day:
- the exercises with tutorial
- a detailed schedule
- learning objectives and expected outcomes
- further reading
During the course you will work on your own project, applying the methods presented during the course to your own data. To do so please bring a taxonomic group of interest (ideally up to 200 species, and if possible with a phylogeny available), and think of some questions regarding this dataset that you would like to answer. This will give you the opportunity to chose questions and exercises most suitable for your work and get feedback from the teacher. There will be example data for all exercises, in case you do not have your own data yet. At the end of the course you will briefly present your results. You can find more information on the project on the course webpage.

Contact

Course content: Alexander Zizka Organisational: Physalia courses

Physalia courses webpage

Objectives

After this course, students will be able to:

Obtain and prepare large scale species occurrence records from public databases in R (including data mining, data cleaning and exploration)
Apply novel methods for handling and processing ‘big data’ in biogeographic research, including area classification, bioregionalization and automated conservation assessments
Reconstruct species ancestral ranges based on species occurrences and phylogenetic trees, using different evolutionary models
Understand the potential and caveats of fossil based biogeography, and be familiar with novel methods to estimate ancestral ranges and evolutionary rates from ranges of extinct and extant taxa

Background

The public availability of large-scale species distribution data has increased drastically over the last ten years. In particular, due to the aggregation of records from museums and herbaria, and citizen science in public databases such as the Global Biodiversity Information Facility (GBIF). This is leading to a ‘big data’ revolution in biogeography, which holds an enormous but still poorly explored potential for understanding large scale patterns and drivers of biodiversity in space and time.

Course literature

Meyer et al. (2015) Global priorities for an effective information basis of biodiversity distributions. Nature Communications, 8 pp.
Antonelli et al. (2018) Amazonia is the primary source of Neotropical biodiversity. PNAS 115(23): 6034–6039.
One of the following suggestions (depending on your own interests): a. Edler et al. (2017) Infomap Bioregions: Interactive mapping of biogeographical regions from species distributions. Systematic Biology 66(2):197–204.

b. Zizka et al. (2019) CoordinateCleaner: Standardized cleaning of occurrence records from biological collection databases. Methods in Ecology and Evolution 10:744-751.

d. Price et al. (2019) Big data little help in megafauna mysteries. Nature 558(7):23-25

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
docs		docs
.gitignore		.gitignore
01_schedule.Rmd		01_schedule.Rmd
02_student_project.Rmd		02_student_project.Rmd
IUCN_results.xlsx		IUCN_results.xlsx
Readme.md		Readme.md
ToDo.md		ToDo.md
_site.yml		_site.yml
big_data_biogeography.Rproj		big_data_biogeography.Rproj
bon_endemism.Rmd		bon_endemism.Rmd
bon_extracting_environmental_data.Rmd		bon_extracting_environmental_data.Rmd
bon_species_list.Rmd		bon_species_list.Rmd
bon_species_richness_ecoregions.Rmd		bon_species_richness_ecoregions.Rmd
example_data.lnk		example_data.lnk
fr_overview.Rmd		fr_overview.Rmd
fr_presentations.Rmd		fr_presentations.Rmd
index.md		index.md
mo_download_gbif.Rmd		mo_download_gbif.Rmd
mo_download_iucn.Rmd		mo_download_iucn.Rmd
mo_download_paleobioDB.Rmd		mo_download_paleobioDB.Rmd
mo_overview.Rmd		mo_overview.Rmd
mo_setup.Rmd		mo_setup.Rmd
thu_aa_criterion_B.Rmd		thu_aa_criterion_B.Rmd
thu_aa_neural_network.Rmd		thu_aa_neural_network.Rmd
thu_further_reading.Rmd		thu_further_reading.Rmd
thu_overview.Rmd		thu_overview.Rmd
tue_clean_fossils.Rmd		tue_clean_fossils.Rmd
tue_clean_geographic_data.Rmd		tue_clean_geographic_data.Rmd
tue_further_reading.Rmd		tue_further_reading.Rmd
tue_overview.Rmd		tue_overview.Rmd
tue_probabilistic_cleaning.Rmd		tue_probabilistic_cleaning.Rmd
tue_sampling_bias.Rmd		tue_sampling_bias.Rmd
tue_species_ranges_and_richness.Rmd		tue_species_ranges_and_richness.Rmd
wed_DEC_bsm.Rmd		wed_DEC_bsm.Rmd
wed_ancestral_areas_DEC.Rmd		wed_ancestral_areas_DEC.Rmd
wed_bioregionalization.Rmd		wed_bioregionalization.Rmd
wed_data_preparation.Rmd		wed_data_preparation.Rmd
wed_diversification_rates_geosse.Rmd		wed_diversification_rates_geosse.Rmd
wed_further_reading.Rmd		wed_further_reading.Rmd
wed_overview.Rmd		wed_overview.Rmd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Physalia courses: Big Data Biogeography

Schedule

Important

How the course works

Contact

Objectives

Background

Course literature

About

Releases

Packages

pelow22/big_data_biogeography

Folders and files

Latest commit

History

Repository files navigation

Physalia courses: Big Data Biogeography

Schedule

Important

How the course works

Contact

Objectives

Background

Course literature

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages