-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathportfolio.qmd
230 lines (124 loc) · 9.76 KB
/
portfolio.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
---
title: "Workshop portfolio"
---
## Workshop structure, style and expectation
All workshops are run under The Carpentries [Code of Conduct](https://docs.carpentries.org/topic_folders/policies/code-of-conduct.html) to promote a safe and inclusive working environment.
Our training workshops are interactive live-coding events. One or more Instructors (subject matter experts who have received formal teaching training) will demonstrate by writing code in real-time while attendees follow along. Instructors provide details about what they are doing and explain the decision making and thought processes that they are going through. Attendees will follow along and work through all code at the same time as the Instructor. Short exercises, either for individuals or groups, will reinforce your learning. All workshops will also have one or more Helpers: experienced members of the training community or subject matter experts who are on hand to answer questions and solve issues as they arise. Helpers ensure that no attendee is left behind and that everyone is able to complete tasks in a timely manner.
Our workshops run with the help of New Zealand eScience Infrastructure (NeSI). Our partnership with NeSI ensures there is no need to install software, no compatibility issues, no minimum computing requirements. All you will need to complete the workshop is a computer with an internet connection and a web browser.
All workshop material, which includes written notes that cover all main points, code examples, and tasks (as well as their solutions), is provided in advance and is freely available.
Workshops are either in person or online (zoom), and are usually one or two days (10am - 4pm).
## Introductory workshops
Our series of introductory workshops are designed to take you from absolute beginner to self-driving learner. Each of these workshops can be done as a standalone crash course in a particular topic, and they can also be strung together to provide a strong foundation in bioinformatics principals and practices.
### Introduction to Bash Scripting and HPC Job Scheduler
Get started with writing your own scripts for data analysis and working on an HPC (high performance computing) environment.
Some of the topics covered in the workshop are:
- Designing a variant calling workflow.
- Automating a workflow.
- An introduction to HPC.
- Working with job scheduler.
View the full workshop material here:
[Introduction to Bash Scripting and HPC Scheduler](https://genomicsaotearoa.github.io/Workshop-Bash_Scripting_And_HPC_Job_Scheduler/)
### Introduction to the R Programming Language
Get started with R, a highly popular programming language in the fields of biology and statistics. R is world-renowned for producing high-quality, publication-ready figures and tables.
Note that this workshop is a pre-requisite for the RNA-seq Data Analysis workshop.
Some of the topics covered in the workshop are:
- An introduction to R and RStudio.
- R basics: The R language, reading data into R, storing data as objects.
- R packages.
- Publication-quality data presentation using ggplot2.
- Knitr: keep track of workflow and produce easy-to-follow reports of your work.
- Where to get more help when you are ready to do more.
We assume the learner has no prior experience with the tools covered in the workshop. However, learners are expected to have some familiarity with biological concepts.
View the full workshop material here:
[Introduction to the R Programming Language](https://genomicsaotearoa.github.io/Introduction-to-R/)
### RNA-seq Data Analysis
Get started with analysing RNA-seq datasets, identifying differentially expressed genes and highlighting impacted biological processes.
Some of the topics covered in the workshop are:
- Quality assessment
- Trimming and filtering
- Mapping and read counts
- Differential expression analysis
- Over-representation analysis
View the full workshop material here:
[RNA-seq Data Analysis](https://genomicsaotearoa.github.io/RNA-seq-workshop/)
### Genomics Data Carpentry (introduction to Shell)
Learn the fundamentals of working with the Command Line Interface (CLI). Shell is a program that allows you to interact with the command line. Familiarity with the shell will allow you to access remote servers, automate tasks, and use a wide range of tools that are unavailable on a Graphical User Interface (GUI)
During this workshop you will learn:
- The importance of the shell.
- How to navigate files and directories.
- How to create, view and modify files.
- Pipes, redirection, and scripts, which will allow you to automate your workflow.
View the full workshop material here:
[Introduction to the Command Line for Genomics](https://otagobioinformaticsspringschool.github.io/shell-genomics-nesi/index.html)
## Intermediate workshops
### Intermediate Shell for Bioinformatics
Shell overview, downloading and verifying data, inspecting and manipulating text data with Unix tools, automating file-processing.
This includes:
- An overview of the Shell, UNIX and Linux.
- Downloading data from a remote source and checking data integrity.
- Recap navigating files and directories, and commands used in routine tasks.
- Inspecting and manipulating data, part 1 (the head, less, grep, and sed commands).
- Inspecting and manipulating data, part 2 (using awk and bioawk to process text).
- Automating file processing.
- Challenges: solve example molecular biology problems using shell scripts.
View the full workshop material here:
[Intermediate Shell for Bioinformatics](https://genomicsaotearoa.github.io/shell-for-bioinformatics/)
### Intermediate R: advance your skills with the R programming language
Advance your skills with R! You will learn to complete R tasks with fewer lines of code, scale your analyses, and write readable code.
Some of the topics covered in the workshop are:
- Introduction to relational data and the join function.
- Working with regular expressions and functions from the stringr package.
- Writing custom functions, working with conditional statements.
- 'Defensive programming'.
- Iterations - for loops, and map_*() functions.
- The importance of data structure in R.
View the full workshop material here:
[Intermediate R: advancing your skills with the R programming language](https://genomicsaotearoa.github.io/Intermediate-R/)
### Outlier Analysis
Identify regions of the genome that are under forms of selection during a two-day workshop.
During this workshop you will:
- Download example genomic data (or prepare your own).
- Use the PCAdapt tool to identify outlier loci within a genome.
- Use VCFtools to identify outlier SNPS in population comparisons.
- Use Bayescan to identify outlier SNPS based on allele frequencies.
- Relate identified SNPS to phenotypic variation.
- Compare the results of the different methods and discuss the results.
The focus of this workshop is on identifying signals of selection in an example genome using the outlier analysis method. Outlier analysis assumes that the majority of the genome is under neutral selection and some loci will appear as outliers relative to this background.
This lesson assumes learner has no prior experience with the tools covered in the workshop. However, learners are expected to have some familiarity with biological concepts, including the concept of selection. You are expected to have some familiarity with both the R programming language and with basic command line (bash).
View the full workshop material here:
[Outlier Analysis](https://genomicsaotearoa.github.io/Outlier_Analysis_Workshop/)
## Advanced workshops
### Reproducible Bioinformatics with Nextflow and nf-core
Reproducible research is of the utmost importance. Nextflow is workflow management software that enables the writing of scalable and reproducible scientific workflows. It can integrate various software packages and environment management systems from environment modules to Docker, Singularity, and Conda. It allows for existing pipelines written in common scripting languages to be seamlessly coupled together. Nextflow simplifies the implementation and execution of workflows on cloud or high-performance computing infrastructures.
In this workshop you will:
- Be introduced to Nextflow and execute an example pipeline.
- Be introduced to nf-core, an online repository of curated pipelines.
- Learn how to configure and customise an existing nf-core pipeline.
- Generate metrics and reports.
View the full workshop material here:
[Reproducible Bioinformatics Workflows with Nextflow and nf-core](https://genomicsaotearoa.github.io/Nextflow_Workshop/)
### Constructing Pangenome Graphs with PGGB
How to construct a pangenome graph using a popular tool (PGGB), including QC, variant extraction and short-read mapping.
This workshop will include:
- An introduction to pangenome graphs.
- Setup guide for using the tools and data in the workshop.
- Overview of the PGGB toolkit.
- Choosing parameters to construct a graph.
- QC
- Extracting variant data.
- Mapping short reads against a pangenome graph.
View the full workshop material here:
[Unlock the Power of Pangenome Graphs](https://genomicsaotearoa.github.io/Pangenome-Graphs-Workshop/)
### Scaling Gene Regulatory Networks Simulations
Simulate gene regulatory networks using R and Julia.
This workshop will include:
- Why simulations are valuable in systems biology,
- What regulatory networks are and how they can be modelled,
- How to use the sismonr R package to simulate a small regulatory network, and
- Introduction to High performance computing (HPC)
- HPC architectures
- Batch systems and using a Slurm scheduler
- HPC architectures
- How to scale up simulations on a HPC through profiling and optimisation
View the full workshop material here:
[Scaling Gene Regulatory Networks Simulations](https://genomicsaotearoa.github.io/Gene_Regulatory_Networks_Simulation_Workshop/)