Getting Started Guide

Getting Started with `SpaDES`

Attach the SpaDES package in your R session using:

library(SpaDES)

Set your default working directories

Simulations make use of several working directories:

an inputs directory, inputPath, where SpaDES looks to find simulation inputs;
an outputs directory, outputPath, where simulation outputs are saved;
a cache directory, cachePath, where simulation outputs are cached;
a modules directory, modulePath, where modules and their data are downloaded and saved.

Unless otherwise specified during simInit (by passing a paths argument), the default working directories are set via options. Unless these options are changed by the user, the temporary locations are used.

To configure the location of these working directories:

## use 'setPaths' to quickly set all paths to a default location
setPaths() ## set all paths to defaults

## alternatively, custom paths can be set as arguments to 'setPaths'
setPaths(inputPath = 'path/to/my/inputs') ## set custom inputPath; all others set to defaults
setPaths(inputPath = 'path/to/my/inputs', outputPath = 'path/to/my/outputs',
         cachePath = 'path/to/my/cache', modulePath = 'path/to/my/modules') ## set all paths custom

## or by chaging the global options directly
options(spades.inputPath = 'path/to/my/inputs')
options(spades.outputPath = 'path/to/my/outputs')
options(spades.cachePath = 'path/to/my/cache')
options(spades.modulePath = 'path/to/my/modules')

Remember that once custom paths are set by the user, calling setPaths will reset the the directories to default. So if custom set paths have to be called, use

myinputPaths <- getPaths()$inputPath

Using pre-existing modules

Browse locally available modules:

openModules(path = "/path/to/my/modules") # opens all modules in a directory
openModules("moduleName", "/path/to/my/modules") # opens only the named module

Browse modules at https://github.com/PredictiveEcology/SpaDES-modules
Download modules for use:
```
downloadModule("moduleName", path = "/path/to/my/modules", data = TRUE)
openModules("moduleName", "/path/to/my/modules")
```
If no path is specified, modules and data will be downloaded and saved in the location returned by getOption('spades.modulePath'). See above to change this default location.

Try the LCC2005 module tutorial to see SpaDES at work.

Creating new modules

Create an empty module template:

newModule("moduleName", path = "/path/to/my/modules")

Read the modules vignette for more details.

Module development checklist

Metadata

are module metadata fully and correctly specified (module description, authorship and citation info, parameters and inputs/outputs, etc.)?
citation should specify how to cite the module, or if published, the paper that describes the module.
module object dependencies: use moduleDiagram and objectDiagram to confirm how data objects are passed among modules.

Events

are all event types defined in doEvent?
use function(sim) to access event functions from within a module: functions calls are correctly namespaced (i.e., it looks first inside the functions built in the module)
use sim$object to access and make "global" data objects, shared among events and modules
use mod$object to access and make module-specific functions, not intended to be shared with other modules
use e.g., sim[[globals(sim)$objectName]] to access variable-named objects

Documentation

have you provided useful (meaningful) documentation in the module's .Rmd file and README?
have you built (knitted) the .Rmd file to generate a .pdf or .html version?
have you specified the terms under which your module code can be reused and/or modified? Add a license!

Data

we suggest that data you wish to include with your module are saved in data/; this makes modules more easily shareable with other people. Access those data with dataPath(sim)
verify that external data sources are included in the sourceURL metadata field
verify that any additional data preparation/transformation steps used in .inputObjects are correct; SpaDES.tools::prepInputs( ) may be very useful
write CHECKSUMS.txt file for all data using checksums(..., write = TRUE)

Distributing your module

where will your module code/data be hosted? Currently Google Drive and Dropbox appear to be easy places which can be private or public, and can now be easily accessed with googledrive and rdrop2 packages
test downloadModule and downloadData from a temp dir to ensure your module can be downloaded correctly by others

Strategies for module development

Speed considerations

Since modules will often have to run many, many times because of replication, there are a few strategies that should be followed:

Always write fast code. This likely means using data.table (usually fastest) or dplyr (not quite as fast) for data and data wrangling.
- Avoid data.frame if possible.
- Matrices and vectors are generalyl fastest, if they provide the necessary features.
- Avoid loops.
Use memoise or reproducible package to Cache functions for speed.
For computationally intensive functions, consider writing them in C++, via the Rcpp package.
For large (out of RAM) situations, use ff or bigMemory. Sometimes, these can be done seamlessly inside functions using the getOption("spades.lowMemory"), where two alternatives a provided, one "in Memory" the other "on disk". See "if (lowMemory)" code block about 20 lines from start of spread function for one way to do this with ff.

Other best practices

Don't write modules that depend internally on other modules. Instead, pass data via the inputObjects and outputObjects in the metadata. This means avoid scheduling one event in module A from module B, if possible.
Use and push publicly sharable modules from and to the SpaDES-Modules repository (https://github.com/PredictiveEcology/SpaDES-modules) using downloadModule() or via pull request.

Types of modules

The concept of a "module" can be very broadly defined, i.e., what a particular module does can vary widely. The only components that must exist are the metadata and the init event. This means that many, many types of modules can be written. As we slowly build a SpaDES ecosystem of modules designed to be used and re-used, we can consider writing our entire work flow -- raw data, data wrangling, data analysis, calibration of simulation model, simulation, output analysis, decision support -- all in one chain. We can cache everything along the way, so that if something must run again, but its inputs are identical to a previous run, then it can just read from disk.

This is an evolving list of types of modules that would be useful to have in this "re-use" cycle:

dynamic forecasting
- "classical" simulation models
- NetLogo-type models
- SELES-type models
- time is a component of the model
static forecasting
- e.g., predict methods from statistical outputs
agent based models
- animals, plants
- processes, such as fire
raster models
- e.g., forest succession, cellular automata
statistical
- Bayesian
calibration and optimization
- taking outputs from other modules and rescheduling those other modules again, iterating through a heuristic optimization
translators
- from one data type to another to allow two different modules to talk
GIS
- reprojection, crop, mask etc.
data fetching
- modules that go to specific web resources (e.g., Dryad etc.)
data manipulation
- simplifying, joining etc.
- interpolators
output analysis
- e.g., takes time series of rasters and visualizes them
quality scanning - e.g., from external databases

Current modules on the SpaDES-Modules repository (see above) include simple versions of dynamic forecasting (Forest Succession, fireSpreadLcc, forestAge), GIS (cropReprojectLccAge), translators (LccToBeaconsReclassify),

Project links

Working with `SpaDES`

Contributing

Completed projects

Wolves recolonizing the Italian Alps - This is a rewrite in SpaDES of Marucco & McIntire 2010.

Demonstration apps

LandWeb Demonstration App

SpaDES Workshops

Upcoming workshops

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting Started Guide

Getting Started with `SpaDES`

Set your default working directories

Using pre-existing modules

Creating new modules

Module development checklist

Metadata

Events

Documentation

Data

Distributing your module

Strategies for module development

Speed considerations

Other best practices

Types of modules

Project links

Working with `SpaDES`

Contributing

Completed projects

Demonstration apps

SpaDES Workshops

Clone this wiki locally

Getting Started Guide

Getting Started with SpaDES

Set your default working directories

Using pre-existing modules

Creating new modules

Module development checklist

Metadata

Events

Documentation

Data

Distributing your module

Strategies for module development

Speed considerations

Other best practices

Types of modules

Project links

Working with SpaDES

Contributing

Completed projects

Demonstration apps

SpaDES Workshops

Clone this wiki locally

Getting Started with `SpaDES`

Working with `SpaDES`