Globus

Documentation

Technology Description
- Globus, developed at the University of Chicago, is a comprehensive toolkit for building and managing large datasets. It provides essential tools for secure authentication, job management, data transfer, and resource discovery across distributed computing environments. Widely used in scientific research and beyond, Globus facilitates seamless collaboration and efficient utilization of computing and data resources on a global scale.
Learn from an expert
- Recorded presentation from Natasha Pavlovikj: https://youtu.be/VduJropUrOw
- Slides: https://drive.google.com/file/d/1y3-AGURTgF9mQ4kzbAlid1-lQBF5aogT
More Information
- The Globus official website: https://www.globus.org

Pros and Cons

Cost to setup
- Many features of Globus are freely available for academic and research purposes.
- Advanced features are available through a subscription based service.
Pros
- Fully distributed data architecture
- Strong emphasis on data sharing and data transfer
- Well suited for large datasets and large files
Cons
- Built for file sharing, not database access
- Not useful for data discovery, you must already know about a dataset from a different source to access it

Example use cases

Use Case
- Use Case

FAIR Principles

Findability - Metadata and data should be easy to find for both humans and computers.
- F1 - (Meta)data are assigned a globally unique and persistent identifier
- F2 - Data are described with rich metadata (defined by R1 below)
- F3 - Metadata clearly and explicitly include the identifier of the data they describe
- F4 - (Meta)data are registered or indexed in a searchable resource
Accessibility - Once the user finds the required data, it should be clear how the data can be fully accessed.
- A1 - (Meta)data are retrievable by their identifier using a standardized communications protocol
- A1.1 - The protocol is open, free, and universally implementable
- A1.2 - The protocol allows for an authentication and authorization procedure, where necessary
- A2 - Metadata are accessible, even when the data are no longer available
Interoperability - The data should easily interoperate with other data, as well as applications for analysis, storage, and processing.
- I1 - (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.
- I2 - (Meta)data use vocabularies that follow FAIR principles
- I3 - (Meta)data include qualified references to other (meta)data
Reusability - Metadata and data should be well-described so that they can be replicated and/or combined in different settings.
- R1 - (Meta)data are richly described with a plurality of accurate and relevant attributes
- R1.1 - (Meta)data are released with a clear and accessible data usage license
- R1.2 - (Meta)data are associated with detailed provenance
- R1.3 - (Meta)data meet domain-relevant community standards

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Globus

Table of Contents

Documentation

Pros and Cons

Example use cases

FAIR Principles

Data Federation Technology Overview

Faidare

iRODS

RDF (YARRRML/Shallot)

BrAPI

GraphQL

Globus

SOLID

Clone this wiki locally