`alpaca-qc`

A package for tidying plate reader data and adding drug annotation information from ALPACA (Australian Library of Paediatric Anti-Cancer Agents) drug libraries.

Introduction

Plate readers such as the Clariostar produce a .CSV file for each plate which contains well data and metadata. This data is not in tidy format, and needs to be parsed and tidied. Well locations also need to be annotated with drug information for downstream analysis.

alpacaqc is an R package that parses Clariostar results into a tidy long format and annotates the results with drug information.

Features

- Collects both experimental data and metadata from multiple Clariostar .CSV files, returning a list of tidy_data and tidy_metadata tibbles in long format.
- Imports a user-provided drug_key.csv as a tibble that maps well locations to drug information. The drug key is validated in several ways:
  - Checks that wells are unique, avoiding ID collisions.
  - Checks that plate row and column values fall within the range of a 384-well plate [rows A-P, cols 1-24].
  - Ensures that all required fields are complete [plate, plate_row, plate_col, drug_name].
  - Ensures that the cas_no field is complete for all drugs (except PBS/PURO).
- Annotates tidy_data with drug information from drug_key based on well position.
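
The drug key checks listed above can be sketched in base R. This is only an illustration of the rules, not the code used by import_drug_key(); the function name validate_drug_key_sketch() and the wording of the problem messages are hypothetical.

```r
# Hypothetical helper, NOT part of alpacaqc: a base R sketch of the checks above.
# Returns a character vector of problems (empty if the key passes).
validate_drug_key_sketch <- function(key) {
  required <- c("plate", "plate_row", "plate_col", "drug_name")
  missing_cols <- setdiff(c(required, "cas_no"), names(key))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }
  problems <- character(0)

  # Each (plate, row, column) combination may appear only once.
  well_id <- paste(key$plate, key$plate_row, key$plate_col)
  if (anyDuplicated(well_id) > 0) {
    problems <- c(problems, "duplicate wells")
  }

  # Rows A-P and columns 1-24 are the bounds of a 384-well plate.
  if (!all(key$plate_row %in% LETTERS[1:16]) || !all(key$plate_col %in% 1:24)) {
    problems <- c(problems, "well outside 384-well range")
  }

  # Required fields must not contain missing values.
  if (anyNA(key[required])) {
    problems <- c(problems, "incomplete required fields")
  }

  # cas_no may be missing only for PBS/PURO.
  needs_cas <- !(key$drug_name %in% c("PBS", "PURO"))
  if (any(is.na(key$cas_no) & needs_cas)) {
    problems <- c(problems, "missing cas_no")
  }
  problems
}

# Placeholder key: drug_a has no CAS number, so one problem is reported.
key <- data.frame(
  plate = c(1L, 1L, 1L),
  plate_row = c("A", "A", "A"),
  plate_col = c(1L, 2L, 3L),
  cas_no = c(NA, NA, NA),
  drug_name = c("PURO", "PBS", "drug_a"),
  stringsAsFactors = FALSE
)
validate_drug_key_sketch(key)  # reports "missing cas_no"
```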

Quick Start

Install devtools and alpacaqc:

install.packages("devtools")
devtools::install_github("Hudson-Institute-of-Medical-Research/alpaca-qc")

Gather and tidy data:

library(alpacaqc)

# Get experimental data csv using glob pattern
result_paths <- Sys.glob("data/sample/*.CSV")
# Get and validate drug key
drug_key_df <- import_drug_key("data/config/drug_well_key.csv")

# Tidy data
batch_data <- gather_plates(result_paths)

Annotate data with drug metadata and save:

# Annotate
annot_data <- add_drug_annot(batch_data$tidy_data, drug_key_df)
# Save
readr::write_csv(annot_data, "tidy_data.csv")
readr::write_csv(batch_data$tidy_metadata, "tidy_metadata.csv")

Usage

Drug Key Design

Create a drug key .csv file with the following fields:

- [plate]: An integer corresponding to a particular drug layout on a plate. This distinguishes different drugs that are in the same well position, but in different plate layouts.
- [plate_row]: A single letter [A-P] indicating a drug's row position on a 384-well plate.
- [plate_col]: An integer [1-24] indicating a drug's column position on a 384-well plate.
- [catalog_no] (optional): A character string indicating a drug's catalog number.
- [cas_no]: A character string of the form XXXXXX-XX-X indicating a drug's CAS Registry Number. Mandatory for all drugs except PBS/PURO, as it is used downstream for database purposes.
- [drug_name]: A character string giving the drug's name.

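| plate | plate_row | plate_col | catalog_no | cas_no | drug_name |
| --- | --- | --- | --- | --- | --- |
| 1 | A | 1 | PURO | NA | PURO |
| 1 | A | 2 | PBS | NA | PBS |
| 1 | A | 3 | XXXXX | XXXXXX-XX-1 | drug_a |
| 1 | A | 4 | XXXXX | XXXXXX-XX-2 | drug_b |
| 1 | A | 5 | XXXXX | XXXXXX-XX-3 | drug_c |
| ... | ... | ... | ... | ... | ... |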
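
Rather than typing all 384 wells by hand, an empty drug key skeleton can be generated in base R and then filled in. The sketch below is a convenience suggestion, not a feature of alpacaqc; the output file name is an arbitrary placeholder, and the NA fields must be completed before the key will pass validation.

```r
# Sketch only: builds an empty drug key skeleton for one 384-well plate.
# plate_col varies fastest, giving the order A1, A2, ..., A24, B1, ...
key_template <- expand.grid(
  plate_col = 1:24,
  plate_row = LETTERS[1:16],
  plate = 1L,
  stringsAsFactors = FALSE
)

# Reorder to the column layout shown above and add the fields to fill in.
key_template <- key_template[, c("plate", "plate_row", "plate_col")]
key_template$catalog_no <- NA_character_
key_template$cas_no <- NA_character_
key_template$drug_name <- NA_character_

# Placeholder output location; change to wherever the key should live.
template_path <- file.path(tempdir(), "drug_well_key_template.csv")
write.csv(key_template, template_path, row.names = FALSE)
```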

ID Names

Read plates using the Clariostar. Export the data with the following default ID string format:
```
<sample_name> <media> <concentration>nM P<plate>R<rep>
```
This format matters because metadata is parsed from the ID string using the following default regular expression:
```
r"(ID1: (?<sample>\w+) (?<media>\w+) (?<conc>\d+)nM P(?<plate>\d)R(?<rep>\d+))"
```
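To see what each named group captures, the default pattern can be applied to a sample ID string with base R. How alpacaqc applies the pattern internally is not shown here, so treat this as an illustration only; the ID string is made up to follow the format above.

```r
# Default pattern copied from above (raw strings need R >= 4.0).
id_regex <- r"(ID1: (?<sample>\w+) (?<media>\w+) (?<conc>\d+)nM P(?<plate>\d)R(?<rep>\d+))"

# Made-up ID string for illustration.
id_string <- "ID1: sample_name DMSO 10nM P1R1"

# With perl = TRUE, regexpr() reports the position of every capture group.
m <- regexpr(id_regex, id_string, perl = TRUE)
starts <- as.vector(attr(m, "capture.start"))
lens <- as.vector(attr(m, "capture.length"))

# Cut each group out of the ID string and label it with its group name.
fields <- substring(id_string, starts, starts + lens - 1)
names(fields) <- attr(m, "capture.names")
fields
# sample = "sample_name", media = "DMSO", conc = "10", plate = "1", rep = "1"
```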
Alternatively, a custom regex pattern can be supplied using the id_regex argument to gather_plates():
```
batch_data <- gather_plates(
    result_paths,
    id_regex = r"(ID1: (?<sample>\w+) (?<media>\w+) (?<conc>\d+)nM P(?<plate>\d)R(?<rep>\d+))"
)
```
Ensure that the custom regex pattern has named capture groups for [sample, media, conc, plate, rep].
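
Before passing a custom pattern to gather_plates(), it can help to confirm that it matches a real ID string and defines all five group names. The helper below is a hypothetical base R sketch, not part of alpacaqc, and the hyphen-delimited ID format is invented for the example.

```r
# Hypothetical helper, NOT part of alpacaqc: checks a custom ID pattern
# against one example ID string.
check_id_regex <- function(pattern, example_id) {
  required <- c("sample", "media", "conc", "plate", "rep")
  m <- regexpr(pattern, example_id, perl = TRUE)
  list(
    matched = as.integer(m)[1] != -1,
    missing_groups = setdiff(required, attr(m, "capture.names"))
  )
}

# Invented hyphen-delimited ID format, for illustration only.
custom_regex <- r"(ID1: (?<sample>\w+)-(?<media>\w+)-(?<conc>\d+)nM-P(?<plate>\d+)R(?<rep>\d+))"
result <- check_id_regex(custom_regex, "ID1: cellA-DMSO-10nM-P2R3")
result$matched         # TRUE
result$missing_groups  # character(0)
```

A pattern that matches but omits a group, for example one without (?<rep>...), would list "rep" under missing_groups.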

Tidy Data

Load alpacaqc in R and tidy the data:

library(alpacaqc)

# Get experimental data csv using glob pattern
result_paths <- Sys.glob("data/sample/*.CSV")
# Get and validate drug key
drug_key_df <- import_drug_key("data/config/drug_well_key.csv")

# Tidy data
batch_data <- gather_plates(result_paths)

batch_data is now a list with two elements, batch_data$tidy_data and batch_data$tidy_metadata, both in long format:

batch_data$tidy_data

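| test_run_no | datetime | sample | rep | plate | plate_row | plate_col | conc_nm | fluor |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 19344 | 2023-04-06T03:24:54Z | sample_name | 1 | 1 | A | 1 | 10 | 5181 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... |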

batch_data$tidy_metadata

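| filename | test_run_no | datetime | sample | rep | plate | conc_nm | media | no_of_flashes_per_well | presetname | excitation | dichroic_filter | emission | gain | wells_used_for_gain_adjustment | focal_height_mm |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| filename.CSV | 19344 | 2023-04-06T03:24:54Z | sample_name | 1 | 1 | 10 | DMSO | 5 | Alamar Blue | 545-10 | auto 565 | 590-20 | 1130 | C3 | 3.9 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |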
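
For readers new to long format, the base R sketch below reshapes a 16 x 24 plate grid into one row per well, which is the shape of tidy_data. It assumes the readings are already available as a matrix with row and column labels; the actual Clariostar file layout and the parsing done by gather_plates() are not covered, and the values are placeholders.

```r
# Placeholder 16 x 24 grid standing in for one plate's readings.
plate_grid <- matrix(
  seq_len(16 * 24),
  nrow = 16, ncol = 24,
  dimnames = list(LETTERS[1:16], as.character(1:24))
)

# as.table() + as.data.frame() yields one row per cell (wide -> long).
long <- as.data.frame(as.table(plate_grid), stringsAsFactors = FALSE)
names(long) <- c("plate_row", "plate_col", "fluor")
long$plate_col <- as.integer(long$plate_col)

head(long, 3)
```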

Annotate Data

Annotate batch_data$tidy_data with drug information from drug_key_df:

annot_data <- add_drug_annot(batch_data$tidy_data, drug_key_df)

annot_data now has the following structure:

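| test_run_no | datetime | sample | rep | plate | plate_row | plate_col | catalog_no | cas_no | drug_name | conc_nm | fluor |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 19344 | 2023-04-06T03:24:54Z | sample_name | 1 | 1 | A | 1 | PURO | NA | PURO | 10 | 5181 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |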
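
Conceptually, annotating by well position is a left join on plate, plate_row and plate_col. The base R sketch below shows that idea on toy data frames; it is not the implementation of add_drug_annot(), and the fluorescence values are placeholders.

```r
# Toy inputs mirroring the column names above; values are placeholders.
tidy_data <- data.frame(
  plate = c(1L, 1L, 1L),
  plate_row = c("A", "A", "A"),
  plate_col = c(1L, 2L, 3L),
  conc_nm = c(10, 10, 10),
  fluor = c(100, 200, 300),
  stringsAsFactors = FALSE
)
drug_key <- data.frame(
  plate = c(1L, 1L),
  plate_row = c("A", "A"),
  plate_col = c(1L, 2L),
  catalog_no = c("PURO", "PBS"),
  cas_no = c(NA_character_, NA_character_),
  drug_name = c("PURO", "PBS"),
  stringsAsFactors = FALSE
)

# all.x = TRUE keeps every measurement, even wells absent from the key
# (here A3), which receive NA in the drug columns.
annotated <- merge(
  tidy_data, drug_key,
  by = c("plate", "plate_row", "plate_col"),
  all.x = TRUE
)
annotated
```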

Save data for downstream use.

readr::write_csv(annot_data, "tidy_data.csv")
readr::write_csv(batch_data$tidy_metadata, "tidy_metadata.csv")

Additional Information

Versioning

dev-0.0.2

License

This project is licensed under the terms of the MIT license.
