XspecT - Acinetobacter Species Assignment Tool

XspecT 1.0.0 is a Python-based tool to taxonomically classify Acinetobacter sequence-reads (or assembled genomes) on the species and/or sub-type level using Bloom Filters and a Support Vector Machine. It also identifies existing blaOxa-genes and provides a list of relevant research papers for further information.

XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a reference database. Bloom Filter ensure a fast lookup in this process. For a final prediction the results are classified using a Support Vector Machine.

Local extensions of the reference database are supported.

The tool is available as a web-based application and a smaller command line interface.

Installation and Usage

Python Modules - Install Requirements

On Linux you need the python-dev package:

sudo apt install python3.10-dev

XspecT requires the latest 64 bit Python version and a list of Python Modules (see below).

pip install -r requirements.txt

List of used Modules for Python (3.10):

Flask
Flask-Bcrypt
Flask-Login
Flask-WTF
WTForms
Werkzeug
Bcrypt
Biopython
bitarray
mmh3
numpy
pandas
requests
scikit-learn
Psutil
Matplotlib
Pympler
H5py

For XspecT-Dev. you also need to install jellyfish. Using conda:

conda install -c bioconda jellyfish

Get the Bloomfilters

Copy the folder that is located in the following directory into your XspecT installation:

/share/project/dominik_s/XspecT/filter/acinetobacter

How to run the web-app: Local Deployment

Run the following command lines in a console, a browser window will open automatically after the application is fully loaded.

MAC/Linux:

$ export FLASK_APP=flaskr
$ export FLASK_ENV=development
$ python app.py

Windows cmd:

set FLASK_APP=flaskr
set FLASK_ENV=development
python app.py

How to use the XspecT command line interface:

Open the file XspecT_mini.py with the configuration you want to run it with as arguments.

python3 XspecT_mini.py XspecT ClAssT Oxa Fastq 100000 Metagenome "path/to/your/input-set"

Important:

If you use reads the number of reads needs to specified directly after the file-type
the path to your data-set is the last argument
all commands are explained in XspecT_mini Commands/.md
Output data for the metagenome mode in XspecT_mini is temporary very simple in this format: [prediction number-of-reads/contigs score-median repetitveness length-median bootstrap-median]
Adding new genera is only supported by XspecT_mini in the XspecT-Dev. branch

Input Data

XspecT is able to use either raw sequence-reads (FASTQ-format .fq/.fastq) or already assembled genomes (FASTA-format .fasta/.fna). Using sequence-reads saves up the assembly process but high-quality reads with a low error-rate are needed (e.g. Illumina-reads).

The amount of reads that will be used has to be set by the user when using sequence-reads. The minimum amount is 5000 reads for species classification and 500 reads for sub-type classification. The maximum number of reads is limited by the browser and is usually around ~8 million reads. Using more reads will lead to a increased runtime (xsec./1mio reads).

Known Bugs

Sometimes when searchinig for OXA genes, the score will be over 1.0. In this case it can be assumed that the full OXA gene has been found.

Using a Python version >3.10 can cause issues with Python packages or increase runtime.

Walkthrough

A detailed walkthrough with examples is provided in Xspect’s wiki.

Contributors

Sam Gimbel
Bardja Djahanschiri
Vinh Tran
Ingo Ebersberger

About this project

This project is an attempt to support hospital staff in a possible A. baumannii outbreak. A. baumannii can build up antibiotic resistance and can cause deadly nosocomial infections. This is a bachelor thesis project; no warranty is given. Check the license for more information. constructive criticism/feedback always welcomed!

Name		Name	Last commit message	Last commit date
Latest commit History 179 Commits
Instructions/pictures		Instructions/pictures
Results		Results
Temp_for_addFilter		Temp_for_addFilter
Training_data		Training_data
config		config
files		files
filter		filter
static		static
templates		templates
.gitattributes		.gitattributes
.gitignore		.gitignore
Add_Species.py		Add_Species.py
BF_v2.py		BF_v2.py
Bootstrap.py		Bootstrap.py
Classifier.py		Classifier.py
Filter_Editor.py		Filter_Editor.py
Helper.py		Helper.py
LICENSE		LICENSE
OXA_Table.py		OXA_Table.py
Option_field_WebApp.py		Option_field_WebApp.py
README.md		README.md
WebApp.py		WebApp.py
XspecT_mini Commands:.md		XspecT_mini Commands:.md
XspecT_mini.py		XspecT_mini.py
XspecT_mini_alt.py		XspecT_mini_alt.py
XspecT_mini_alt2.py		XspecT_mini_alt2.py
app.py		app.py
change_password.py		change_password.py
logger.log		logger.log
requirements.txt		requirements.txt
search_filter.py		search_filter.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XspecT - Acinetobacter Species Assignment Tool

Table of Content

Installation and Usage

Python Modules - Install Requirements

List of used Modules for Python (3.10):

Get the Bloomfilters

How to run the web-app: Local Deployment

How to use the XspecT command line interface:

Input Data

Known Bugs

Walkthrough

Contributors

About this project

About

Releases

Packages

Contributors 2

Languages

License

BIONF/XspecT

Folders and files

Latest commit

History

Repository files navigation

XspecT - Acinetobacter Species Assignment Tool

Table of Content

Installation and Usage

Python Modules - Install Requirements

List of used Modules for Python (3.10):

Get the Bloomfilters

How to run the web-app: Local Deployment

How to use the XspecT command line interface:

Input Data

Known Bugs

Walkthrough

Contributors

About this project

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages