How to compile: make # in the project root directory
How to run: bin/fgclust.sh # a help message will show up
- This repository contains the source code for FgClust, a fast and scalable algorithm for clustering biological sequences. FgClust can cluster all EST sequences in the NCBI EST database, all RNA sequences in RefSeq, and all protein sequences released by UniProt in less than one day each on a typical Linux server.
- A stable version (v1.0) will be released in the near future.
Zhao, Xiao Fei worked on this project at the end of 2007 and is grateful for the academic mentorship provided by Zhan, Shing Hei. Most, though not all, of this project was done at Fusion Genomics (http://fusiongenomics.com/). The source code in this repository is released under the GNU General Public License v3.0.