README

How to compile: make # in the project root directory

How to run: bin/fgclust.sh # a help message will show up

What is this repository for?

  • This repository contains the source code for FgClust, a fast and scalable algorithm for clustering biological sequences. FgClust can cluster all EST sequences in the NCBI EST database, all RNA sequences in RefSeq, and all protein sequences released by UniProt in less than one day each on a typical Linux server.
  • A stable version (aka v1.0) will be released in the near future.

History

Zhao, Xiao Fei worked on this project at the end of 2007 and is grateful for the academic mentorship provided by Zhan, Shing Hei. Most, though not all, of this project was done at Fusion Genomics (http://fusiongenomics.com/). The source code in this repository is released under the GNU General Public License v3.0.