CD-HIT Suite: a web server for clustering and comparing biological sequences.

Journal:

Bioinformatics 2010 Mar

Authors:

Huang Y, Niu B, Gao Y, Fu L, Li W

Abstract

CD-HIT is a widely used program for clustering and comparing large biological sequence datasets. In order to further assist the CD-HIT users, we significantly improved this program with more functions and better accuracy, scalability and flexibility. Most importantly, we developed a new web server, CD-HIT Suite, for clustering a user-uploaded sequence dataset or comparing it to another dataset at different identity levels. Users can now interactively explore the clusters within web browsers. We
...[more]
also provide downloadable clusters for several public databases (NCBI NR, Swissprot and PDB) at different identity levels. AVAILABILITY: Free access at http://cd-hit.org[less]

Mesh Headings:

Cluster Analysis, Computational Biology, Databases, Genetic, Internet, Sequence Alignment, Sequence Analysis, Software, User-Computer Interface