| Literature DB >> 23104377 |
Kyle R Conway1, Christopher N Boddy.
Abstract
ClusterMine360 (http://www.clustermine360.ca/) is a database of microbial polyketide and non-ribosomal peptide gene clusters. It takes advantage of crowd-sourcing by allowing members of the community to make contributions while automation is used to help achieve high data consistency and quality. The database currently has >200 gene clusters from >185 compound families. It also features a unique sequence repository containing >10 000 polyketide synthase/non-ribosomal peptide synthetase domains. The sequences are filterable and downloadable as individual or multiple sequence FASTA files. We are confident that this database will be a useful resource for members of the polyketide synthases/non-ribosomal peptide synthetases research community, enabling them to keep up with the growing number of sequenced gene clusters and rapidly mine these clusters for functional information.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23104377 PMCID: PMC3531105 DOI: 10.1093/nar/gks993
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Organization of ClusterMine360. The compound family and cluster represent the two major organization units of the database. Additional data fields connect to either the compound family or cluster. The organization of the fredericamycin gene cluster is shown in the cluster pane (16).
Figure 2.ClusterMine360 has automated many of the steps required for curating the database. Automated curation is essential to enable crowd-sourcing without sacrificing data quality.
Figure 3.A rooted phylogenetic tree of heterocyclization domains from NRPS gene clusters shows that heterocyclization domains tree is based on function. ClusterMine360 provides a rapid and powerful tool for generating and analysing phylogenetic trees of PKS and NRPS domains.