| Literature DB >> 31411686 |
Zhongjie Tang1, ShaoQi Chen1, Ang Chen1, Bifang He1,2, Yuwei Zhou1, Guoshi Chai1, FengBiao Guo1, Jian Huang1.
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR) and associated proteins (Cas) constitute CRISPR-Cas systems, which are antiphage immune systems present in numerous bacterial and most archaeal species. In recent years, CRISPR-Cas systems have been developed into reliable and powerful genome editing tools. Nevertheless, finding similar or better tools from bacteria or archaea remains crucial. This requires the exploration of different CRISPR systems, identification and characterization new Cas proteins. Archives tailored for Cas proteins are urgently needed and necessitate the prediction and grouping of Cas proteins into an information center with all available experimental evidence. Here, we constructed Cas Protein Data Bank (CasPDB), an integrated and annotated online database for Cas proteins from bacteria and archaea. The CasPDB database contains 287 reviewed Cas proteins, 257 745 putative Cas proteins and 3593 Cas operons from 32 023 bacteria species and 1802 archaea species. The database can be freely browsed and searched. The CasPDB web interface also represents all the 3593 putative Cas operons and its components. Among these operons, 328 are members of the type II CRISPR-Cas system.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31411686 PMCID: PMC6693189 DOI: 10.1093/database/baz093
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1The construction of CasPDB.
Figure 2Statistics of CasPDB. The number of bacterial and archaeal species and their putative Cas operons (A). Statistical distribution of each type of Cas proteins (B).
Figure 3Home and browse pages. Search option and home page (A). Browse page with all Cas proteins. The bottom section of the page shows protein distribution in bacteria (B).
Figure 4Detail and download pages. The detail page shows basic information of protein and operons (A). Download option for all Cas proteins and proteins of selected items (B).