| Literature DB >> 27577199 |
De-Kang Zhu1,2, Xue-Qin Yang1,2, Yang He1,2, Wang-Shu Zhou1,2, Xiao-Heng Song1,2, Jiang-Bo Wang1,2, Yu Zhang1,2, Ma-Feng Liu2,3, Ming-Shu Wang2,3, Ren-Yong Jia1,2, Shun Chen2,3, Kun-Feng Sun2,3, Qiao Yang2,3, Ying Wu2,3, Xiao-Yue Chen1,2, An-Chun Cheng4,5.
Abstract
BACKGROUND: Riemerella anatipestifer infection is a contagious disease that has resulted in major economic losses in the duck industry worldwide. This study attempted to characterize CRISPR-Cas systems in the disease-causing agent, Riemerella anatipestifer (R. anatipestifer). The CRISPR-Cas system provides adaptive immunity against foreign genetic elements in prokaryotes and CRISPR-cas loci extensively exist in the genomes of archaea and bacteria. However, the structure characteristics of R. anatipestifer CRISPR-Cas systems remains to be elucidated due to the limited availability of genomic data.Entities:
Keywords: CRISPR-Cas system; Riemerella anatipestifer; cas gene; phylogenetic analysis; repeat sequence; spacer sequence
Mesh:
Year: 2016 PMID: 27577199 PMCID: PMC5006608 DOI: 10.1186/s12864-016-3040-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Graphic representation of CRISPR-cas loci in R. anatipestifer strains. The black blocks represent CRISPR loci. AT-rich leader sequences (light grey blocks) are located upstream of CRISPR1. There are three cas genes (cas1, cas2 and cas9) which are closed to CRISPR1, shown as white boxed arrows. Upstream of cas9, there is a 16 nt-long anti-repeat sequence, which might be considered as a portion of the sequence that encodes for a putative tracrRNA (dark grey block). The distance between CRISPR1 and CRISPR2 is long and varies from strain to strain, thus the sequence between the two loci is omitted and is represented by a double-slash
Fig. 2Multiple sequence alignment of putative leader sequences and the first repeats. The putative leader sequence (107 bp) and the first repeat (47 bp) of each CRISPR1 locus in analyzed strains were selected for multiple sequence alignment. The nucleotide sequences of the first repeats are framed by black box. The grey shadow and arrow indicate the palindromic sequence in leaders. The asterisk represents the conserved base in all of the sequences for alignment
Fig. 3Features of repeats in R. anatipestifer CRISPR loci. a and d show sequence logo for consensus direct repeats associated with CRISPR1 in 20 R. anatipestifer strains and sequence logo for consensus direct repeats associated with CRISPR2 in five R. anatipestifer strains, respectively. The height of the letters indicates the relative frequency of the corresponding base at that position. b and c display predicted secondary structure of the CRISPR1 consensus direct repeats. The putative secondary structure of consensus direct repeats containing the T23C and G39A variants is shown in c, e, f and g display predicted secondary structure of the CRISPR2 consensus direct repeats. The putative secondary structures of consensus direct repeats containing the C27T variant and A20G variant are shown in (f) and (g), respectively
Fig. 4Phylogenetic analysis by BPGA based on pangenome for 25 R. anatipestifer strains
Fig. 5Spacer organization in CRISPR1. The CRISPR1 arrays from 24 R. anatipestifer strains are graphic represented. In order to analyze conveniently, the repeats have been eliminated and only the spacers are shown. The different subgroups are separated by the dotted lines. All arrays in the same subgroup are aligned manually. The abscissa indicates serial numbers of spacers, and ordinate displays names of analyzed R. anatipestifer strains. The direction of spacers is consistent with serial numbers (ie spacer 1 is located in 5’ terminal). Identical spacers are displayed by the same color, and are aligned such that they have the same number (apart from duplicate spacer). Additionally, unique spacer is displayed by a unique combination of color and number
Fig. 6Spacer organization in CRISPR2. The CRISPR2 arrays from 21 R. anatipestifer strains are graphic represented as in Fig. 5. The asterisk represents the spacer with large size
Identification of phage/plasmid protospacers
| Spacer Number | Description of protospacer | Position of protospacer | Similarity |
|---|---|---|---|
| 16 |
| 146275-146304 | 87 % |
| 116 |
| 42519–42548 | 100 % |
| 120 |
| 42842–42871/80620–80649 | 87 %/87 % |
| 134 |
| 8954–8983 | 100 % |
| 136 |
| 27500–27529 | 90 % |
| 138 |
| 15935–15964 | 100 % |
| 145 |
| 13831–13860 | 100 % |
| 148 |
| 213466–213495/224417–224446 | 87 %/87 % |
| 155 |
| 24539–24568 | 100 % |
| 180 |
| 28468–28497 | 97 % |
| 188 |
| 1300–1329 | 100 % |
| 189 |
| 2509–2538 | 97 % |
| 195 |
| 317614–317642 | 93 % |
| 238 |
| 32595–32623 | 97 % |
The superscript “1” represents CRISPR1, and “2” represents CRISPR2. The spacer number is corresponding to that in Figs. 5 and 6
Fig. 7Prediction of PAM consensus sequence. The numbers associated with protospacer sequence are framed by red box. The height of the letters indicates the relative frequency of the corresponding base at that position
Fig. 8Phylogenetic tree for Cas9 from 40 Gram-negative type II system-containing bacteria. Cas9 proteins from 39 Gram-negative type II system-containing bacteria were selected for analysis (see Table S3). A phylogenetic tree based on Cas9 proteins from RA-CH-2 and 39 Gram-negative type II system-containing bacteria was constructed by neighbor-joining method using MEGA software (version 6). The bootstrap values of each node are shown. Different subclusters are enclosed by distinct brackets, and is in accordance with subtypes of type II system. The R. anatipestifer strain, RA-CH-2, is in red