| Literature DB >> 26818725 |
Tatiana C Mangericao1,2, Zhanhao Peng3, Xuegong Zhang4,5.
Abstract
BACKGROUND: CRISPR has been becoming a hot topic as a powerful technique for genome editing for human and other higher organisms. The original CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats coupled with CRISPR-associated proteins) is an important adaptive defence system for prokaryotes that provides resistance against invading elements such as viruses and plasmids. A CRISPR cassette contains short nucleotide sequences called spacers. These unique regions retain a history of the interactions between prokaryotes and their invaders in individual strains and ecosystems. One important ecosystem in the human body is the human gut, a rich habitat populated by a great diversity of microorganisms. Gut microbiomes are important for human physiology and health. Metagenome sequencing has been widely applied for studying the gut microbiomes. Most efforts in metagenome study has been focused on profiling taxa compositions and gene catalogues and identifying their associations with human health. Less attention has been paid to the analysis of the ecosystems of microbiomes themselves especially their CRISPR composition.Entities:
Mesh:
Year: 2016 PMID: 26818725 PMCID: PMC4895601 DOI: 10.1186/s12918-015-0248-x
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
PILER-CR algorithm parameters
| Parameter | Value | Description |
|---|---|---|
| Minrepeat | 16 | Minimum length of a repeat. |
| Maxrepeat | 64 | Maximum length of a repeat. |
| Minspacer | 8 | Minimum spacer length. |
| Minarray | 3 | Minimum number of repeats in an array/cassette. |
| Maxspacer | 64 | Maximum spacer length. |
| Minrepeat ratio | 0,9 | Repeat length ratio |
| Minspacer ratio | 0,75 | Spacer length ratio |
| Mincons | 0,9 | A run of minarray repeats with at least mincons similarity is required for an array to be reported. The value is specified as fractional identity 0.0… 1.0, with 1.0 meaning identical sequences. |
Summary of the results’ comparison between the set of CRISPR cassetes found for type-2 diabetic individuals and healthy individuals
| Metagenomic dataset group | Type-2 diabetic individuals | Healthy (non-diabetic) individuals | |
|---|---|---|---|
| Cassettes | |||
| Identified by PILER-CR | 1624 | 1545 | |
| Average Length (in bp) | 757 | 777 | |
| Spacers | |||
| Total number | 16 949 | 17 547 | |
| Repeats | |||
| Average size (in bp) | 31 | 32 |
Summary of the results’ comparison between the CRISPR analysis in this present work and Gogleva et al
| Study | In Gogleva et al., 2014 | The present work | |
|---|---|---|---|
| Cassettes | |||
| Identified by PILER-CR | 121 | 3169 | |
| Average length | 338 bp | 767 bp | |
| Spacers | |||
| Total number | 378 | 36 709 | |
| Repeats | |||
| Total number | 121 | 3169 | |
| Unique repeats | 74 | 1305 |
CRISPR map clustering of the collection of unique repeat sequences from Chinese individuals’ CRISPR cassettes
| Number of repeats | Sequence families associated | Motifs | |
|---|---|---|---|
| Superclass A | 138 | 2, 7 | 6 |
| Superclass B | 102 | 1,6 | 1,2,7 |
| Superclass C | 168 | 5 | 3,4 |
| Superclass D | 32 | None | 11 |
| Superclass E | 48 | 9,24 | 3,4,15,16 |
| Superclass F | 182 | 13,23 | 3,4,15,17,18 |