| Literature DB >> 26923687 |
Jody E Phelan1, Francesc Coll2, Indra Bergval3, Richard M Anthony4, Rob Warren5, Samantha L Sampson6, Nicolaas C Gey van Pittius7, Judith R Glynn8, Amelia C Crampin9,10, Adriana Alves11, Theolis Barbosa Bessa12, Susana Campino13, Keertan Dheda14,15, Louis Grandjean16,17, Rumina Hasan18, Zahra Hasan19, Anabela Miranda20, David Moore21, Stefan Panaiotov22, Joao Perdigao23, Isabel Portugal24, Patricia Sheen25, Erivelton de Oliveira Sousa26, Elizabeth M Streicher27, Paul D van Helden28, Miguel Viveiros29, Martin L Hibberd30, Arnab Pain31, Ruth McNerney32, Taane G Clark33,34.
Abstract
BACKGROUND: Approximately 10% of the Mycobacterium tuberculosis genome is made up of two families of genes that are poorly characterized due to their high GC content and highly repetitive nature. The PE and PPE families are typified by their highly conserved N-terminal domains that incorporate proline-glutamate (PE) and proline-proline-glutamate (PPE) signature motifs. They are hypothesised to be important virulence factors involved with host-pathogen interactions, but their high genetic variability and complexity of analysis means they are typically disregarded in genome studies.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26923687 PMCID: PMC4770551 DOI: 10.1186/s12864-016-2467-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Loci that are highly diverse, with recombination, or under selective pressure
| Gene | Locus | No. SNPs | Diversity |
|
|
| No. sitesb | Lineage specific |
|---|---|---|---|---|---|---|---|---|
|
|
| 3 |
| 1.000 | 1.000 | - | 0 | - |
|
|
| 10 |
|
|
| 10.13 | 3 | - |
|
|
| 3 | 0.0003 |
|
| - | 0 | - |
|
|
| 130 |
|
|
| 10.5 | 49 | 1,3,4 |
|
|
| 49 |
|
| 0.419 | 10.5 | 20 | - |
|
|
| 5 | 0.0005 |
| 0.210 | 9.697 | 6 | - |
|
|
| 2 |
| 1.000 | 1.000 | 9.264 | 4 | - |
|
|
| 9 | 0.0005 |
| 1.000 | 10.495 | 9 | - |
|
|
| 18 | 0.0022 |
| 0.015 | 10.492 | 5 | 4 |
|
|
| 6 | 0.0023 |
| 0.159 | 9.694 | 2 | 4 |
|
|
| 18 | 0.0010 |
|
| 10.433 | 5 | - |
|
|
| 80 | 0.0008 |
|
| 10.5 | 79 | 4 |
|
|
| 8 | 0.0015 |
|
| 10.036 | 5 | 4 |
|
|
| 60 |
|
|
| 5.336 | 33 | 1,2,4 |
|
|
| 31 |
| 0.431 | 1.000 | 10.5 | 21 | - |
|
|
| 19 |
|
| 0.084 | 10.419 | 29 | 4 |
|
|
| 6 | 0.0010 |
| 0.373 | 7.757 | 3 | - |
|
|
| 105 |
|
|
| 7.502 | 54 | 4 |
|
|
| 3 | 0.0022 |
| 1.000 | 10.391 | 2 | - |
π nucleotide diversity, phi recombination, NS not significant
aafter removing sites under selection
bnumber of sites under selection using the Bayes Empirical Bayes method
Bolded refers to π > 0.003 or phi p-value < 0.05
Fig. 1Nucleotide diversity (π) across the genome by lineage; genes with high diversity (π > 0.003) are highlighted. The pe_pgrs3 gene appears to have high nucleotide diversity in all lineages. Some lineage-specific hotspots are seen in lineage 1 (ppe59 and Rv3901c), lineage 3 (Rv2081c) and lineage 4 (ppe57 and ppe60)
Fig. 2Phylogenetic tree constructed with SNPs. a pe/ppe genes (5,404 SNPs, 10 % of the genome); the pe_pgrs3 gene was identified as having SNPs leading to the lineages not perfectly clustering, potential evidence of recombination affecting these loci. b pe/ppe genes excluding pe_pgrs3 (5,572 SNPs, ~10 % of the genome). Clear clustering according to lineage can be seen (Lineage 1 (Indo-Oceanic, green), lineage 2 (East-Asian (Beijing), blue), lineage 3 (East-African-Indian, purple), lineage 4 (Euro-American, red)). Reference genomes are labelled. M. canetti is annotated in cyan
Fig. 3Evidence of recombination at a gene level. A Manhattan plot showing genes that are likely to be recombination hotspots. The (−log10) p-value for recombination is plotted against genome position. All genes with p-values <0.05 are labelled. Genes labelled in colour grey (eccA3, pe_pgrs4, pe_pgrs17, ppe59, Rv3466 and esxV) become statistically non-significant after removing sites under selection
Fig. 4Evidence of positive selection between the pe/ppe and other genes by functional annotation. Distributions of (−log10) p-values for positive selection (evidence of ω >1) across the pe/ppe and other genes by functional annotation