| Literature DB >> 22723863 |
Vincent Foulongne1, Virginie Sauvage, Charles Hebert, Olivier Dereure, Justine Cheval, Meriadeg Ar Gouilh, Kevin Pariente, Michel Segondy, Ana Burguière, Jean-Claude Manuguerra, Valérie Caro, Marc Eloit.
Abstract
The human skin is a complex ecosystem that hosts a heterogeneous flora. Until recently, the diversity of the cutaneous microbiota was mainly investigated for bacteria through culture based assays subsequently confirmed by molecular techniques. There are now many evidences that viruses represent a significant part of the cutaneous flora as demonstrated by the asymptomatic carriage of beta and gamma-human papillomaviruses on the healthy skin. Furthermore, it has been recently suggested that some representatives of the Polyomavirus genus might share a similar feature. In the present study, the cutaneous virome of the surface of the normal-appearing skin from five healthy individuals and one patient with Merkel cell carcinoma was investigated through a high throughput metagenomic sequencing approach in an attempt to provide a thorough description of the cutaneous flora, with a particular focus on its viral component. The results emphasize the high diversity of the viral cutaneous flora with multiple polyomaviruses, papillomaviruses and circoviruses being detected on normal-appearing skin. Moreover, this approach resulted in the identification of new Papillomavirus and Circovirus genomes and confirmed a very low level of genetic diversity within human polyomavirus species. Although viruses are generally considered as pathogen agents, our findings support the existence of a complex viral flora present at the surface of healthy-appearing human skin in various individuals. The dynamics and anatomical variations of this skin virome and its variations according to pathological conditions remain to be further studied. The potential involvement of these viruses, alone or in combination, in skin proliferative disorders and oncogenesis is another crucial issue to be elucidated.Entities:
Mesh:
Year: 2012 PMID: 22723863 PMCID: PMC3378559 DOI: 10.1371/journal.pone.0038499
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Workflow of the sequences analysis, from raw data to assignment.
| Samples | 100066 | 100067 | 100069 | 100070 | 100072 | 100073 | ||||||||||
| Total reads, raw data | 8,052,770 | 10,354,496 | 9,107,144 | 8,196,240 | 7,588,712 | 10,281,130 | ||||||||||
| Total reads, Soap and BlastN filtered | 2,849,108 (100%) | 4,556,951 (100%) | 7,690,427 (100%) | 1,572,456 (100%) | 4,422,817 (100%) | 5,255,640 (100%) | ||||||||||
|
| ||||||||||||||||
| Contig number (reads number) | 9,242 (2,786,250) | 105,232 (4,228,142) | 25,563 (7,521,632) | 87,576 (1,432,804) | 11,765 (4,348,806) | 13,948 (5,196,480) | ||||||||||
| % total reads | 97.8% | 92.8% | 97.8% | 91.1% | 98.3% | 98.9% | ||||||||||
| Singletons (Unassembled reads) | 62,858 | 328,809 | 168,795 | 139,652 | 74,011 | 59,160 | ||||||||||
| % total reads | 2.2% | 7.2% | 2.2% | 8. 9% | 1.7% | 1.1% | ||||||||||
|
| ||||||||||||||||
| % Assigned reads, contigs + singletons(reads number) | 99.7% (2,841,768) | 90.6% (4,131,010) | 99.4% (7,646,231) | 94.1% (1,480,258) | 98.6% (4,359,580) | 99.5% (5,229,528) | ||||||||||
| Contig number (reads number) | 8,521 (2,783,976) | 70,765 (3,950,373) | 21,595 (7,488,486) | 67,204 (1,374,873) | 10,898 (4,317,823) | 13,107 (5,176,684) | ||||||||||
| Contig average lenght (nt) | 216 | 197 | 191 | 157 | 309 | 219 | ||||||||||
| % Assigned singletons (reads number) | 2.0% (57,792) | 3.9% (180,637) | 2.0% (157,745) | 6.7% (105,385) | 0.9% (41,757) | 1.0% (52,844) | ||||||||||
|
| ||||||||||||||||
| % Unassigned reads, contigs + singletons (reads number) | 0.3% (7,340) | 9.4% (425,941) | 0.6% (44,196) | 5.9% (92,198) | 1.4% (63,237) | 0.5% (26,112) | ||||||||||
| Contig number (reads number) | 721 (2,274) | 34,567 (277,769) | 3,968 (33,146) | 20,372 (57,931) | 867 (30,983) | 841 (19,796) | ||||||||||
| Contig average lenght (nt) | 138 | 143 | 125 | 126 | 162 | 144 | ||||||||||
| Contig maximum lenght (nt) | 975 | 1,396 | 1,480 | 1,665 | 2,079 | 3,139 | ||||||||||
| Singletons | 5,066 | 148,172 | 11,050 | 34,267 | 32,254 | 6,316 | ||||||||||
Relative proportion of the cutaneous microbiome in each sample.
| Samples | 100066 | 100067 | 100069 | 100070 | 100072 | 100073 |
| Total reads, Soap andBlastN filtered |
|
|
|
|
|
|
|
| ||||||
| Archea |
|
|
|
|
|
|
| Eubacteria |
|
|
|
|
|
|
| Plasmids |
|
|
|
|
|
|
| Phages |
|
|
|
|
|
|
| Eukaryotes* |
|
|
|
|
|
|
| Other** |
|
|
|
|
|
|
| Eukaryotic DNA virus |
|
|
|
|
|
|
| Unassigned reads |
|
|
|
|
|
|
-*Eukaryotes denotes Fungi, Viridiplantae, Protozoa and Metazoa sequences.
- **Other denotes metagenome, vectors, RNA virus, endogenous retrovirus and unclassified sequences.
Figure 1Cutaneous virome.
Relative abundance of reads from beta, gamma and unclassified papillomavirus, from circoviruses and other single strand DNA viruses and from the respective polyomaviruses (MCPyV, HPyV6, 7 and 9) within the whole eucaryotic DNA viral reads detected on the skin surface of each sample.
Figure 2MCPyV genome sequence coverage versus MCPyV viral load in the six samples.
Genome coverage is expressed as number of reads per nucleotide and viral load is in genome copies per ng of whole DNA. Coefficient of correlation is: R = 0.894; and best fit regression equation is: [MCPyV viral load] = 4.1 [MCPyV coverage]- 28,850.
Human papillomavirus sequences assembly and assignment in each sample.
| Sample | 100066 | 100067 | 100069 | 100070 | 100072 | 100073 | ||||||||||||||||
| HPVs family | β-HPV | γ-HPV | uc# | β-HPV | γ-HPV | uc | β-HPV | γ-HPV | uc | β-HPV | γ-HPV | uc | β-HPV | γ-HPV | uc | β-HPV | γ-HPV | uc | ||||
| N. total Contigs | 170 | 68 | 81 | 76 | 32 | 12 | 37 | 9 | 15 | 97 | 32 | 20 | 25 | 18 | 26 | 5 | 2 | 2 | ||||
| N. Contigs >500nt | 48 | 3 | 14 | 23 | 6 | 1 | 14 | 4 | 3 | 26 | 18 | 4 | 5 | 6 | 1 | 4 | 2 | 2 | ||||
| N. Contigs >500nt with partial L1 gene | 21 | 0 | 6 | 13 | 6 | 1 | 10 | 4 | 0 | 11 | 11 | 3 | 3 | 5 | 0 | 4 | 2 | 2 | ||||
| N. Contigs >500nt with whole L1 cds | 17 | 0 | 0 | 9 | 3 | 1 | 6 | 4 | 0 | 7 | 6 | 0 | 3 | 3 | 0 | 4 | 2 | 2 | ||||
|
| ||||||||||||||||||||||
| <70% ≥70% <90% >90% | 0 | 0 | 0 | 0 |
| 0 | 0 |
| 0 | 0 |
| 0 | 0 |
| 0 | 0 |
| 0 | ||||
| 4 | 0 | 0 | 2 | 0 | 1 | 3 | 2 | 0 | 1 | 4 | 0 | 0 |
| 0 | 1 | 1 | 2 | |||||
| 13 | 0 | 0 | 7 | 0 | 0 | 3 | 0 | 0 | 6 | 0 | 0 | 3 | 0 | 0 | 3 | 0 | 0 | |||||
| HPV closest relative (% similarity) | HPV110 (99%) | HPV110 (100%) |
| HPV148 (82%) | HPV80 (83%) | HPV133 (72%) | HPV12 (99%) |
| HPV14D (99%) |
| HPV38b (99%) |
| HPV148 (79%) | |||||||||
| SIBX-3a (98%) | HPV113 (81%) |
| HPV110 (100%) |
| HPV38b (99%) |
| HPV5 (99%) |
| HPV80 (99%) | HPV48 (80%) | HPV148 (71%) | |||||||||||
| SIBX-3a (87%) | HPV150 (99%) |
| HPV12 (99%) |
| HPV75 (99%) | HPV4 (72%) | HPV93 (94%) |
| HPV93 (94%) | |||||||||||||
| HPV21 (99%) | HPV38b (99%) | HPV150 (81%) |
| HPV110 (100%) | HPV131 (75%) | HPV124 (78%) | ||||||||||||||||
| HPV36 (99%) | HPV76 (99%) | HPV20 (99%) | HPV80 (86%) | HPV109 (75%) | ||||||||||||||||||
| HPV5 (99%) | HPV9 (99%) | HPV120 (85%) | HPV8 (99%) | HPV48 (70%) | ||||||||||||||||||
| HPV20 (98%) | HPV96 (85%) | HPV115 (99%) | ||||||||||||||||||||
| HPV22 (99%) | HPV12 (99%) | |||||||||||||||||||||
| HPV37 (99%) | SIBX-3a (99%) | |||||||||||||||||||||
| HPV14D (99%) | ||||||||||||||||||||||
| HPV122 (76%) | ||||||||||||||||||||||
| HPV124 (99%) | ||||||||||||||||||||||
| HPV124 (81%) | ||||||||||||||||||||||
| FA75 (99%) | ||||||||||||||||||||||
| HPV24 (97%) | ||||||||||||||||||||||
| HPV98 (99%) | ||||||||||||||||||||||
| HPV118 (76%) | ||||||||||||||||||||||
# uc denotes unclassified HPVs.
Whole genome sequences confirmed through Sanger sequencing and submitted into Genbank.
New HPV sequences appear in bold and sequences labeled with * are those related to complete genome submitted into Genbank with the JF966371 to JF966379 respective accession numbers (correspondences are displayed in table 4).
Genomic organization of 9 new Human gamma-papillomavirus sequences.
| GenBank accession number | JF966371* | JF966372 | JF966374 | JF966373 | JF966375* | JF966376 | JF966377 | JF966378** | JF966379** |
| Genome lenght (nt) | 7300 | 7299 | 7251 | 7167 | 7286 | 7219 | 7265 | 7152 | 7095 |
| Most closely related HPV type (accession number) |
|
|
|
|
|
|
|
|
|
| Nucleotide (amino acid) sequence similarity (%) of L1 | 66,6 (77.4) | 62.5 (73.3) | 66.2 (80.3) | 67.3 (79.9) | 65.5 (77.8) | 69.7 (82.1) | 65.3 (74.8) | 79.2 (92.6) | 70.9 (83.0) |
| E6 | 103–519 (138) | 103–528 (141) | 103–627 (174) | 103–546 (147) | 103–546 (147) | 103–528 (141) | 103–525 (140) | 103–519 (138) | 103–519 (138) |
| E7 | 521–805 (94) | 525–815 (96) | 630–926 (98) | 521–814 (97) | 521–808 (95) | 533–823 (96) | 522–815 (97) | 519–800 (93) | 519–80 0 (93) |
| E1 | 789–2615 (608) | 799–2640 (613) | 913–2724 (603) | 798–2603 (601) | 792–2615 (607) | 810–2642 (610) | 802–2622 (606) | 784–2592 (602) | 784–2580 (598) |
| E2 | 2557–3750 (397) | 2573–3754 (393) | 2660–3829 (389) | 2539–3750 (403) | 2557–3744 (395) | 2578–3744 (388) | 2558–3766 (402) | 2525–3724 (399) | 2513–3685 (390) |
| E4 | 3023–3508 (161) | – | 3033–3590 (185) | 3011–3508 (165) | 3023–3505 (160) | 3041–3508 (155) | 3090–3533 (147) | 3108–3482 (124) | 3096–3443 (115) |
| L2 | 3752–5350 (532) | 3761–5305 (514) | 3831–5333 (500) | 3750–5279 (509) | 3746–5341 (531) | 3747–5273 (508) | 3771–5300 (509) | 3724–5232 (502) | 3687–5180 (497) |
| L1 | 5307–6857 (516)$ | 5316–6872 (518) | 5342–6925 (527) | 5269–6816 (515)$ | 5313–6848 (511) | 5282–6841 (519) | 5248–6858 (536)$ | 5244–6776 (510) | 5191–6717 (508) |
For HPV's genes E6 to L1, open reading frame positions and numbering are indicated with the size of the respective predicted protein in brackets.
- $ deduced from the second start codon likely predicted for in vivo use.
- * JF966371 and JF966375 are two strains from the same new HPV species.
- **JF966378 and JF966379 are two strains from the recently described HPV148 species.
Figure 3Phylogenic tree of L1 nucleotide sequences.
Bayesian phylogeny (using the Tamura-Nei model TN93 with gamma distribution and invariant sites) of the main alpha, beta, gamma, mu and nu papillomaviruses infecting human inferred from their L1 nucleotide sequences (the list of the taxa included all sequences available from the PAVE database and are available at: http://pave.niaid.nih.gov/#prototypes?type=human). The new sequences (red branches) and species reported here belongs to the Gammapapillomavirus genus only, therefore, Alphapapillomavirus and Betapapillomavirus subtrees have been collapsed for clarity reasons. The tree is rooted by a bird Etapapillomavirus: the Fringilla coelebs Papillomavirus (FCPV). The added animal papillomaviruses to the phylogeny are the Roussetus aegyptiacus papillomavirus type 1 (RAP), the Bovine papillomavirus 8 (BPV8) and the Ovine papillomavirus (OPV) type 3. Posterior probabilities are reported for each node and sequences obtained during this study are depicted by grey/red branches. Each Human papillomavirus is noted HPV followed by the number of the species and by its Genbank accession number.