| Literature DB >> 32082274 |
Sébastien Lhomme1,2,3, Florence Nicot1, Nicolas Jeanne1, Chloé Dimeglio1, Alain Roulet4, Caroline Lefebvre1, Romain Carcenac1, Maxime Manno4, Martine Dubois1, Jean-Marie Peron5, Laurent Alric6, Nassim Kamar2,3,7, Florence Abravanel1,2,3, Jacques Izopet1,2,3.
Abstract
Recombinant strains of hepatitis E virus (HEV) with insertions of human genomic fragments or HEV sequence duplications in the sequence encoding the polyproline region (PPR) were previously described in chronically infected patients. Such genomic rearrangements confer a replicative advantage in vitro but little is known about their frequency, location, or origin. As the sequences of only a few virus genomes are available, we analyzed the complete genomes of 114 HEV genotype 3 strains from immunocompromised (n = 85) and immunocompetent (n = 29) patients using the single molecular real-time sequencing method to determine the frequency, location, and origin of inserted genomic fragments, plus the proportions of variants with genomic rearrangements in each virus quasispecies. We also examined the amino acid compositions and post-translational modifications conferred by these rearrangements by comparing them to sequences without human gene insertions or HEV gene duplications. We found genomic rearrangements in 7/114 (6.1%) complete genome sequences (4 HEV-3f, 1 HEV-3e, 1 HEV-3 h, and 1 HEV-3chi-new), all from immunocompromised patients, and 3/7 were found at the acute phase of infection. Six of the seven patients harbored virus-host recombinant variants, including one patient with two different recombinant variants. We also detected three recombinant variants with genome duplications of the PPR or PPR + X domains in a single patient. All the genomic rearrangements (seven human fragment insertions of varying origins and three HEV genome duplications) occurred in the PPR. The sequences with genomic rearrangements had specific characteristics: increased net load (p < 0.001) and more ubiquitination (p < 0.001), phosphorylation (p < 0.001), and acetylation (p < 0.001) sites. The human fragment insertions and HEV genome duplications had slightly different characteristics. We believe this is the first description of HEV strains with genomic rearrangements in patients at the acute phase of infection; perhaps these strains are directly transmitted. Clearly, genomic rearrangements produce a greater net load with duplications and insertions having different features. Further studies are needed to clarify the mechanisms by which such modifications influence HEV replication.Entities:
Keywords: genomic rearrangement; hepatitis E virus; polyproline region; virus-host recombinant variants; virus-virus recombinant variants
Year: 2020 PMID: 32082274 PMCID: PMC7004952 DOI: 10.3389/fmicb.2020.00001
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Characteristics of the patients infected by HEV-3 strain with genomic rearrangements.
| HEV strain | Pathology of the patient | HEV diagnosis | Chronic/acute HEV infection at the time of diagnosis | Plasma HEV RNA concentration (log copies/ml) | Time between HEV infection and detection of genomic rearrangement |
|---|---|---|---|---|---|
| 6 | Chronic lymphoid leukemia | 2014 | Acute | 7.5 | <2 months |
| 26 | Renal transplant | 2013 | Acute | 5.8 | <2 months |
| 94 | Hepatic transplant | 2013 | Acute | 6.6 | <2 months |
| 93 | Hepatic transplant | 2013 | Chronic | 7.5 | 3 months |
| 154 | Chronic lymphoid leukemia | 2014 | Chronic | 5.3 | 5 months |
| 64 | Cardiac transplant | 2009 | Chronic | 6.2 | 9 months |
| 12 | Light chain myeloma | 2010 | Chronic | 7.7 | 10 months |
Characteristics of the naturally occurring genomic rearrangements in the PPRs of seven HEV genotype 3 strains.
| HEV strain | HEV subtype | Nature of insertion | Number of variants | Name of the recombinant variant | Position in PPR (nt) | Nature of the inserted fragment | Percent of quasi species |
|---|---|---|---|---|---|---|---|
| 6 | 3h | Human insert | 1 | Hepac-6 | 84–239 | Ring finger protein 19A (RNF 19A) | 100% |
| 26 | 3chi-new | Human insert | 2 | Hepac-26-2 | 81–230 | Human ribosomal protein L6 (RPL6) | 21% |
| 94 | 3f | Human insert | 2 | Hepac-94-2 | 57–164 | Ribosomal protein 17S (RPS17) | 63% |
| 93 | 3f | Human insert | 3 | Hepac-93-2 | 166–222 | Eukaryotic translation elongation factor 1 alpha 1 pseudogene 13 (EEF1A1P13) | 12% |
| Hepac-93-3 | 164–242 | 18S ribosomal pseudogene 5 (RNA18SP5) | 31% | ||||
| 154 | 3f | Human insert | 1 | Hepac-154 | 170–247 | Kinesin family member 1B (KIF1B) | 100% |
| 64 | 3f | Human insert | 1 | Hepac-64 | 181–318 | Zinc finger protein 787 (ZNF787) | 100% |
| 12 | 3e | HEV duplication | 3 | Hepac-12-1 | 93–239 | PPR + X-domain | 67% |
| Hepac-12-2 | 85–237 | PPR | 22% | ||||
| Hepac-12-3 | 84–239 | PPR | 11% |
Strains with one wild type variant.
Figure 1Human fragment insertions and duplications in the PPR of seven HEV GC sequences obtained by SMRT and Sanger sequencing. (A) Human fragment inserts. Variants 93-2 and 93-3 were characterized in the same patient. (B) HEV genome duplication and reference sequences. PPR duplications are boxed. Hyphen: gap.
Figure 2Principal Component Analysis of HEV-3 PPR sequences with insertions or duplications. Individual observations, each dot represents a sample. Each clade (3 abjk, 3 chi, 3 efg short or long, and 3ra) is represented by a symbol. The axes show the first two principal components [dimension 1 (dim 1) and dim 2], with the fraction of explained variance in parenthesis. Variables of the two components are detailed in Figure 3.
Figure 3Principal Component Analysis variable circles of correlation. Variables characterizing insertions/duplications are shown in red (positive charge, net load, ubiquitination acetylation, phospohorylation sites). Dim1 is mainly composed of net load (16.4%), ubiquitination (13.3%), acetylation (13.2%), and phosphorylation (11.4%). Dim2 is mainly composed of positive charge (12.2%).
Figure 4Amino acid compositions of PPR sequences with and without insertions/duplications. White bars represent sequences with insertion/duplication (n = 20) and black bars sequences without insertion/duplication (n = 294). Statistical differences between groups are indicated by stars. A p < 0.05 was considered significant.
Impact of insertions on the amino acid composition, physico-chemical properties, and potential new regulation sites.
| Variable | Sequences with human fragment insertions ( | Sequences with HEV genome duplication ( | Sequences without insertions/duplications ( | ||
|---|---|---|---|---|---|
| GC content (%) | 48.5 (44.8; 51.9) | 46 (45.3; 47.4) | 46 (44; 49) | NS | NS |
| Small AA (%) | 5.1 (2; 10.4) | 5.5 (1.6; 12.8) | 5.3 (1.9; 11.4) | NS | NS |
| Tiny AA (%) | 8.3 (2; 10.7) | 6.2 (1.7; 13.3) | 6.7 (1.3; 12) | NS | NS |
| Positive charged AA (%) | 4.2 (1.8; 7.1) | 4.2 (0.82; 5.3) | 2.7 (1.3; 4.5) | <0.01 | NS |
| Negative charged AA (%) | 3.2 (2; 4.5) | 2.3 (1.6; 2.8) | 2.9 (2.6; 4) | NS | <0.01 |
| Charged AA (%) | 3.8 (2; 5.7) | 2.9 (1.5; 4.7) | 2.7 (1.6; 4) | <0.01 | NS |
| Aliphatic AA (%) | 4.7 (3.6; 5.6) | 4.2 (2.3; 5.5) | 4 (2.7; 6.7) | NS | NS |
| Aromatic AA (%) | 1 (0.75; 1.62) | 0.8 (0.45; 1.6) | 1.3 (0; 1.3) | NS | NS |
| Hydrophobic AA (%) | 2.4 (1.8; 5.5) | 2.4 (0.8; 4.7) | 1.9 (1; 4.8) | 0.04 | NS |
| Polar AA (%) | 2.4 (0.8; 6.5) | 1.8 (0.8; 4.8) | 2.5 (1; 4.8) | 0.02 | NS |
| Net load | 5 (3; 6) | 8 (5; 9) | 0 (−1; 1) | <0.001 | <0.001 |
| Ubiquination sites | 5 (3; 6) | 6 (5.5; 6) | 2 (1; 3) | <0.001 | <0.001 |
| Acetylation sites | 5 (3; 6) | 6 (5.5; 6) | 2 (1; 3) | <0.001 | <0.001 |
| Phosphorylation sites | 19 (16; 22) | 19 (17; 19.5) | 10 (8; 14) | <0.001 | <0.01 |
| Methylation sites | 1 (0; 3) | 0 (0; 0.5) | 0 (0; 0) | <0.001 | NS |
| Nuclear export signal sites (presence) | 11 (84.6%) | 7 (100%) | 286 (97.3%) | <0.01 | NS |
| Nuclear localization signal sites (presence) | 3 (23.1%) | 0 (0%) | 1 (0.3%) | <0.001 | NS |
| N-Glycosylation (presence) | 1 (7.7%) | 0 (0%) | 13 (4.4%) | NS | NS |
| C-Glycosylation (presence) | 0 (0%) | 0 (0%) | 3 (1%) | NS | NS |
Data are numbers unless otherwise indicated. Variables are expressed as medians and interquartile ranges for Wilcoxon test, number (%) for chi2 or Fisher’s exact tests. AA: amino acids. NS: not significant. A .
Wilcoxon test.
Fisher’s exact test.
chi2 test.