| Literature DB >> 30995717 |
Anastasia Murzakova1, Dmitry Kireev2, Pavel Baryshev3, Alexey Lopatukhin4, Ekaterina Serova5, Andrey Shemshura6, Sergey Saukhat7, Dmitry Kolpakov8, Anna Matuzkova9, Alexander Suladze10, Marina Nosik11, Vladimir Eremin12, German Shipulin13, Vadim Pokrovsky14,15.
Abstract
Although HIV-1 subtype A has predominated in Russia since the end of the 20th century, other viral variants also circulate in this country. The dramatic outbreak of HIV-1 subtype G in 1988-1990 represents the origin of this variant spreading in Russia. However, full genome sequencing of the nosocomial viral variant and an analysis of the current circulating variants have not been conducted. We performed near full-length genome sequencing and phylogenetic and recombination analyses of 11 samples; the samples were determined to be subtype G based on an analysis of the pol region. Three samples were reliably obtained from patients infected during the nosocomial outbreak. The other 8 samples were obtained from patients who were diagnosed in 2010-2015. Phylogenetic analysis confirmed that a man from the Democratic Republic of the Congo was the origin of the outbreak. We also found that currently circulating viral variants that were genotyped as subtype G according to their pol region are in fact unique recombinant forms. These recombinant forms are similar to the BG-recombinants from Western Europe, particularly Spain and Portugal. The limitations of subtyping based on the pol region suggest that these viral variants are more widespread in Europe than is currently supposed.Entities:
Keywords: Elista; HIV-1; molecular epidemiology; near full-length genome sequencing; nosocomial outbreak; subtype G; unique recombinant forms
Year: 2019 PMID: 30995717 PMCID: PMC6521041 DOI: 10.3390/v11040348
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Characteristics of the Russian patients for whom near full-length genome sequences were obtained.
| Patient ID | Sample | Gender | Route of Infection | Date of First Positive Immune Blot | Date of Collection | Accession Number |
|---|---|---|---|---|---|---|
| 1 | RU.1989.MF614605 | male | nosocomial outbreak | 1989 | 1989 | MF614605 |
| 2a | RU.2012.MF614606 | male | nosocomial outbreak | 1990 | 2012 | MF614606 |
| 2b | RU.2014.MF614607 | male | nosocomial outbreak | 1990 | 2014 | MF614607 |
| 3 | RU.2014.MF614608 | male | unknown | 2014 | 2014 | MF614608 |
| 4 | RU.2014.MF614609 | male | unknown | 2014 | 2014 | MF614609 |
| 5 | RU.2015.MF614610 | male | sexual contact | unknown | 2015 | MF614610 |
| 6 | RU.2015.MF614611 | male | unknown | 2010 | 2015 | MF614611 |
| 7 | RU.2015.MF614612 | male | unknown | 2014 | 2015 | MF614612 |
| 8 | RU.2016.MF614613 | male | sexual contact | 2010 | 2016 | MF614613 |
| 9 | RU.2016.MF614614 | unknown | unknown | unknown | 2016 | MF614614 |
| 10 | RU.2015.MF614615 | male | unknown | 2015 | 2015 | MF614615 |
Figure 1ML tree (500 bootstrap replicates) of HIV-1 subtype G sequences with a length of more than 8000 bp. The green circles indicate the sequences from the patients of the Russian nosocomial outbreak. The red circle indicates the sequence from the Democratic Republic of the Congo. The blue circles indicate the sequences of the infected patients from Russia who were diagnosed after 2010. The name of each sequence combines the country code, sampling year and accession number separated by dots. Only bootstrap values of more than 70% are shown.
Figure 2Maximum likelihood (ML) tree (500 bootstrap replicates) of the pol region sequences of subtype G. The red circle indicates the sequence from the Democratic Republic of the Congo. The green circles indicate sequences from patients from the nosocomial outbreak. The green triangles indicate sequences from Russian patients diagnosed between 1988 and 2006. The blue circles indicate the near full-length genome sequences of the infected patients from Russia who were diagnosed after 2010. The blue triangles indicate sequences from Russian patients diagnosed between 2007 and 2015. The purple triangles indicate sequences from Russian patients with unknown diagnosis dates. The sequence from the Democratic Republic of the Congo and samples from patients living in the territory covered by the outbreak are highlighted in the green oval. Samples from patients with a first positive immune blot test predominantly after 2007 are highlighted in the yellow oval. The name of each sequence consists of the country code, sampling year and accession number (Los Alamos) or sample name (Russian database) separated by dots. The symbol XXXX marks the samples for which the date of sampling is unknown. Only bootstrap values of more than 60% are shown. *-38% bootstrap support, **-41% bootstrap support.
Figure 3Scheme of the location of recombination patterns according to the jpHMM (jumping profile Hidden Markov Model) results. The patterns of homology are shown as follows: the G subtype is highlighted in green, the B subtype is highlighted in blue, the C subtype is highlighted in brown, the A1 subtype is highlighted in red and the unknown subtype (uncertain regions) is highlighted in grey.
Figure 4The mosaic structure of the new BG-recombinant form. The regions are numbered according to the jpHMM program. Areas III and V are subtype B or uncertain regions. The patterns of homology are shown as follows: the G subtype is highlighted in green and the B subtype is highlighted in blue.
Figure 5Phylogenetic trees for the nucleotide sequences of recombination regions I (A), III (B), IV (C) and V (D) of 11 genomes of subtypes G and BG. The B subtype is highlighted in blue and the G subtype is highlighted in green. BG recombinants are indicated with blue stars, except for MF614610. Sample MF614610 is indicated by a blue rhomb and the samples of subtype G from the outbreak are indicated by green stars.
Figure 6Dated Bayesian phylogenetic tree of subtype G and new BG recombinants. The BG recombinants are highlighted in red, the G subtype from patients from the nosocomial outbreak is highlighted in blue, samples from Europe are highlighted in green and samples from Africa are highlighted in yellow and black. The branch from Elista outbreak samples is shown in red and the branch from the BG recombinants is shown in yellow. The horizontal scale axis at the bottom corresponds to the calendar year. Nodes of branches contain the average 95% highest posterior density (HPD).