| Literature DB >> 34075073 |
Elena Delgado1, Aurora Fernández-García1,2, Marcos Pérez-Losada3,4, María Moreno-Lorenzo1, Ismael Fernández-Miranda1,5, Sonia Benito1, Vanessa Montero1, Horacio Gil1, Silvia Hernáez6, Josefa Muñoz6, Miren Z Zubero-Sulibarria6, Elena García-Bodas1, Mónica Sánchez1, Jorge Del Romero7, Carmen Rodríguez7, Luis Elorduy8, Elena Bereciartua8, Esther Culebras9, Icíar Rodríguez-Avial9, María Luisa Giménez-Alarcón10, Carmen Martín-Salas11, Carmen Gómez-González12, José J García-Irure13, Gema Cenzual14, Ana Martínez-Sapiña15, María Maiques-Camarero16, Lucía Pérez-Álvarez1, Michael M Thomson17.
Abstract
Circulating recombinant forms (CRFs) contribute substantially to the HIV-1 pandemic. Among 105 CRFs described in the literature, 16 are BF intersubtype recombinants, most of South American origin, of which CRF12_BF is the most widely spread. A BF recombinant cluster identified in Bolivia was suggested to represent a new CRF_BF. Here we find that it belongs to a larger cluster incorporating 39 viruses collected in 7 countries from 3 continents, 22 of them in Spain, most from Bolivian or Peruvian individuals, and 12 in South America (Bolivia, Argentina, and Peru). This BF cluster comprises three major subclusters, two associated with Bolivian and one with Peruvian individuals. Near full-length genome sequence analyses of nine viruses, collected in Spain, Bolivia, and Peru, revealed coincident BF mosaic structures, with 13 breakpoints, 6 and 7 of which coincided with CRF12_BF and CRF17_BF, respectively. In a phylogenetic tree, they grouped in a clade closely related to these CRFs, and more distantly to CRF38_BF and CRF44_BF, all circulating in South America. These results allowed to identify a new HIV-1 CRF, designated CRF89_BF. Through phylodynamic analyses, CRF89_BF emergence was estimated in Bolivia around 1986. CRF89_BF is the fifth CRF member of the HIV-1 recombinant family related to CRF12_BF.Entities:
Year: 2021 PMID: 34075073 PMCID: PMC8169922 DOI: 10.1038/s41598-021-90023-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Epidemiological data of patients residing in Spain studied by us and GenBank accessions of sequences.
| Sample ID | City of sample collection | Region of sample collection | Year of sample collection | Year of HIV diagnosis | Gender | Transmission route | Country of origin | GenBank accession (Pr-RT) | GenBank accession (NFLG) |
|---|---|---|---|---|---|---|---|---|---|
| CU0019 | Cuenca | Castilla-La Mancha | 2016 | 2015 | F | Heterosexual | Spain | MW344905 | |
| CU0020 | Cuenca | Castilla-La Mancha | 2016 | 2009 | M | Heterosexual | Spain | MW344906 | |
| M0849 | Madrid | Madrid | 2016 | 2016 | M | Heterosexual | Bolivia | MW344907 | |
| M1063 | Madrid | Madrid | 2017 | 2017 | F | Heterosexual | Peru | MW344908 | |
| M1079 | Madrid | Madrid | 2017 | 2017 | M | Heterosexual | Peru | MW344909 | MW802822 |
| M1131 | Madrid | Madrid | 2017 | 2017 | M | MSM | Peru | MW344910 | |
| MS0254 | Madrid | Madrid | 2018 | 2018 | M | MSM | Peru | MW344911 | |
| MS0360 | Madrid | Madrid | 2019 | 2019 | M | MSM | Peru | MW344912 | MW802823 |
| NA0239 | Tudela | Navarra | 2016 | 2016 | M | Heterosexual | Peru | MW344913 | |
| NA0379 | Pamplona | Navarra | 2018 | 2018 | F | Heterosexual | Bolivia | MW344914 | |
| P2345 | Bilbao | Basque Country | 2009 | 2009 | F | Heterosexual | Bolivia | MW344915 | |
| P2346 | Bilbao | Basque Country | 2009 | 2009 | M | Sexual | Spain | MW344916 | |
| P2633 | Bilbao | Basque Country | 2010 | 2010 | F | Heterosexual | Bolivia | MW344917 | KX818199 |
| P3174 | Bilbao | Basque Country | 2012 | 2012 | M | Heterosexual | Spain | MW344918 | |
| P3177 | Bilbao | Basque Country | 2012 | 2012 | F | Heterosexual | Bolivia | MW344919 | KX818200 |
| P4464 | Bilbao | Basque Country | 2016 | 2015 | M | MSM | Spain | MW344920 | |
| P5090 | Vitoria | Basque Country | 2018 | 2018 | M | Sexual | Bolivia | MW344921 | |
| TO0275 | Toledo | Castilla-La Mancha | 2020 | 2020 | M | MSM | Spain | MW802824 | |
| Z0275 | Zaragoza | Aragon | 2018 | 2018 | M | n.a | Bolivia | MW344922 | MW802825 |
n.a. not available.
Figure 1Maximum likelihood tree of Pr-RT sequences of BF cluster. Names of sequences obtained by us, all collected in Spain, are in bold type. Two-letter ISO code of country of origin of the individual, when known, is in parentheses after the virus name. In database sequences branching in the BF cluster, the country of sample collection is indicated before the virus name with the two-letter ISO country code. In reference sequences, subtype or CRF is indicated before the virus name. Branches corresponding to references of subtypes A, C, F, G, and H are compressed. Only bootstrap values ≥ 80% are shown.
Figure 2Maximum likelihood tree of NFLG sequences of the BF cluster and of references of CRF_BFs from the Southern Cone of South America and of subtypes. Names of sequences obtained by us are in bold type. In database sequences located in the BF cluster, the country of sample collection is indicated before the virus name with the two-letter ISO country code. In reference sequences, the subtype or CRF is indicated before the virus name. Only bootstrap values ≥ 80% are shown.
Figure 3Bootscan analyses of 9 NFLG genomes of the BF cluster. The horizontal axis represents the position in the HXB2 genome of the midpoint of a 250 nt window moving in 20 nt increments and the vertical axis represents bootstrap values supporting clustering with subtype reference sequences.
Figure 4Phylogenetic trees of genome segments of the BF cluster. HXB2 positions delimiting the analyzed segments are indicated on top of the trees. Sequence names of viruses of the BF cluster are in blue. Names of subtype references are preceded by the corresponding subtype. Node supports of B and F1 clades are indicated, in this order, as ultrafast bootstrap value/aLRT SH-like support/posterior probability, which were obtained with IQ-Tree, PhyML, and MrBayes programs, respectively. For the other nodes, only ultrafast bootstrap values are indicated. Only bootstrap values ≥ 80%, aLRT SH-like values ≥ 0.9, and posterior probabilities ≥ 0.9 are shown.
Figure 5Mosaic structure of CRF89_BF compared to those of CRF12_BF, CRF17_BF, CRF38_BF, and CRF44_BF. Intersubtype breakpoints were determined as the midpoint of B-F1 75% consensus transitions (Supplementary Fig. S5). B and F1 fragments are shown in blue and green, respectively. Vertical red lines indicate coincident breakpoints. Positions correspond to the HXB2 genome.
Figure 6Maximum clade credibility tree of CRF89_BF Pr-RT sequences. Branch colors indicate, for terminal branches, country of sample collection or, if known, of origin of the individual, and for internal branches, the most probable location country of the subtending node, according to the legend on the left. Nodes supported by PP ≥ 0.95 are indicated with filled circles. Most probable countries at the root of the tree and of the three major clusters are indicated, together with PP supporting the locations and tMRCA (mean values, with 95% HPD intervals in parentheses).