| Literature DB >> 31212650 |
Young-Keol Cho1, Jung-Eun Kim2, Brian T Foley3.
Abstract
We determined the earliest full-length HIV-1 gag gene sequences in 110 patients with HIV-1, including 20 hemophiliacs (HPs) and 90 local controls (LCs). The gag gene from stored sera was amplified using RT-PCR, and was subjected to direct sequencing. Phylogenetic analysis indicated that 94 and 16 sequences belonged to the Korean subclade of HIV-1 subtype B (KSB) and subtype B, respectively. A total of 12 signature pattern amino acids were found within the KSB, distinct from the worldwide consensus of subtype B. Within the KSB, the gag gene sequences from donors O and P and those from the 20 HPs comprised two subclusters. In particular, sequences from donor O strongly clustered with those of eight HPs. Moreover, signature pattern analysis indicated that 14 signature nucleotides were shared between the HPs and LCs within KSB (p < 0.01). Among the 14 nucleotides, positions 9 and 5 belonged to clusters O and P, respectively. In conclusion, signature pattern analysis for the gag gene revealed 12 signature pattern residues within the KSB and also confirmed the previous conclusion that the 20 HPs were infected with viruses due to incompletely inactivated clotting factor IX. This study is the first genetic analysis of the HIV-1 gag gene in Korea.Entities:
Keywords: HIV-1 gag gene; Korean subclade of subtype B; hemophiliacs; phylogenetic analysis; signature pattern
Year: 2019 PMID: 31212650 PMCID: PMC6631484 DOI: 10.3390/v11060545
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Phylogenetic tree of the earliest full-length gag gene sequences (1503 bp) of 110 Korean HIV-1 patients: 20 hemophiliacs (HPs); plasma donors O, P, and R; 71 local control patients infected with the Korean subclade of HIV-1 subtype B (KSB); and 16 non-KSB-infected patients (including three HPs, 21–23) infected with subtype B. Ninety-six sequences belonged to KSB (shown as a red horizontal line), and 16 subtype B Korean sequences and reference sequences are shown as a dark horizontal line. Nine patients, including donor O (Cluster O: 1–4, 6, 8, 10, and 18), strongly clustered within the KSB-infected local control patients. Sequences from donor P clustered with 10 HPs, leaving only two sequences—HPs 5 and 9—outside cluster P, although in more conserved full-length pol gene analysis, all 12 HPs clustered with donor P [2]. Three HPs (21–23) were infected by clotting factor purchased outside Korea (blue color). The two digits prior to patient IDs and the one or two digits after patient IDs denote the year and month of sampling, respectively. The reliability of the tree was evaluated by 1000 bootstrap replicates. The asterisks in the node show subclasses with more than 75% bootstrap support.
Frequency of the signature pattern nucleotides in the gag gene in 94 KSB-infected patients.
| 14 Signature Nucleotide Positions | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Nucleotide positions in HXB2 | 807 | 823 | 940 | 1012 | 1151 | 1264 | 1281 | 1674 | 1707 | 1722 | 1897 | 1899 | 2155 | 2190 |
| Nucleotide in HXB2 | A | A | T | C | A | A | C | C | C | G | G | A | A | A |
| Signature nucleotide | G | T | T | T | A | A | G | T | G | |||||
| Cluster-O ( | 1.0a | 1.0 | 1.0 | 1.0 | 1.0 | 0.89 | 1.0 | 1.0 | 1.0 | |||||
| Other KSB ( | 0.04b | 0.13 | 0.05 | 0.11 | 0.05 | 0.02 | 0.00 | 0.13 | 0.13 | |||||
| Signature nucleotide | G | C | A | G | G | |||||||||
| Cluster-P ( | 0.85 | 1.0 | 0.92 | 1.0 | 0.92 | |||||||||
| Other KSB ( | 0.25 | 0.04 | 0.12 | 0.30 | 0.40 | |||||||||
Cluster O contained “PSAP” in the “PTAP motif,” as it had a T instead of A at position 2155. Nucleotide positions in HXB2 indicate the position of the nucleotide in the second row. The fourth and seventh lines show the frequencies of these nine and five signature nucleotides among the hemophiliacs’ viral sequences and the fifth and eighth lines show their frequencies among the 85 and 81 background local control sequences (other KSB). 1a and 0.04b denote 100% and 4% of all sequences within cluster O and other KSB, respectively. We did not annotate values because there was no statistically significant difference between clusters O and P, and between each cluster and other KSBs.
Signature pattern residues in the Gag proteins of 110 infected patients.
| Signature Pattern Amino Acids at 12 Residues | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Position in HXB2 | 30 | 34 | 62 | 94 | 102 | 114 | 119 | 138 | 389 | 403 | 483 | 490 |
| KSB ( | R83 | L78 | G86 | I89 | E77 | K93 | T89 | L93 | T74 | K89 | L88 | R84 |
| k11 | i14 | e5 | v5 | d14 | n1 | a5 | m1 | i10 | r5 | p3 | k20 | |
| v2 | a2, r1 | k1, h1 | p10 | q2, k2, h1 | ||||||||
| Non-KSB ( | k13 | I20 | g15 | V19 | e15 | k13 | A24 | l15 | i9 | r14 | l13 | k15 |
| r9 | l4 | e6 | i5 | d10 | r7 | t1 | m5 | t6 | k11 | m9 | r10 | |
| q3 | i2 | t4 | i3, v2 | n3, s3 | r2 | |||||||
| v1 | n1 | m3, g1 | q1 | |||||||||
Twelve residues were found to be signature pattern residues based on the criteria (>75%) applied in a previous study [2]. Signature amino acids (capital letters) in all 12 sites of the KSB were derived from the various amino acids of the non-KSB subtype B. Five positions (62, 102, 114, 138, and 483) were derived from major amino acids, and the remaining seven positions were derived from minor amino acids from non-KSB subtype B. This finding provides evidence for a founder effect, because KSB was derived from its parent, subtype B [1,2].
Comparison of the frequency of the signature pattern nucleotides among six genes.
| 10 Specific Nucleotides | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene |
|
|
|
|
|
|
| |||||
| No. of KSB ( | 94 | 91 [ | 127 [ | 52* | 52* | 77 [ | 143 [ | |||||
| Nucleotide position | 1899 | 2321 | 4235 | 5070 | 5253 | 5621 | None | 6473 | 8252 | 8670 | 8774 | None |
| Signature | G | G | A | A | T | C | T | |||||
| Cluster-O ( | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | |||||
| Other KSB ( | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |||||
| Signature | G | A | G | |||||||||
| Cluster-P ( | 1.0 | 1.0 | 1.0 | |||||||||
| Other KSB ( | 0.0 | 0.0 | 0.0 | |||||||||
n = the number of patients infected with KSB, including 20 hemophiliacs and local controls. Position 5070 also belongs to the vif gene. Nucleotide positions in the third row indicate the position of the nucleotides in the fourth and seventh rows. The fifth and eighth lines show the frequencies of these seven and three signature nucleotides among the hemophiliacs’ viral sequences and the sixth and ninth lines show their frequencies among the 85 and 81 background local control sequences (other KSB). 1 and 0.0 denote 100% and 0% of all sequences within cluster O and other KSB, respectively. *Unpublished data.
Comparison of sequence identity with donors O and P within each cluster.
| Gene | Cluster O ( | Cluster P ( | KSB ( |
|---|---|---|---|
|
| 99.1 ± 0.5 | 99.2 ± 0.4a | 97.1 |
|
| 98.9 ± 0.2 | 98.9 ± 0.5 | 95.2 |
|
| 98.9 ± 0.5 | 98.9 ± 0.7 | 96.2 |
|
| 97.6 ± 1.6 | 97.3 ± 2.1a, b | 95.6 |
|
| 96.3 ± 1.7 | 94.5 ± 3.2b | 89.8 |
ap < 0.01 and b p < 0.05 in cluster P only. p < 0.05 between the gag and nef genes and p < 0.01 between the gag and env genes in both clusters. p < 0.05 between the vif and env genes in cluster O and between the vif and nef genes in both clusters. The sequence identity of clusters O and P in all genes was higher than that of published KSB (n = 19) by ≥2%.
Figure 2Correlation of the time lag (months) between the outbreak (around January 1990) and sampling and the number of nucleotide differences in the 20 HPs, compared to corresponding plasma donors. Four patients (HPs 9, 15, 19, and 20) who provided serum samples after 2002 exhibited the most differences (≥29). The correlation coefficient, γ, was 0.89 (p < 0.0001). Eight hemophiliacs infected with HIV-1 originated from plasma donor O and 12 hemophiliacs infected from plasma donor P are indicated with white and black circles, respectively.