| Literature DB >> 9841928 |
F Matsuda1, K Ishii, P Bourvagnet, K i Kuma, H Hayashida, T Miyata, T Honjo.
Abstract
The complete nucleotide sequence of the 957-kb DNA of the human immunoglobulin heavy chain variable (VH) region locus was determined and 43 novel VH segments were identified. The region contains 123 VH segments classifiable into seven different families, of which 79 are pseudogenes. Of the 44 VH segments with an open reading frame, 39 are expressed as heavy chain proteins and 1 as mRNA, while the remaining 4 are not found in immunoglobulin cDNAs. Combinatorial diversity of VH region was calculated to be approximately 6,000. Conservation of the promoter and recombination signal sequences was observed to be higher in functional VH segments than in pseudogenes. Phylogenetic analysis of 114 VH segments clearly showed clustering of the VH segments of each family. However, an independent branch in the tree contained a single VH, V4-44.1P, sharing similar levels of homology to human VH families and to those of other vertebrates. Comparison between different copies of homologous units that appear repeatedly across the locus clearly demonstrates that dynamic DNA reorganization of the locus took place at least eight times between 133 and 10 million years ago. One nonimmunoglobulin gene of unknown function was identified in the intergenic region.Entities:
Mesh:
Substances:
Year: 1998 PMID: 9841928 PMCID: PMC2212390 DOI: 10.1084/jem.188.11.2151
Source DB: PubMed Journal: J Exp Med ISSN: 0022-1007 Impact factor: 14.307
Figure 1Organization of the human immunoglobulin VH locus. The 957-kb DNA is represented by the four collections of thick horizontal lines with the 3′ end at the bottom right corner. VH segments belonging to different VH families are indicated by vertical lines of different colors with their names on the upper row. Pseudogenes and newly identified VH segments are indicated with a P and an asterisk at the end of the name, respectively. Full height vertical lines represent VH segments without truncation while those containing truncation at the 5′-, 3′-, and both 5′- and 3′-portions are indicated by half-height upper, lower, and middle lines, respectively. An enlarged physical map of the 39-kb DNA of the human D gene cluster is also shown. Locations of D segments of six families are shown by diamonds of different colors with their names. Eight nonimmunoglobulin genes are shown with their names by short arrows of different colors indicating the transcriptional orientation. 13 locus-specific homology units are indicated by boxes of different colors in the middle row. Different classes of sequences are shown in the lower row: (a) Alu (red), MIR (magenta); (b) LINE1 (green), LINE2 (dark green); (c) retrotransposons (yellow), retroviral and other LTRs (blue); (d) DNA transposons (black); (e) medium reiteration frequency repetitive sequences (purple); and (f) simple repeats (cyan). DNA clones covering the locus are shown at the bottom. The YAC clone Y13.3, cosmid clones M146, U22-1, U22, and M83, as well as P1 clones H10 and A1 were newly isolated in this study whereas the others have been described previously (8). The nucleotide sequence was deposited in DDBJ/GenBank/EMBL database under the accession number AB019437-AB019441.
Summary of the Human VH Segments
| Classification | VH family | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | Total | |||||||||
| Functional | 9 | 3 | 19 | 6 | 1 | 1 | 0 | 39 | ||||||||
| Transcribed | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | ||||||||
| ORF | 0 | 0 | 3 | 0 | 0 | 0 | 1 | 4 | ||||||||
| Pseudogene | ||||||||||||||||
| Point mutation | 3 | 1 | 21 | 2 | 0 | 0 | 2 | 29 | ||||||||
| Truncation | 2 | 0 | 22 | 23 | 1 | 0 | 2 | 50 | ||||||||
| Total | 14 | 4 | 65 | 32 | 2 | 1 | 5 | 123 | ||||||||
Summary of the Human V Segments
| Name | bp from JH1 | 5′ regulatory region | ATG | gt/ag | RSS | Defects in the pseudogenes | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Heptamer | (bp) | Octamer | (bp) | TATA | (bp) | 7mer | (bp) | 9mer | ||||||||||||||||||
| Functional1–2 | 121362 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATAC | 82 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–3 | 139937 | CTCATGA | 2 | ATGCAAAT | 8 | TGACTAT | 77 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–8 | 207771 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATAT | 81 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–18 | 310253 | TTCATGA | 2 | ATGCAAAT | 12 | TATAGAT | 76 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–24 | 401835 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATAC | 80 | + | gc/ag | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–45 | 631622 | CTCATCA | 2 | ATGCAAAT | 19 | TAAATAT | 81 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–46 | 635740 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATAT | 81 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 1–58 | 747064 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATAT | 81 | + | gc/ag | CACAGTG | 23 | TCAGAAACG | ||||||||||||||
| 1–69 | 838623 | CTCATGC | 2 | ATGCAAAT | 19 | TAAATAT | 81 | + | + | CACAGTG | 23 | TCAGAAACC | ||||||||||||||
| 2–5 | 162833 | — | — | ATGCAAAT | 26 | TTGAAAA | 42 | + | + | CACAAAG | 23 | ACAAAAACC | ||||||||||||||
| 2–26 | 426348 | — | — | ATGCAAAT | 26 | TTCAAAA | 41 | + | + | CACAGAG | 23 | ACAAGAACC | ||||||||||||||
| 2–70 | 847518 | — | — | ATGCAAAT | 26 | TTCAAAA | 41 | + | + | CACAGAG | 23 | ACAAGAACC | ||||||||||||||
| 3–7 | 187109 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–9 | 220985 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACAAAAACC | ||||||||||||||
| 3–11 | 241936 | — | — | ATGCAAAT | 18 | ATAAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–13 | 254843 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–15 | 279028 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–20 | 336289 | — | — | ATGCAGGT | 17 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACG | ||||||||||||||
| 3–21 | 360380 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–23 | 393910 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–30 | 459712 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–33 | 484429 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–43 | 594900 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACAAAAACC | ||||||||||||||
| 3–48 | 662523 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–49 | 681653 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–53 | 717376 | — | — | ATCCAAAT | 18 | ATGAAAA | 98 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–64 | 782450 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | GCAGAAACC | ||||||||||||||
| 3–66 | 799737 | — | — | ATGCAAAT | 18 | ATGAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–72 | 867647 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGCG | 23 | ACACAAACC | ||||||||||||||
| 3–73 | 879647 | — | — | ATGCAAAT | 19 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 3–74 | 887385 | — | — | ATGCAAAT | 18 | AAGAAAA | 90 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 4–4 | 146795 | — | — | ATGCAAAT | 39 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 4–31 | 473900 | — | — | ATGCAAAT | 38 | TTAAATT | 59 | + | + | CACAATG | 23 | ACACAAACC | ||||||||||||||
| 4–34 | 498280 | — | — | ATGCAAAT | 39 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACAAAAACC | ||||||||||||||
| 4–39 | 546311 | — | — | ATGCAAAT | 39 | TTAAATT | 58 | + | + | CACAGTG | 23 | ACAAAAACC | ||||||||||||||
| 4–59 | 751941 | — | — | ATGCAAAT | 39 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACAAAAACC | ||||||||||||||
| 4–61 | 763817 | — | — | ATGCAAAT | 39 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| 5–51 | 703418 | — | — | ATGCAAAT | 18 | ACTTAAA | 79 | + | + | CACAGTG | 23 | CTAAAACCC | ||||||||||||||
| 6–1 | 74312 | — | — | AGGCAAAT | 19 | TTTAAAT | 78 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| Transcribed | ||||||||||||||||||||||||||
| 4–28 | 449201 | — | — | ATGCAAAT | 38 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACACAAACC | ||||||||||||||
| ORF | ||||||||||||||||||||||||||
| 3–16 | 290601 | — | — | ATGCAAAT | 18 | ATGAAAA | 94 | + | + |
| 23 | ACACAAACC | ||||||||||||||
| 3–35 | 514030 | — | — | ATGCAAAT | 18 | ATAAAAA | 95 | + | + | CACTGAG | 23 | ACACAAACC | ||||||||||||||
| 3–38 | 535112 | — | — | — | — | — | — | + | + |
| 23 | ACACAAACC | 5′-T(13 bp upstream of −19) | |||||||||||||
| 7–81 | 951482 | TTCATGA | 2 | ATGCAAAT | 8 | GGAATAT | 79 | + | + | CACCATG | 23 | TCAGAAATC | ||||||||||||||
| Pseudogene with point mutation(s) | ||||||||||||||||||||||||||
| 1–17P | 299723 | CTCATGA | 2 | ATGCAAAT | 19 | TAAATTT | 79 | + | + | CACAGTG | 23 | TCAGAAACC | 1 bp-I(46) | |||||||||||||
| 1–67P | 805315 | CTCATGA | 2 | ACGCAAAT | 16 | TACAGAT | 77 | + | + | CACAGTG | 23 | TCAG | S(36), 4 bp-I(31) | |||||||||||||
| 1–68P | 828559 | CTCATGA | 2 | ATGTAAAT | 18 | TAAATAT | 76 | + | gt/gg | CACGGTG | 23 | TCAGGAACC | S(15), 1 bp-D(−13) | |||||||||||||
| 2–10P | 228236 | — | — | ATGCAAAT | 26 | TTGAAAA | 44 | + | + | CACAGAG | 23 | ACAAGAACC | S(36,53) | |||||||||||||
| 3–6P | 180487 | — | — | ACGCAAAT | 18 | ATGAAAA | 98 | + | + |
| 23 | ACACAAACC | 1 bp-D(16) | |||||||||||||
| 3–22P | 383077 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | S(59) | |||||||||||||
| 3–25P | 414329 | — | — | ATGGAAAT | 18 | ATAAAAA | 100 | + | + | CACAGTG | 23 | ACACAAACC | S(49,91) | |||||||||||||
| 3–29P | 456071 | — | — | — | 14 | GTGAAAA | 101 | + | + | C | 23 | ACACAAAAT | S(−13,33,47,94) | |||||||||||||
| 3–30.2P | 469013 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | C | 23 | ACACA | S(−2,−1,33,47,61,94) | |||||||||||||
| 3–32P | 480784 | — | — | ATGAAAAC | 18 | GTGAAAT | 101 | + | + | C | 23 | ACACAACAT | S(−13,33,47,94) | |||||||||||||
| 3–33.2P | 493733 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | C | 23 | ACACA | S(−2,−1,33,47,94) | |||||||||||||
| 3–37P | 521282 | — | — | ATGCAAAT | 21 | ATGAAAA | 101 | + | + | CA | 23 | CCAGAAACC | S(22), 1 bp-D(16,21), 2 bp- D(63), 10 bp-D(90-93) | |||||||||||||
| 3–41P | 567750 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | S(47), 1 bp-D(71) | |||||||||||||
| 3–47P | 643217 | — | — | — | — | ATGAAAA | 101 | AGG | + | CACAGTG | 23 | ATACAAACT | ATG to AGG (−19), 5′-T (108 bp upstream of −19) | |||||||||||||
| 3–50P | 690799 | — | — | AAGAAAAT | 18 | ATGAAAA | 100 | ATA | gc/ag | C | 23 | ACACAAAAT | S(−2,33,36,47,58,92), 1 bp- D(16), ATG to ATA (−19) | |||||||||||||
| 3–52P | 711066 | — | — | ATGCAAAC | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | S(9) | |||||||||||||
| 3–54P | 726051 | — | — | ATGCAAAT | 18 | ATGACCA | 93 | + | + | C | 23 | ACACA | S(33,47,52,52A,94) | |||||||||||||
| 3–60P | 755914 | — | — | AAGCAAAT | 18 | CTGAAAA | 101 | + | + | C | 20 | ACACAAACC | S(66), 1 bp-I(16) | |||||||||||||
| 3–62P | 767844 | — | — | ATGCGAAT | 18 | ATGAAAA | 98 | + | + | C | 20 | ACACAAACC | S(46) | |||||||||||||
| 3–63P | 776951 | — | — | ATGAAAAC | 18 | GTGAAAA | 99 | + | + | C | 23 | ACACAAAAT | S(33,94) | |||||||||||||
| 3–65P | 790804 | — | — | ATGCAAAT | 18 | AAGAAAA | 103 | + | at/ag | CACAGTG | 23 | ACACAAACC | 1 bp-I(31) | |||||||||||||
| 3–71P | 852113 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | CACAGTG | 23 | ACACAAACC | S(59) | |||||||||||||
| 3–75P | 900633 | — | — | ATGCAAAG | 15 | GTGAAAA | 99 | + | gt/gg | C | 23 | ACACAAACC | S(22,66) 1 bp-I(80), 1 bp- D(21,23) | |||||||||||||
| 3–76P | 904794 | — | — | ATGAAAAT | 18 | ATGAAAA | 101 | + | gc/ag | CACAGTG | 23 | TCACAAACC | S(3,19,36,51), 2 bp-D(52A) | |||||||||||||
| 3–79P | 944560 | — | — | ATGCAAAC | 18 | ATGAAAA | 101 | + | gt/gg | C | 23 | ACACA | S(33,47), 1 bp-D(−11), 2 bp-D(16) | |||||||||||||
| 4–55P | 730817 | — | — | ATGCAAAT | 39 | TTAAATT | 59 | + | + | CACAGTG | 23 | ACACAAACC | S(34) | |||||||||||||
| 7–34.1P | 501920 | TTCATGA | 2 | ATGCAATT | 8 | TGACTAT | 79 | + | + | CACAGTG | 23 | TCAGAAAGC | S(−15,38,39,46) | |||||||||||||
| 7–56P | 734462 | TTCATGA | 2 | ATGCAAAT | 8 | TTAATAC | 80 | + | + | CACCGTG | 23 | TTAGAAACC | S(27), 1 bp-D(53,65) | |||||||||||||
| Pseudogene with truncation(s) | ||||||||||||||||||||||||||
| 1–14P | 271076 | — | — | — | — | — | — | − | − | CACAGTG | 23 | TCAGAAATC | 5′-T(15) | |||||||||||||
| 1–12P | 247559 | CTTATGA | 2 | ATGCAAAT | 19 | TAAATAT | 54 | + | + | — | — | — | 3′-T(50) | |||||||||||||
| 3–2.1P | 136244 | — | — | — | — | — | — | − | −/ag | CACAGTG | 23 | ACACAAAGC | 5′-T(intron) | |||||||||||||
| 3–5.1P | 164252 | — | — | — | — | — | — | − | − | CACATGA | 17 | ACACAAACC | 5′-T(65) | |||||||||||||
| 3–5.2P | 170347 | — | — | — | — | — | — | − | − | CACAGTG | 22 | ACGCAAACT | 5′-T(12) | |||||||||||||
| 3–13.1P | 267558 | — | — | — | — | — | — | − | −/ag | CACAGTG | 23 | ACAC | 5′-T(intron) | |||||||||||||
| 3–16.1P | 296006 | — | — | — | — | — | — | − | −/ag | CACAGGA | 24 | ACAGAAAAA | 5′-T(intron) | |||||||||||||
| 3–19P | 321917 | — | — | — | — | — | — | − | − | CACTGTG | 23 | ACACAAACC | 5′-T(1) | |||||||||||||
| 3–26.1P | 434239 | — | — | — | — | — | — | − | −/ag | CACAGGG | 24 | ACACAAAAA | 5′-T(intron) | |||||||||||||
| 3–38.1P | 542512 | — | — | — | — | — | — | − | −/ag | CACAGTG | 23 | ACACAAAAG | 5′-T(intron) | |||||||||||||
| 3–42P | 587869 | — | — | — | — | — | — | − | + | CA | 22 | ACACAAATC | 5′-T(−10) | |||||||||||||
| 3–47.1P | 655704 | — | — | — | — | — | — | − | −/ag | CACGGTG | 23 | ACACAAACC | 5′-T(intron) | |||||||||||||
| 3–51.1P | 708156 | — | — | — | — | — | — | − | + | CA | 23 | AGACA | 5′-T(−6) | |||||||||||||
| 3–57P | 743493 | — | — | — | — | — | — | − | −/ag | CACAGGA | 24 | ACACAAAAA | 5′-T(intron) | |||||||||||||
| 3–67.2P | 811473 | — | — | — | — | — | — | − | − | CACATGA | 22 | ACATAAACC | 5′-T(65) | |||||||||||||
| 3–67.3P | 817161 | — | — | — | — | — | — | − | − | CACAGCG | 22 | ACAGAAACC | 5′-T(12) | |||||||||||||
| 3–67.4P | 819680 | — | — | — | — | — | — | − | −/ag | CACAGGA | 24 | ACACAAAAA | 5′-T(intron) | |||||||||||||
| 3–82P | 956317 | — | — | — | — | — | — | − | −/gt | CA | 24 | ACACAAAAT | 5′-T(intron) | |||||||||||||
| 3–22.2P | 389722 | — | — | GTGAAAAT | 18 | ATGGAAA | 100 | ATA | gc/ag | — | — | — | 3′-T(50) | |||||||||||||
| 3–36P | 517398 | — | — | ATGCAAAT | 18 | ATGAAAA | 95 | + | + | CA | — | — | 3′-T(RSS spacer) | |||||||||||||
| 3–44P | 602655 | — | — | ATGCAAAT | 18 | ATGAAAA | 101 | + | + | — | — | — | 3′-T(7) | |||||||||||||
| 3–11.1P | 245337 | — | — | — | — | — | — | − | −/ag | — | — | — | 5′-T(intron), 3′-T(57) | |||||||||||||
| 3–25.1P | 418667 | — | — | — | — | — | — | − | −/ag | — | — | — | 5′-T(intron), 3′-T(77) | |||||||||||||
| 3–76.1P | 908562 | — | — | — | — | — | — | − | −/aa | CACAGGA | — | — | 5′-T(intron), 3′-T(RSS spacer) | |||||||||||||
| 4–1.1P | 79503 | — | — | — | — | — | — | − | − | GACAGAA | 23 | ACACAAACC | 5′-T(33) | |||||||||||||
| 4–15.1P | 288747 | — | — | — | — | — | — | − | − | CACAGGA | 22 | ACACAAACC | 5′-T(10) | |||||||||||||
| 4–20.1P | 338221 | — | — | — | — | — | — | − | − | CACAGTG | 23 | ACACAAACC | 5′-T(41), Alu insertion (82B) | |||||||||||||
| 4–22.1P | 388477 | — | — | — | — | — | — | − | − | CACAGCG | 24 | ACAC | 5′-T(10) | |||||||||||||
| 4–26.2P | 439482 | — | — | — | — | — | — | − | −/ag | C | 23 | ACACAAACC | 5′-T(−4) | |||||||||||||
| 4–28.1P | 454194 | — | — | — | — | — | — | − | − | C | 23 | ACACAACCC | 5′-T(10) | |||||||||||||
| 4–30.1P | 467131 | — | — | — | — | — | — | − | − | CACAGTG | 23 | ACCCAAGCC | 5′-T(10) | |||||||||||||
| 4–31.1P | 478899 | — | — | — | — | — | — | − | − | C | 23 | ACACAAACC | 5′-T(10) | |||||||||||||
| 4–33.1P | 491850 | — | — | — | — | — | — | − | − | CACAGTG | 23 | ACCCAAGCC | 5′-T(10) | |||||||||||||
| 4–44.1P | 614033 | — | — | — | — | — | — | − | + | CACTGTG | 23 | ACACAAACC | 5′-T(−13) | |||||||||||||
| 4–44.2P | 618651 | — | — | — | — | — | — | − | −/ag |
| 23 | ACATAAACC | 5′-T(17) | |||||||||||||
| 4–49.1P | 688842 | — | — | — | — | — | — | − | − |
| 23 | AAACAAACC | 5′-T(10) | |||||||||||||
| 4–51.2P | 709216 | — | — | — | — | — | — | − | − |
| 22 | ACACAAACT | 5′-T(10) | |||||||||||||
| 4–53.1P | 724192 | — | — | — | — | — | — | − | − | CACAGTA | 23 | ACCCAAACC | 5′-T(10) | |||||||||||||
| 4–60.1P | 762234 | — | — | — | — | — | — | − | − | CA | 23 | ACCCAAACC | 5′-T(10) | |||||||||||||
| 4–62.1P | 775048 | — | — | — | — | — | — | − | − | CACAGTG | 24 | ACCAAAACC | 5′-T(10) | |||||||||||||
| 4–65.1P | 796316 | — | — | — | — | — | — | − | − | CACAACG | 23 | ATACAAACC | 5′-T(10) | |||||||||||||
| 4–78.1P | 942365 | — | — | — | — | — | — | − | − | CACAGTG | 23 | ACCCAAACC | 5′-T(10) | |||||||||||||
| 4–80P | 949650 | — | — | ATGCAAAT | 40 | TTAAATT | 60 | + | + | — | — | — | 3′-T(84) | |||||||||||||
| 4–40.1P | 565177 | — | — | — | — | — | — | − | − | — | — | — | 5′-T(9), 3′-T(33) | |||||||||||||
| 4–43.1P | 597179 | — | — | — | — | — | — | − | − | — | — | — | 5′-T(24), 3′-T(90) | |||||||||||||
| 4–46.1P | 640314 | — | — | — | — | — | — | + | gt/ac | — | — | — | 5′-T(−13), 3′-T(48) | |||||||||||||
| 4–67.1P | 810918 | — | — | — | — | — | — | − | + | — | — | — | 5′-T(−12), 3′-T(48) | |||||||||||||
| 4–74.1P | 897900 | — | — | — | — | — | — | − | −/ag | — | — | — | 5′-T(−4), 3′-T(52) | |||||||||||||
| 5–78P | 928017 | — | — | ATGCAAAT | 18 | ACTTAAA | 79 | + | + | — | — | — | 3′-T(RSS 7mer) | |||||||||||||
| 7–27P | 442755 | CTCATGA | 2 | ATGCAAAT | 8 | TAAATAT | 80 | + | + | — | — | — | 3′-T(50) | |||||||||||||
| 7–40P | 549722 | — | — | — | — | — | — | − | − | CACAGTG | 23 | TCAGAAACC | 5′-T(31) | |||||||||||||
Figure 2(A) A phylogenetic tree of the human VH segments based on their nucleotide sequence alignment. Three distinct sets of the VH segments that correspond to VHI, VHII, and VHIII subgroups (39) are separated by broken lines and indicated by Roman numerals. The seven VH families are indicated by different colors. Groups of the VH3 and VH4 segments containing the 5′-truncation are circled. (B) Estimation of divergence time between 10 homologous units containing a pair of the VH3 and VH4 segments. The human/mouse divergence time (vertical line) is indicated in million years ago (Myr).