| Literature DB >> 15673711 |
Ilgar Z Mamedov1, Elena S Arzumanyan, Anna L Amosova, Yuri B Lebedev, Eugene D Sverdlov.
Abstract
A new experimental technique for genome-wide detection of integration sites of polymorphic retroelements (REs) is described. The technique allows one to reveal the absence of a retroelement in an individual genome provided that this retroelement is present in at least one of several other genomes under comparison. Since quite a number of genomes are compared simultaneously, the search for polymorphic REs insertions is very efficient. The technique includes two whole-genome selective PCR amplifications of sequences flanking REs: one for a particular genome and another one for a mixture of ten different genomes. A subsequent subtractive hybridization of the obtained amplicons with DNA of a particular genome as driver results in isolation of polymorphic insertions. The technique was successfully applied for identification of 41 new polymorphic human AluYa5/Ya8 insertions. Among them, 18 individual Alu elements first sequenced in this work were not found in the available human genome databases. This result suggests that significant part of polymorphic REs were not identified during genome sequencing and remain to be detected and characterized. The proposed method does not depend on preliminary knowledge of evolutionary history of retroelements and can be applied for identification of insertion/deletion polymorphic markers in genomes of different species.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15673711 PMCID: PMC548376 DOI: 10.1093/nar/gni018
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Oligonucleotides/primers used for the library construction
| A1A2 | TGTAGCGTGAAGACGACAGAAAGGGCGTGGTGCGGAGGGCGGT |
| A1 | TGTAGCGTGAAGACGACAGAA |
| A2 | AGGGCGTGGTGCGGAGGGCGGT |
| a12 | ACCGCCCTCC |
| A3A4 | AGCAGCGAACTCAGTACAACAAGTCGACGCGTGCCCGGGCTGGT |
| A3 | AGCAGCGAACTCAGTACAACA |
| A4 | AGTCGACGCGTGCCCGGGCTGGT |
| a34 | ACCAGCCC |
| A5A6 | GTAATACGACTCACTGGAGGGCGAGCGTGGTCGCCGCCGAGGTG |
| A5 | GTAATACGACTCACTGGAGGGC |
| A6 | GAGCGTGGTCGCCGCCGAGGTG |
| a56 | CACCTCGGC |
| T1 | TCACCGTTTTAGCCGGGA |
| T2 | GTGAGCCACCGCGCCCGG |
Figure 1(A) Tracer DNA preparation. Double lines denote genomic DNA. R, RsaI restriction sites; open box represents individual Alu Ya5 or Ya8 element. A1A2 (hatched boxes), the first oligonucleotide adapter ligated to the restricted DNA; A1/T1 and A2/T2, primers used in the first and second PCR rounds, respectively. A3A4 (gray boxes) and A5A6 (dark boxes), ligated adapters forming Tracer A and Tracer B, respectively. Note: Driver DNA was prepared from a single sample exactly in the same way as Tracer but omitting stage f. (B) Scheme of subtractive hybridization. Wavy lines depict flanking sequences of Alu present in one or more of 10 Tracer genomes and absent from Driver. Straight lines mark flanks of Alu common for Driver and Tracer. Open boxes represent the remainder of Alu Ya5 or Ya8 elements; A3/A5 and A4/A6, primers used in the first and second PCR rounds, respectively. Other designations are as in (A).
Figure 2Geographic origin of populations used for individual DNAs sampling. Territory of Russian Federation is in gray and neighboring countries are white (B, Belarus; U, Uzbekistan; KZ, Kazakhstan). Scale is indicated at the bottom. Dr indicates that Andi genomic DNA sample was used as driver.
The numbers of Alu inserts studied and proportions of polymorphic AluYa5 inserts
| Total | Alu inserts | ||
|---|---|---|---|
| Polymorphic | Fixed | ||
| Clones arrayed | 288 | ||
| Clones sequenced | 120 | 78 | 21 |
| The number of different Alu flanking sequences | 88 | ||
| Flanking sequences unambiguously mapped on the human genome | 77 | ||
| Alu insertions characterized | 46 | 21 | |
| Previously described | 8 | 5 | 3 |
| Tested by PCR in this work | 64 | 41 | 18 |
| The number of polymorphic Alu insertions present in databases | 23 | ||
| The number of polymorphic Alu insertions absent from databases | 18 | ||
aFive of 64 PCR assays resulted in uncertain results (see ‘Alu polymorphisms identification’ and notes to Table 3).
Figure 3Examples of PCR amplification with three individual polymorphic Alu-containing loci (MLS 19, MLS 50 and MLS 65) in ten tracer (lines t1–t10) and one driver (line Dr) DNA samples. Lines K+ and K− represent positive and negative controls, respectively. M, DNA fragments of a 100 bp ladder length marker (SybEnzime). Gray and white arrows to the left of the electrophoregrams indicate the predicted locations of the Alu containing and Alu-free PCR products, respectively. Scheme of genomic primers location is represented at the bottom.
Alu Ya5 genomic location, neighboring genes and polymorphism frequency
| Name | Accession | Chromosome | Neighboring genes | Polymorphism frequency | |
|---|---|---|---|---|---|
| Alu+ | Alu− | ||||
| Ya5-MLS09 | AK056306 | AL162431 | 1q25.3 | 2k down | 7/20 |
| Ya5-MLS33 | AL513546 | 1q21.3 | No genes | N/A | |
| Ya5-MLS51 | AY736296 | AL592148 | 1q41 | In AK096526 | 1/20 |
| Ya5-MLS58 | AY736298 | AL390117 | 1p12 | No genes | 1/20 |
| Ya5-MLS59 | AL356501 | AL365175 | 1p31.1 | No genes | 7/20 |
| Ya5-MLS60 | AL928921 | 1p36.22 | No genes | N/A | |
| Ya5-MLS48 | AC073577 | AC073046 | 2p13.1 | In | 14/20 |
| Ya5-MLS26 | AY736289 | AC099331 | 3p22.1 | In | 9/20 |
| Ya5-MLS29 | AC011325 | 3q29 | No genes | 13/20 | |
| Ya5-MLS47 | AC024248 | 4q26 | No genes | 3/20 | |
| Ya5-MLS57 | AY736297 | AC010683 | 4q31.22 | No genes | 2/20 |
| Ya5-MLS05 | AC105756 | AC087107 | 4q34.3 | No genes | 5/20 |
| Ya5-MLS50 | AY736295 | AC115540 | 4q35.2 | No genes | 4/20 |
| Ya5-MLS04 | AC114316 | AC116332 | 5q14.3 | No genes | 10/20 |
| Ya5-MLS06 | AC020921 | AC007554 | 5q14.3 | No genes | 8/20 |
| Ya5-MLS31 | L43392 | 5q23.3 | In | N/A | |
| Ya5-MLS44 | AY736294 | AL365508 | 6q22.31 | In c6orf170 | 2/18 |
| Ya5-MLS19 | AC010942 | 6q22.33 | In | 10/20 | |
| Ya5-MLS10 | AY736285 | AL121969 | 6p12.2 | No genes | 5/20 |
| Ya5-MLS14 | AY736286 | AC019066 | 7p12.3 | In | 1/20 |
| Ya5-MLS39 | AC005377 | 7q34 | No genes | 6/20 | |
| Ya5-MLS23 | AP003357 | AC012482 | 8q22.1 | 4k up | 5/20 |
| Ya5-MLS41 | AP005354 | AC103853 | 8q23.1 | In | 14/20 |
| Ya5-MLS37 | AY736292 | AC022389 | 10q23.1 | 4k down | 6/20 |
| Ya5-MLS28 | AY736290 | AL358033 | 10p13 | No genes | 2/20 |
| Ya5-MLS56 | AL139818 | 10p15.3 | No genes | N/A | |
| Ya5-MLS07 | AC025427 | 10q21.1 | No genes | 3/20 | |
| Ya5-MLS35 | AL731574 | 10q25.1 | No genes | 18/20 | |
| Ya5-MLS36 | AC090832 | 11p11.2 | In | 11/16 | |
| Ya5-MLS18 | AC080183 | AC023869 | 11p14.3 | In AK127695 | 12/20 |
| Ya5-MLS45 | AC079363 | 12q21.31 | In | N/A | |
| Ya5-MLS34 | AY736291 | AC009721 | 12q24 | In | 6/18 |
| Ya5-MLS70 | AY736302 | AL159153 | 13q34 | In | 14/20 |
| Ya5-MLS12 | AL138681 | 13q12.3 | No genes | 14/20 | |
| Ya5-MLS46 | AL390722 | 13q14.3 | No genes | 14/20 | |
| Ya5-MLS69 | AY736301 | AL390964 | 13q21.1 | No genes | 4/20 |
| Ya5-MLS22 | AL161897 | AE014308 | 13q33.2 | No genes | 4/20 |
| Ya5-MLS68 | AY736300 | AL132990 | 14q32.13 | 3k up | 5/20 |
| Ya5-MLS65 | AY736299 | AL445883 | 14q13.1 | No genes | 1/20 |
| Ya5-MLS11 | AC010674 | 15q21.2 | In | 5/20 | |
| Ya5-MLS21 | AC021958 | AC026583 | 15q23 | No genes | 9/20 |
| Ya5-MLS63 | AC130456 | AC003108 | 16p12.3 | In | 3/18 |
| Ya8-MLS32 | AC139236 | AC140504 | 16p13.1 | No genes | 3/20 |
| Ya5-MLS16 | AY736287 | AC108483 | 16q23.2 | No genes | 3/16 |
| Ya5-MLS30 | AC016586 | 19p13.3 | 8k up | 16/18 | |
| Ya5-MLS20 | AY736288 | AL135935 | 20p12.2 | In | 1/20 |
| Ya5-MLS66 | AC023275 | 1q31.1 | In | FP | |
| Ya5-MLS13 | AL590082 | 1p21.1 | No genes | FP | |
| Ya5-MLS62 | AC079300 | 2p16.3 | No genes | FP | |
| Ya5-MLS49 | AC108068 | 2q32.3 | No genes | FP | |
| Ya5-MLS52 | AC016903 | 2q33.3 | No genes | FP | |
| Ya5-MLS24 | AC019130 | 2q37.1 | In MGC42174 | FP | |
| Ya5-MLS61 | AC099339 | 4q31.3 | In | FP | |
| Ya5-MLS55 | AC094098 | 5q33.2 | No genes | FP | |
| Ya5-MLS25 | AL139093 | 6p22.3 | No genes | FP | |
| Ya5-MLS38 | AC006006 | 7q34 | In | FP | |
| Ya5-MLS15 | AL355592 | 9q33.1 | No genes | FP | |
| Ya5-MLS53 | AC069540 | 10q23.1 | No genes | FP | |
| Ya5-MLS42 | AC022878 | 11p15.3 | No genes | FP | |
| Ya5-MLS43 | AP001782 | 11q24.1 | No genes | FP | |
| Ya5-MLS01 | AC023795 | 12q12 | In | FP | |
| Ya5-MLS27 | AC078860 | 12q21.1 | 2k down | FP | |
| Ya5-MLS67 | AC007432 | 17q24.3 | No genes | FP | |
| Ya5-MLS54 | AL031657 | 20q11.23 | 4k up | FP | |
aFor Alu repeats that were sequenced in this work for the first time accession numbers obtained in this work are given.
bNeighboring genes were detected by Human Genome Browser under RefSeq Genes. Up, Alu is N kb upstream gene; down, Alu is N kb downstream gene; in, Alu is into intron of gene; no genes, no neighboring genes were detected within 10 kb of Alu.
cFor polymorphic Alu repeats fraction of Alu-positive haplotypes to the total haplotypes tested for tracer DNA. Driver DNA for polymorphic insertions is always Alu negative. FP (fixed present), all the DNA samples tested were Alu-positive; N/A, PCR products were not detected for driver DNA sample, or the samples were Alu negative, or no specific PCR products detected for driver and tracer DNAs.
Chromosomal distribution of polymorphic Alu Ya5 and Ya8 repeats found in this work compared to the distribution of other polymorphic REs detected by in silico approach
| Chromosome | Polymorphic (other works) | Polymorphic (this work) | Total polymorphic |
|---|---|---|---|
| 1 | 24 | 4 | 28 |
| 2 | 29 | 1(1) | 29 |
| 3 | 18 | 2 | 20 |
| 4 | 20 | 4(1) | 23 |
| 5 | 18 | 2(1) | 19 |
| 6 | 27 | 3(1) | 29 |
| 7 | 37 | 2 | 39 |
| 8 | 10 | 2 | 12 |
| 9 | 6 | 0 | 6 |
| 10 | 13 | 4(1) | 16 |
| 11 | 12 | 2 | 14 |
| 12 | 12 | 1 | 13 |
| 13 | 18 | 5(2) | 21 |
| 14 | 17 | 2 | 19 |
| 15 | 4 | 2 | 6 |
| 16 | 7 | 3(1) | 9 |
| 17 | 10 | 0 | 10 |
| 18 | 7 | 0 | 7 |
| 19 | 5 | 1 | 6 |
| 20 | 12 | 1 | 13 |
| 21 | 4 | 0 | 4 |
| 22 | 3 | 0 | 3 |
| Total | 313 | 41 | 346 |
aCited from (12).
bNumber of polymorphic insertions identical to those described by Mark Batzers group [Otieno et al. (12)] recently are given in brackets.