| Literature DB >> 32110248 |
Jui Wan Loh1, Hongseok Ha1,2, Timothy Lin1, Nawei Sun1,2, Kathleen H Burns3, Jinchuan Xing1,2.
Abstract
BACKGROUND: Mobile elements are ubiquitous components of mammalian genomes and constitute more than half of the human genome. Polymorphic mobile element insertions (pMEIs) are a major source of human genomic variation and are gaining research interest because of their involvement in gene expression regulation, genome integrity, and disease.Entities:
Keywords: AluYb; High-throughput sequencing; LINE-1; ME-Scan; Mobile element insertion; Retrotransposon; SVA
Year: 2020 PMID: 32110248 PMCID: PMC7035633 DOI: 10.1186/s13100-020-00207-x
Source DB: PubMed Journal: Mob DNA
Fig. 1ME-specific amplification during ME-Scan library construction. For each ME type library, two rounds of nested amplification are performed. The ME-specific amplification primers (ME1 and ME2) are shown as thin arrows above the ME consensus and the amplification directions are indicated by the arrows. First-round amplification primers (ME1) are biotinylated (green star) for enrichment, and the second-round nested primers (ME2) include the Illumina sequencing adaptor (orange box). Different components of AluYb, SVA, and L1HS consensuses are labelled. The final paired-end sequencing reads from the resulting sequencing libraries are represented with blue arrows (ME Reads) and black arrows (Flanking Reads), respectively. Blue box: ME sequence; grey box: flanking genomic region; green star: biotin; orange box: Illumina sequencing adaptor
Fig. 2Computational data analysis overview. a) The paired-end sequencing reads. Sequencing reads from the pooled libraries are represented by red (ME Reads) and blue arrows (Flanking Reads), respectively. b) Read filtering. The ME Reads are compared to the targeted ME consensus to identify recent insertions and are filtered based on the BLAST bit-score cutoff. The Flanking Reads are mapped to the reference genome and are filtered based on the mapping quality score cutoff. c) Flanking Read clustering and insertion loci identification. Filtered Flanking reads that are within a 500 bp sliding window are clustered into a candidate insertion locus and the genomic position closest to the ME Read is selected as the insertion position (marked with a star). Black box: clustering window
Cutoffs and the number of candidate loci in YRI individuals
| L1HS | SVA | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Individual | Cutoff (TPM,UR) | All | Poly-morphic | Novel | Cutoff (TPM,UR) | All | Poly-morphic | Novel | Cutoff (TPM,UR) | All | Poly-morphic | Novel |
| NA18500 | (10,10) | 2411 | 387 | 168 | (10,10) | 951 | 234 | 108 | (4,9) | 1395 | 164 | 13 |
| NA18501 | (10,10) | 2453 | 416 | 188 | (10,10) | 949 | 231 | 112 | (3,10) | 1398 | 178 | 12 |
| NA18502 | (10,10) | 2546 | 452 | 206 | (10,10) | 968 | 252 | 142 | (5,10) | 1393 | 172 | 14 |
| NA18503 | (10,10) | 2418 | 392 | 181 | (10,10) | 950 | 237 | 126 | (4,10) | 1390 | 166 | 10 |
| NA18504 | (10,10) | 2463 | 418 | 191 | (10,10) | 981 | 257 | 133 | (3,9) | 1400 | 170 | 7 |
| NA18505 | (10,10) | 2494 | 427 | 183 | (10,10) | 1028 | 301 | 159 | (4,10) | 1407 | 175 | 9 |
| NA18506 | (10,8) | 2347 | 362 | 139 | (10,10) | 916 | 211 | 116 | (4,6) | 1392 | 170 | 15 |
| NA18507 | (10,10) | 2408 | 383 | 145 | (10,10) | 922 | 220 | 115 | (3,10) | 1404 | 174 | 11 |
| NA18508 | (10,10) | 2634 | 509 | 242 | (10,10) | 998 | 276 | 156 | (5,10) | 1405 | 175 | 15 |
| NA18515 | (10,10) | 2563 | 445 | 213 | (10,10) | 982 | 255 | 135 | (5,10) | 1393 | 162 | 13 |
| NA18516 | (10,10) | 2554 | 448 | 208 | (10,10) | 1049 | 310 | 172 | (6,10) | 1369 | 141 | 7 |
| NA18517 | (10,10) | 2566 | 464 | 224 | (10,10) | 1023 | 293 | 165 | (4,10) | 1417 | 183 | 21 |
| NA18521 | (10,10) | 2572 | 470 | 218 | (10,10) | 959 | 238 | 131 | (5,10) | 1398 | 163 | 14 |
| NA18522 | (10,10) | 2562 | 449 | 205 | (10,10) | 967 | 237 | 122 | (5,10) | 1402 | 171 | 14 |
| NA18523 | (10,10) | 2689 | 533 | 252 | (10,10) | 1047 | 319 | 176 | (5,10) | 1427 | 193 | 23 |
| NA19101 | (10,10) | 2534 | 424 | 184 | (10,10) | 1001 | 273 | 147 | (5,10) | 1378 | 154 | 5 |
| NA19102 | (10,10) | 2550 | 445 | 193 | (10,10) | 1018 | 286 | 157 | (6,10) | 1381 | 149 | 4 |
| NA19103 | (10,10) | 2455 | 398 | 161 | (10,10) | 1005 | 278 | 155 | (5,7) | 1372 | 149 | 8 |
| NA19137 | (10,10) | 2593 | 476 | 201 | (10,10) | 995 | 268 | 144 | (6,10) | 1379 | 150 | 10 |
| NA19138 | (10,10) | 2604 | 467 | 201 | (10,10) | 1040 | 299 | 156 | (7,10) | 1376 | 146 | 11 |
| NA19139 | (10,10) | 2619 | 480 | 202 | (10,10) | 1021 | 285 | 152 | (7,10) | 1370 | 147 | 11 |
| NA19171 | (10,10) | 2652 | 503 | 219 | (10,10) | 1085 | 333 | 188 | (6,10) | 1374 | 149 | 6 |
| NA19172 | (10,10) | 2668 | 533 | 231 | (10,10) | 1100 | 350 | 194 | (7,10) | 1381 | 149 | 11 |
| NA19173 | (10,10) | 2564 | 438 | 173 | (10,10) | 1035 | 292 | 160 | (7,10) | 1361 | 138 | 9 |
| NA19200 | (10,10) | 2617 | 479 | 209 | (10,10) | 1029 | 293 | 163 | (6,10) | 1387 | 158 | 8 |
| NA19201 | (10,10) | 2567 | 455 | 199 | (10,10) | 1004 | 275 | 151 | (7,10) | 1367 | 140 | 7 |
| NA19202 | (10,10) | 2669 | 510 | 230 | (10,10) | 1098 | 358 | 204 | (6,10) | 1384 | 155 | 10 |
| NA19203 | (10,10) | 2559 | 433 | 179 | (10,10) | 1051 | 307 | 165 | (6,10) | 1374 | 151 | 4 |
| NA19204 | (10,10) | 2686 | 534 | 239 | (10,10) | 1160 | 408 | 254 | (7,10) | 1378 | 152 | 14 |
| NA19205 | (10,10) | 2589 | 454 | 199 | (10,10) | 1024 | 290 | 158 | (6,10) | 1368 | 145 | 9 |
| NA19206 | (10,10) | 2270 | 349 | 122 | (10,9) | 953 | 248 | 129 | (4,4) | 1392 | 172 | 13 |
| NA19207 | (10,10) | 2516 | 422 | 184 | (10,10) | 1062 | 325 | 191 | (6,10) | 1371 | 140 | 10 |
| NA19208 | (10,10) | 2491 | 410 | 169 | (10,10) | 1011 | 278 | 144 | (4,9) | 1381 | 152 | 8 |
| NA19209 | (10,10) | 2544 | 432 | 195 | (10,10) | 1025 | 296 | 160 | (7,10) | 1367 | 141 | 8 |
| NA19210 | (10,10) | 2615 | 473 | 198 | (10,10) | 1073 | 336 | 197 | (6,10) | 1380 | 147 | 11 |
| NA19211 | (10,10) | 2485 | 412 | 176 | (10,10) | 1037 | 296 | 161 | (5,10) | 1375 | 150 | 3 |
| Total | 4216 | 1819 | 1079 | 2250 | 1456 | 1175 | 1779 | 477 | 180 | |||
Fig. 3Sensitivity analysis for determining proper TPM and UR cutoffs. Using presumably fixed reference MEIs as true positives, the sensitivity is calculated under different TPM and UR cutoffs for AluYb, L1HS, and SVA candidate loci, respectively. The average individual sensitivity (left panel) and overall sensitivity (right panel) for the 36 YRI samples are shown. The sensitivity is shown as the percentage of presumably fixed insertions being identified for each cutoff. The heatmap color corresponds to the sensitivity, as indicated in the color bar on the right of each plot
Oligos and primers used in this study
| Description | Sequence (5′- > 3′) |
|---|---|
| Long adaptor with indexes | CAAGCAGAAGACGGCATACGAGAT |
| Short adaptor with indexes | |
| 1st round | /5Biosg/CAGGCCGGACTGCGGA*C |
| 2nd round | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNAGTGCTGGGATTACAGGCGTG*A |
| 1st round L1HS amplification primer | /5Biosg/GGGAGATATACCTAATGCTAGATGACAC*A |
| /5Biosg/GGGAGATATACCTAATGCTAGATGACAC*G | |
| /5Biosg/GGGAGATATACCTAATGCTAGATGACAA*G | |
| 2nd round L1HS amplification primer | TGCACATGTACCCTAAAACTTAGAGTATAA*T |
| 1st round SVA amplification primer | /5Biosg/AGAATCAGGCAGGGAGGTT*G |
| 2nd round SVA amplification primer | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNAGTACMGTCCAGCTTCGGC*T |
| P7 adaptor amplification primer | CAAGCAGAAGACGGCATACGAGA*T |
| L1_1 internal primer for validation | GGGAGATATACCTAATGCTAGATGACA |
| L1_2 internal primer for validation | TGCACATGTACCCTAAAACTTAG |
| SVA_1 internal primer for validation | AGAATCAGGCAGGGAGGTTG |
| SVA_2 internal primer for validation | AGTACMGTCCAGCTTCGGCT |
| /5Biosg/: 5′ Biotin; *: 3′ Phosphorothioate bond; Index1 and Index2: individual specific 6 bp indexes. | |
ME-Scan amplification conditions
| First amplification | Second amplification | |||||
|---|---|---|---|---|---|---|
| L1HS* | SVA (10 cycles) | L1HS (12 cycles) | SVA (12 cycles) | |||
| PCR grade water | As needed | As needed | ||||
| 2X KAPA HiFi HS RM | 25 μl | 25 μl | 37.5 μl | |||
| Adapter primer (P7)+ | 2.5 μl | 2.5 μl | 3.75 μl | |||
| ME-specific primer + | 2.5 μl | 2.5 μl | 3.75 μl | |||
| DNA | 360 ng | 100 ng | 200 ng | 16 μl | 2 μl | 24 μl |
| Total | 50 μl | 50 μl | 75 μl | |||
* follow step-down PCR thermocycling conditions in Table 4
+ primers are shown in Table 2 with 10 μM concentration
Step-down PCR thermocycling condition for L1HS amplification
| 95 °C | 5 min | |
|---|---|---|
| 95 °C | 1 min | Repeat 5 cycles |
| 72 °C | 1 min | |
| 72 °C | 5 min | |
| 95 °C | 1 min | Repeat 5 cycles |
| 68 °C | 1 min | |
| 72 °C | 5 min | |
| 95 °C | 45 s | Repeat 15 cycles |
| 64 °C | 1 min | |
| 72 °C | 5 min | |
| 72 °C | 15 min | |
| 4 °C | Hold | |