| Literature DB >> 21044359 |
Eric Rouchka1, Diego E Montoya-Durango, Vilius Stribinskis, Kenneth Ramos, Ted Kalbfleisch.
Abstract
BACKGROUND: In humans, copies of the Long Interspersed Nuclear Element 1 (LINE-1) retrotransposon comprise 21% of the reference genome, and have been shown to modulate expression and produce novel splice isoforms of transcripts from genes that span or neighbor the LINE-1 insertion site.Entities:
Mesh:
Year: 2010 PMID: 21044359 PMCID: PMC2967742 DOI: 10.1186/1471-2105-11-S9-S12
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Description of the LINE-1 insertion sites identified in this study
| Chromosome | Insert Site Position | NA12878 | NA12891 | NA12892 | LINE-1 Orientation | Gene Locus | Entrez ID | Gene orientation |
| Chr_2 | 144021434 | +/- | +/- | N | - | ARHGAP15 | 55843 | + |
| Chr_3 | 38601086 | +/- | +/- | +/- | + | SCN5A | 6331 | - |
| Chr_3 | 82073683 | +/- | Y | N | - | |||
| Chr_3 | 125073416 | Y | +/- | Y | + | MYLK | 4638 | - |
| Chr_3 | 187854832 | +/- | N | Y | + | |||
| Chr_4 | 132401098 | +/- | +/- | +/- | - | |||
| Chr_4 | 147444745 | +/- | N | Y | + | SLC10A7 | 84068 | - |
| Chr_5 (*) | 21243489 | Y | +/- | +/- | + | |||
| Chr_5 | 89486537 | Y | Y | Y | + | |||
| Chr_6 | 102952779 | +/- | +/- | N | - | |||
| Chr_6 (*) | 123895635 | +/- | +/- | N | - | TRDN | 10345 | - |
| Chr_7 (*) | 7985552 | +/- | N | +/- | + | GLCCI1 | 113263 | + |
| Chr_7 | 53619996 | +/- | Y | ?/? | - | |||
| Chr_10 | 124445220 | Y | +/- | Y | + | |||
| Chr_11 | 109883089 | Y | N | +/- | - | |||
| Chr_12 | 125368882 | +/- | N | +/- | - | |||
| Chr_13 | 60360333 | Y | Y | +/- | - | |||
| Chr_14 | 51737509 | Y | Y | Y | - | |||
| Chr_15 | 31819131 | Y | +/- | +/- | - | RYR3 | 6263 | + |
| Chr_15 | 54038444 | +/- | N | +/- | + | NEDD4 | 4734 | - |
| Chr_17 (*) | 62067756 | +/- | +/- | N | + | PRKCA | 5578 | + |
| Chr_18 | 12481262 | +/- | N | +/- | + | SPIRE1 | 56907 | - |
Legend: Position and genotype for the twenty two LINE-1 putative full length insertions identified in this work. The splice site position is the position of the base nearest the 5′ most base of the LINE-1 insertion in the human genome assembly build 36.3. A ‘Y’ genotype indicates that all reads found for that individual at that position contained the LINE-1 insert. An ‘N’ genotype indicates that no reads found for that individual at that position contained the LINE-1 insert. A ‘+/-‘ genotype indicates the individual was heterozygous at that position, and ‘?/?’ indicates that no reads could be found for that individual at that position. The L1 orientation of + and – indicate sense and antisense insertion respectively with respect to the assembly. Those rows marked with an (*) were verified by PCR.
Figure 1Allele specific confirmation of genotypes. Gel images of four validation runs using allele specific PCR. Each lane is labeled with a sample identifier D=NA12878(daughter), F=NA12891(father), M=NA12892(mother), and C=Control. For each locus, the samples were run against two pairs of primers, one that would amplify DNA that contained a LINE1 insertion (L1+), and one that would amplify DNA that did not (L1-). The four insertion sites chosen for validation were a) the intergenic region on chromosome 5 located at position 21243489 on human genome build 36.3, and those insertions within the loci for b)TRDN, c) GLCCI1, and d) PRKCA. See the Methods section for more details regarding the primer sequences and PCR conditions.
Figure 2Diagram of insert search process. A diagram describing the sequence search process by which putative full length LINE-1 insertion sites were identified.
Primers used for allele specific PCR
| Locus/Allele | 5' Oligo | 3' Oligo | PCR Product Size(nt) |
|---|---|---|---|
| Chr5:21243489_L1+ | ttgcctaagcttcccataca | TGAACCCGGTACCTCAGATG | 539 |
| Chr5:21243489_L1- | ttgcctaagcttcccataca | tgatccatgtcaataatgaggtc | 677 |
| TRDN_L1+ | tttcatgccatctctatattccaa | TGAACCCGGTACCTCAGATG | 477 |
| TRDN_L1- | tttcatgccatctctatattccaa | aaaaacaggcctgagattgc | 690 |
| GLCCI1_L1+ | tcctcatacaaacttgtgaagtgat | TGAACCCGGTACCTCAGATG | 666 |
| GLCCI1_L1- | tcctcatacaaacttgtgaagtgat | ttcaagcttcatctaatgaaagaaaa | 682 |
| PRKCA_L1+ | tgtgcaaatttcaccccata | TGAACCCGGTACCTCAGATG | 527 |
| PRKCA_L1- | tgtgcaaatttcaccccata | ccatagagcagtgacccaca | 622 |
Legend: Primers used for the allele specific PCR verification of the next generation sequence data analysis. The same right primer was used in all L1+ PCR reactions. This primer spans bases 90 to 109 of the LINE-1 transcript with accession L19092.
Insertions identified by sequence search of whole genome shotgun sequence data
| Chromosome | Insert Site Position | Gene Locus |
| Chr_2 | 144021434 | ARHGAP15 |
| Chr_5 | 21243489 | |
| Chr_6 | 102952779 | |
| Chr_7 | 7985552 | GLCCI1 |
| Chr_14 | 51737509 | |
| Chr_15 | 31819131 | RYR3 |
| Chr_17 | 62067756 | PRKCA |
| Chr_18 | 12481262 | SPIRE1 |
Legend: The splice site position is the position of the base nearest the 5′ most base of the LINE-1 insertion in the human genome assembly build 36.3.