| Literature DB >> 27882299 |
Minako Yoshihara1, Tetsuya Sato1, Daisuke Saito1, Osamu Ohara2, Takashi Kuramoto3, Mikita Suyama1.
Abstract
We report sequence data obtained by our recently devised target capture method TargetEC applied to 20 inbred rat strains. This method encompasses not only all annotated exons but also highly conserved non-coding sequences shared among vertebrates. The total length of the target regions covers 146.8 Mb. On an average, we obtained 31.7 × depth of target coverage and identified 154,330 SNVs and 24,368 INDELs for each strain. This corresponds to 470,037 unique SNVs and 68,652 unique INDELs among the 20 strains. The sequence data can be accessed at DDBJ/EMBL/GenBank under accession number PRJDB4648, and the identified variants have been deposited at http://bioinfo.sls.kyushu-u.ac.jp/rat_target_capture/20_strains.vcf.gz.Entities:
Year: 2016 PMID: 27882299 PMCID: PMC5114524 DOI: 10.1016/j.gdata.2016.11.010
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Summary statistics for sequencing and variant calling.
| Strain | Sex | Total reads | Read length | Mapped reads after post-processing (%) | Average target depth | SNV | INDEL |
|---|---|---|---|---|---|---|---|
| BDIX.Cg- | Unknown | 77,031,192 | 151 | 62,133,380 (80.7) | 33.0 | 161,043 | 25,729 |
| BDIX/NemOda | Female | 62,884,340 | 151 | 50,668,261 (80.6) | 26.2 | 155,727 | 24,561 |
| BN/SsNSlc | Male | 67,363,478 | 151 | 54,385,129 (80.7) | 29.4 | 23,060 | 5533 |
| BUF/MNa | Male | 60,898,020 | 151 | 49,603,905 (81.5) | 27.2 | 154,382 | 24,122 |
| DOB/Oda | Male | 68,359,820 | 151 | 61,010,641 (89.2) | 31.7 | 196,751 | 30,148 |
| F344/DuCrlCrlj | Male | 73,516,660 | 151 | 59,541,186 (81.0) | 27.7 | 152,184 | 23,890 |
| F344/Jcl | Male | 62,994,072 | 151 | 50,991,611 (80.9) | 26.6 | 152,141 | 23,855 |
| F344/NSlc | Male | 62,838,170 | 151 | 50,726,936 (80.7) | 27.5 | 152,546 | 23,930 |
| F344/Stm | Male | 64,788,908 | 151 | 52,984,127 (81.8) | 29.1 | 151,919 | 23,735 |
| HTX/Kyo | Male | 72,484,640 | 151 | 64,572,821 (89.1) | 33.7 | 154,418 | 24,156 |
| HWY/Slc | Male | 74,687,034 | 151 | 66,579,903 (89.1) | 34.6 | 157,070 | 24,873 |
| IS/Kyo | Male | 79,430,344 | 151 | 70,744,396 (89.1) | 37.4 | 187,300 | 29,120 |
| IS- | Male | 75,990,092 | 151 | 67,761,875 (89.2) | 35.8 | 186,648 | 28,902 |
| KFRS3B/Kyo | Female | 81,643,134 | 151 | 72,603,786 (88.9) | 35.1 | 154,292 | 24,419 |
| LE/Stm | Male | 72,300,094 | 151 | 58,438,239 (80.8) | 31.7 | 157,488 | 25,052 |
| LEC/Tj | Unknown | 78,990,272 | 151 | 70,539,682 (89.3) | 37.3 | 167,547 | 26,315 |
| NIG-III/Hok | Unknown | 78,128,624 | 151 | 69,625,354 (89.1) | 36.9 | 164,732 | 26,195 |
| RCS/Kyo | Male | 71,627,648 | 151 | 57,894,324 (80.8) | 31.6 | 155,472 | 24,975 |
| ZF | Male | 69,986,466 | 151 | 56,655,891 (81.0) | 30.2 | 150,778 | 23,815 |
| ZFDM | Male | 73,535,060 | 151 | 59,407,086 (80.8) | 31.9 | 151,101 | 24,025 |
| Specifications [ | |
|---|---|
| Organism/cell line/tissue | |
| Sex | Female and male, see |
| Sequencer or array type | Illumina NextSeq 500 |
| Data format | FASTQ and VCF |
| Experimental factors | Genomic DNA extracted from spleen |
| Experimental features | Target capture sequencing of exons and conserved non-coding sequences |
| Consent | Not applicable |
| Sample source location | Rat strains were provided by the National BioResource Project (NBRP)–Rat ( |