| Literature DB >> 27251557 |
Xiaojing Zhou1, Yang Dong1, Jiaojiao Zhao1, Li Huang1, Xiaoping Ren1, Yuning Chen1, Shunmou Huang1,2, Boshou Liao1, Yong Lei1, Liying Yan1, Huifang Jiang3.
Abstract
BACKGROUND: Single-locus markers have many advantages compared with multi-locus markers in genetic and breeding studies because their alleles can be assigned to particular genomic loci in diversity analyses. However, there is little research on single-locus SSR markers in peanut. Through the de novo assembly of DNA sequencing reads of A. hypogaea, we developed single-locus SSR markers in a genomic survey for better application in genetic and breeding studies of peanut.Entities:
Keywords: Combined libraries; Peanut (A hypogaea L.); Single-locus SSR; de novo assembly
Mesh:
Substances:
Year: 2016 PMID: 27251557 PMCID: PMC4888616 DOI: 10.1186/s12864-016-2743-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequencing data
| Library insert-size | No. of raw reads | Total length (bp) of raw reads | No. of high-quality reads after filtering | Total length (bp) of high-quality reads |
|---|---|---|---|---|
| 270 bp | 855,464,570 | 128,319,685,500 | 714,934,076 | 101,712,765,895 |
| 500 bp | 543,665,696 | 81,549,854,400 | 401,580,046 | 55,276,499,395 |
| 2 Kbp | 422,214,574 | 63,332,186,100 | 363,241,962 | 52,076,474,416 |
| 5 Kbp | 235,532,130 | 35,329,819,500 | 195,875,900 | 28,017,291,974 |
| Total | 2,056,876,970 | 308,531,545,500 | 1,675,631,984 | 237,083,031,680 |
Statistics of de novo assembly results
| Contig | Scaffold | |||
|---|---|---|---|---|
| Size (bp) | Number | Size (bp) | Number | |
| N50 | 1,782 | - | 3,920 | - |
| Longest | 310,739 | - | 576,627 | - |
| Total size | 1,752,933,618 | - | 1,987,916,087 | - |
| Average | 835 | - | 1,693 | - |
| Total number | - | 2,102,446 | - | 1,176,527 |
| Total number (≥2 kb) | - | 171,497 | - | 360,557 |
| Total number (≥10 kb) | - | 4,709 | - | 9,448 |
The distribution of different types of single-locus SSRs identified
| Motif | Number (%1, %2) | Repeat number | Total length (bp3, %4) | Average length (bp5) |
|---|---|---|---|---|
| Mono | 25 224 (16.2, 100) | 12–146 | 650 581 (23.9) | 25.8 |
| A | 23 979 (15.4, 95.1) | 12–146 | 630 491 (23.2) | 26.3 |
| C | 1 245 (0.8, 4.9) | 12–145 | 20 090 (0.7) | 16.1 |
| Di | 25 460 (16.4, 100) | 6–106 | 501 652 (18.4) | 19.7 |
| AT | 13 757 (8.8, 54.0) | 6–54 | 232 732 (8.6) | 16.9 |
| AG | 8 693 (5.6, 34.1) | 6–97 | 209 942 (7.7) | 24.2 |
| AC | 2 739 (1.8, 10.8) | 6–106 | 55 622 (2.0) | 20.3 |
| CG | 271 (0.2, 1.1) | 6–9 | 3 356 (0.1) | 12.4 |
| Tri | 42 233 (27.1, 100) | 4–67 | 672 066 (24.7) | 15.9 |
| AAT | 14 329 (9.2, 33.9) | 4–67 | 275 664 (10.1) | 19.2 |
| AAG | 9 709 (6.2, 23.0) | 4–49 | 140 004 (5.1) | 14.4 |
| AAC | 3 937 (2.5, 9.3) | 4–32 | 57 747 (2.1) | 14.7 |
| ACT | 2 601 (1.7, 6.2) | 4–48 | 38 688 (1.4) | 14.9 |
| AGT | 2 586 (1.7, 6.1) | 4–51 | 38 715 (1.4) | 15.0 |
| CCG | 2 337 (1.5, 5.5) | 4–9 | 29 151 (1.1) | 12.5 |
| AGG | 2 327 (1.5, 5.5) | 4–61 | 34 473 (1.3) | 14.8 |
| ACC | 2 316 (1.5, 5.5) | 4–29 | 31 248 (1.1) | 13.5 |
| ACG | 1 069 (0.7, 2.5) | 4–11 | 13 491 (0.5) | 12.6 |
| AGC | 1 023 (0.7, 2.4) | 4–16 | 12 885 (0.5) | 12.6 |
| Tetra | 41 309 (26.5, 100) | 3–43 | 535 760 (19.7) | 13.0 |
| AAAT | 15 583 (10.0, 37.7) | 3–13 | 205 212 (7.5) | 13.2 |
| AATT | 6 035 (3.9, 14.6) | 3–8 | 76 076 (2.8) | 12.6 |
| AAAG | 5 491 (3.5, 13.3) | 3–16 | 71 208 (2.6) | 13.0 |
| AAAC | 1 949 (1.3, 4.7) | 3–9 | 24 192 (0.9) | 12.4 |
| ACAT | 1 554 (1.0, 3.8) | 3–43 | 23 452 (0.9) | 15.1 |
| AATC | 1 372 (0.9, 3.3) | 3–9 | 17 364 (0.6) | 12.7 |
| AACT | 1 319 (0.8, 3.2) | 3–9 | 16 892 (0.6) | 12.8 |
| AATG | 890 (0.6, 2.2) | 3–9 | 11 204 (0.4) | 12.6 |
| AAGT | 861 (0.6, 2.1) | 3–6 | 10 716 (0.4) | 12.4 |
| AGAT | 841 (0.5, 2.0) | 3–18 | 11 760 (0.4) | 14.0 |
| others | 5 414 (3.5, 13.1) | 3–14 | 67 684 (2.5) | 12.5 |
| Penta | 15 399 (9.9, 100) | 3–10 | 243 955 (9.0) | 15.8 |
| AAAAT | 4 505 (2.9, 29.3) | 3–10 | 71 830 (2.6) | 15.9 |
| AAAAG | 1 563 (1.0, 10.2) | 3–8 | 24 885 (0.9) | 15.9 |
| AAATT | 1 114 (0.7, 7.2) | 3–6 | 17 355 (0.6) | 15.6 |
| AATAT | 622 (0.4, 4.0) | 3–8 | 9 825 (0.4) | 15.8 |
| AAGAT | 582 (0.4, 3.8) | 3–6 | 9 130 (0.3) | 15.7 |
| AATAG | 546 (0.4, 3.5) | 3–10 | 8 630 (0.3) | 15.8 |
| others | 6 467 (4.2, 42.0) | 3–8 | 102 300 (3.8) | 15.8 |
| Hexa | 6 040 (3.9, 100) | 3–20 | 115 266 (4.2) | 19.1 |
| AAAAAT | 829 (0.5, 13.7) | 3–16 | 15 516 (0.6) | 18.7 |
| AAAAAG | 527 (0.3, 8.7) | 3–7 | 9 906 (0.4) | 18.8 |
| AAAATT | 387 (0.2, 6.4) | 3–6 | 7 200 (0.3) | 18.6 |
| AAATAT | 184 (0.1, 3.0) | 3–5 | 3 456 (0.1) | 18.8 |
| AATCCT | 182 (0.1, 3.0) | 3–6 | 3 396 (0.1) | 18.7 |
| others | 3 931 (2.5, 65.1) | 3–20 | 75 792 (2.8) | 19.3 |
| Total | 155 665 (100, −) | 3–146 | 2 719 280 (100) | 17.5 |
%1, the number of each nucleotide repeat accounted for the percentage of all motif number
%2, the number of each nucleotide repeat accounted for the percentage of all number of corresponding motif type
bp3, the total motif length of each nucleotide repeat type
%4, the total motif length of each nucleotide repeat type relative to the total motif length of all nucleotide repeat types
bp5, the total motif length of each nucleotide repeat type/the total number of nucleotide repeat types
Fig. 1Motif frequency distributions of mono- to hexanucleotide motif types with different repeat numbers (from 3 to >20) in the de novo assembled genomic sequences of A. hypogaea
Amplification patterns of the 1,790 developed SSR markers in the 12 inbred lines
| Primer synthesized | Amplified primers | Single amplicon | Polymorphic primers of single amplicon | Two amplicons | Three or more amplicons |
|---|---|---|---|---|---|
| 1,790 | 1,687 | 1,637 | 290 | 32 | 18 |
Fig. 2Relationship of the polymorphism rates of putative single-locus markers to the motif type (a) and repeat number (b)
Fig. 3Distribution of single-locus SSR markers on the genetic linkage map. The map was constructed using 154 F2 plants derived from Zhonghua 10 and ICG12625. The single-locus markers developed in this study are shown in boldface and are underlined. The markers are shown on the right side of the LGs, and the map distances are shown on the left side
The genetic diversity of 100 SSR markers revealed by 96 A. hypogaea accessions
| Marker |
|
| PIC | Marker |
|
| PIC | Marker |
|
| PIC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AHGA46532 | 8 | 0.00 | 0.70 | AHGA65330 | 4 | 0.00 | 0.51 | AHGA179378 | 3 | 0.01 | 0.28 |
| AHGA164448 | 3 | 0.00 | 0.18 | AHGA65333 | 6 | 0.01 | 0.65 | AHGA300244 | 8 | 0.00 | 0.68 |
| AHGA362464 | 2 | 0.01 | 0.07 | AHGA65348 | 3 | 0.00 | 0.08 | AHGA11201 | 5 | 0.01 | 0.51 |
| AHGA362480 | 3 | 0.00 | 0.10 | AHGA65349 | 3 | 0.00 | 0.37 | AHGA11209 | 2 | 0.00 | 0.11 |
| AHGA362488 | 2 | 0.00 | 0.02 | AHGA160420 | 4 | 0.01 | 0.14 | AHGA17736 | 5 | 0.00 | 0.12 |
| AHGA362499 | 5 | 0.00 | 0.44 | AHGA363105 | 7 | 0.01 | 0.60 | AHGA75014 | 2 | 0.00 | 0.22 |
| AHGA40095 | 2 | 0.00 | 0.06 | AHGA373365 | 5 | 0.00 | 0.38 | AHGA161485 | 2 | 0.00 | 0.06 |
| AHGA40106 | 2 | 0.00 | 0.04 | AHGA373382 | 7 | 0.01 | 0.53 | AHGA161495 | 9 | 0.01 | 0.79 |
| AHGA40135 | 8 | 0.04 | 0.54 | AHGA7413 | 6 | 0.00 | 0.59 | AHGA161510 | 3 | 0.00 | 0.39 |
| AHGA54444 | 2 | 0.00 | 0.30 | AHGA7429 | 8 | 0.01 | 0.68 | AHGA152194 | 3 | 0.00 | 0.40 |
| AHGA79898 | 3 | 0.00 | 0.06 | AHGA164129 | 3 | 0.00 | 0.44 | AHGA363491 | 3 | 0.00 | 0.54 |
| AHGA36568 | 2 | 0.00 | 0.09 | AHGA361225 | 3 | 0.00 | 0.29 | AHGA363492 | 2 | 0.00 | 0.37 |
| AHGA38598 | 4 | 0.00 | 0.20 | AHGA7048 | 5 | 0.00 | 0.60 | AHGA363495 | 2 | 0.00 | 0.37 |
| AHGA38612 | 2 | 0.00 | 0.37 | AHGA7051 | 4 | 0.00 | 0.35 | AHGA226115 | 3 | 0.02 | 0.38 |
| AHGA170476 | 6 | 0.00 | 0.52 | AHGA22885 | 2 | 0.00 | 0.04 | AHGA226118 | 2 | 0.00 | 0.09 |
| AHGA176207 | 2 | 0.00 | 0.32 | AHGA57727 | 6 | 0.00 | 0.50 | AHGA59791 | 2 | 0.00 | 0.11 |
| AHGA176210 | 2 | 0.08 | 0.21 | AHGA98567 | 4 | 0.00 | 0.40 | AHGA59797 | 4 | 0.10 | 0.12 |
| AHGA220404 | 17 | 0.03 | 0.85 | AHGA155736 | 3 | 0.00 | 0.12 | AHGA59809 | 3 | 0.02 | 0.36 |
| AHGA220933 | 2 | 0.00 | 0.37 | AHGA75538 | 7 | 0.01 | 0.64 | AHGA148181 | 2 | 0.00 | 0.11 |
| AHGA331177 | 7 | 0.01 | 0.73 | AHGA364906 | 4 | 0.00 | 0.57 | AHGA244586 | 2 | 0.01 | 0.30 |
| AHGA354330 | 12 | 0.01 | 0.77 | AHGA5481 | 3 | 0.00 | 0.18 | AHGA358460 | 2 | 0.00 | 0.37 |
| AHGA354339 | 3 | 0.01 | 0.33 | AHGA24894 | 10 | 0.00 | 0.75 | AHGA360266 | 4 | 0.00 | 0.41 |
| AHGA9097 | 4 | 0.00 | 0.08 | AHGA25786 | 5 | 0.00 | 0.51 | AHGA84019 | 11 | 0.00 | 0.86 |
| AHGA9103 | 5 | 0.01 | 0.51 | AHGA364915 | 2 | 0.00 | 0.37 | AHGA352202 | 3 | 0.00 | 0.40 |
| AHGA9104 | 5 | 0.00 | 0.57 | AHGA364920 | 2 | 0.00 | 0.04 | AHGA352262 | 5 | 0.00 | 0.36 |
| AHGA44695 | 4 | 0.01 | 0.42 | AHGA364936 | 6 | 0.00 | 0.52 | AHGA352280 | 2 | 0.00 | 0.13 |
| AHGA68628 | 4 | 0.01 | 0.37 | AHGA14239 | 6 | 0.00 | 0.59 | AHGA372606 | 18 | 0.00 | 0.81 |
| AHGA68647 | 4 | 0.00 | 0.41 | AHGA128473 | 2 | 0.00 | 0.08 | AHGA61572 | 3 | 0.00 | 0.36 |
| AHGA265121 | 4 | 0.00 | 0.48 | AHGA47958 | 5 | 0.01 | 0.58 | AHGA195525 | 2 | 0.00 | 0.36 |
| AHGA362520 | 2 | 0.00 | 0.35 | AHGA96458 | 4 | 0.00 | 0.54 | AHGA195527 | 2 | 0.00 | 0.02 |
| AHGA193642 | 7 | 0.00 | 0.57 | AHGA96466 | 3 | 0.00 | 0.44 | AHGA195528 | 2 | 0.00 | 0.04 |
| AHGA193650 | 7 | 0.00 | 0.57 | AHGA96491 | 4 | 0.01 | 0.49 | AHGA214492 | 2 | 0.00 | 0.09 |
| AHGA65328 | 2 | 0.00 | 0.35 | AHGA96496 | 8 | 0.00 | 0.72 | Mean | 3.85 | 0.01 | 0.33 |
| AHGA65329 | 2 | 0.00 | 0.35 | AHGA159068 | 3 | 0.01 | 0.47 | Total | 428 | - | - |
H represents observed heterozygosity, A represents number of alleles, PIC represents polymorphism information content