| Literature DB >> 25314602 |
Jin Zhang1, Wenge Ma1, Xiaomin Song1, Qiaohong Lin1, Jian-Fang Gui2, Jie Mei3.
Abstract
Yellow catfish (Pelteobagrus fulvidraco) is one of the most important freshwater fish due to its delicious flesh and high nutritional value. However, lack of sufficient simple sequence repeat (SSR) markers has hampered the progress of genetic selection breeding and molecular research for yellow catfish. To this end, we aimed to develop and characterize polymorphic expressed sequence tag (EST)-SSRs from the 454 pyrosequencing transcriptome of yellow catfish. Totally, 82,794 potential EST-SSR markers were identified and distributed in the coding and non-coding regions. Di-nucleotide (53,933) is the most abundant motif type, and AC/GT, AAT/ATT, AAAT/ATTT are respective the most frequent di-, tri-, tetra-nucleotide repeats. We designed primer pairs for all of the identified EST-SSRs and randomly selected 300 of these pairs for further validation. Finally, 263 primer pairs were successfully amplified and 57 primer pairs were found to be consistently polymorphic when four populations of 48 individuals were tested. The number of alleles for the 57 loci ranged from 2 to 17, with an average of 8.23. The observed heterozygosity (HO), expected heterozygosity (HE), polymorphism information content (PIC) and fixation index (fis) values ranged from 0.04 to 1.00, 0.12 to 0.92, 0.12 to 0.91 and -0.83 to 0.93, respectively. These EST-SSR markers generated in this study could greatly facilitate future studies of genetic diversity and molecular breeding in yellow catfish.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25314602 PMCID: PMC6271634 DOI: 10.3390/molecules191016402
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Distribution of EST-SSRs across the 5' UTR, CDS and 3' UTR in yellow catfish. Number of SSRs located on non-coding and coding region (A) and the distributions of SSRs with different motif sizes (B).
Frequency of different repeat motifs among the EST-SSRs of yellow catfish.
| Repeats | Mo | Di | Tri | Tetra | Penta | Hexa | Total | Percentage (%) |
|---|---|---|---|---|---|---|---|---|
| 5 | - | 0 | 2654 | 1843 | 253 | 43 | 4793 | 5.79 |
| 6 | - | 12,561 | 1347 | 994 | 80 | 22 | 15,004 | 18.12 |
| 7 | - | 7110 | 893 | 632 | 44 | 8 | 8687 | 10.49 |
| 8 | - | 4411 | 537 | 421 | 16 | 5 | 5390 | 6.51 |
| 9 | - | 3248 | 384 | 316 | 18 | 3 | 3969 | 4.79 |
| 10 | 6769 | 2429 | 276 | 289 | 19 | 2 | 9784 | 11.82 |
| 11 | 3055 | 1972 | 263 | 225 | 15 | 0 | 5530 | 6.68 |
| 12 | 1805 | 1628 | 244 | 194 | 4 | 1 | 3876 | 4.68 |
| 13 | 995 | 1418 | 207 | 144 | 14 | 0 | 2778 | 3.36 |
| 14 | 602 | 1260 | 206 | 129 | 6 | 0 | 2203 | 2.66 |
| 15 | 392 | 1112 | 173 | 132 | 2 | 0 | 1811 | 2.19 |
| 16 | 174 | 1008 | 186 | 96 | 2 | 0 | 1466 | 1.77 |
| 17 | 136 | 896 | 141 | 110 | 1 | 0 | 1284 | 1.55 |
| 18 | 80 | 846 | 113 | 64 | 0 | 0 | 1103 | 1.33 |
| 19 | 53 | 806 | 128 | 60 | 3 | 0 | 1050 | 1.27 |
| 20 | 26 | 799 | 90 | 46 | 1 | 0 | 962 | 1.16 |
| 21 | 18 | 731 | 81 | 58 | 0 | 0 | 888 | 1.07 |
| 22 | 13 | 688 | 54 | 44 | 0 | 0 | 799 | 0.97 |
| 23 | 12 | 713 | 44 | 48 | 0 | 0 | 817 | 0.99 |
| 24 | 5 | 709 | 30 | 26 | 0 | 0 | 770 | 0.93 |
| 25 | 3 | 655 | 23 | 30 | 0 | 0 | 711 | 0.86 |
| 26 | 4 | 634 | 12 | 23 | 0 | 0 | 673 | 0.81 |
| 27 | 1 | 648 | 9 | 20 | 0 | 0 | 678 | 0.82 |
| 28 | 3 | 573 | 3 | 12 | 0 | 0 | 591 | 0.71 |
| 29 | 0 | 594 | 1 | 12 | 0 | 0 | 607 | 0.73 |
| 30 | 3 | 563 | 1 | 12 | 0 | 0 | 579 | 0.70 |
| 31 | 5 | 521 | 0 | 6 | 0 | 0 | 532 | 0.64 |
| 32 | 2 | 479 | 2 | 7 | 0 | 0 | 490 | 0.59 |
| 33 | 0 | 462 | 2 | 2 | 0 | 0 | 466 | 0.56 |
| 34 | 0 | 432 | 0 | 3 | 0 | 0 | 435 | 0.53 |
| 35 | 1 | 421 | 0 | 5 | 0 | 0 | 427 | 0.52 |
| 36 | 0 | 394 | 0 | 5 | 0 | 0 | 399 | 0.48 |
| >36 | 11 | 3212 | 0 | 19 | 0 | 0 | 3242 | 3.92 |
|
| 14,168 | 53,933 | 8104 | 6027 | 478 | 84 | 82,794 | 100.00 |
|
| 17.11 | 65.14 | 9.79 | 7.28 | 0.58 | 0.10 | 100.00 |
Figure 2Characterization and frequency of different motifs among dinucleotide repeats (A), trinucleotide repeats (B) and the tetranucleotide repeats (C) EST-SSRs of yellow catfish.
Figure 3PCR amplification profiles of 48 yellow catfish accessions using primer pair H86. The PCR amplified products were separated on 7% polyacrylamide gel. M indicated the molecular markers.
Characteristics of the 57 EST-SSR markers for yellow catfish. Population genetic diversity analysis at 57 SSR loci was shown under the parameters: number of alleles per locus (N), observed heterozygosity (H), expected heterozygosity (H), polymorphic information content (PIC) and fixation index (FIS).
| EST-SSR | Repeat Motif | Primer Sequences (5'–3') | T a (°C) | Allele Size Range (bp) | Description of Putative Function | GenBank Accession No. | Heterozygosity | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| H2 | (AAT)13 | F: CTTCCAGGGGGCTTCTAAGT | 51 | 138–180 | F-box and WD repeat containing protein 7 | KM211716 | 7 | 0.604 | 0.831 | 0.80 | 0.266 |
| H6 | (ATAG)16 | F: TGTTGTAATCTCTCAATGAAGGTG | 53 | 252–348 | Transposable element Tc1 transposase | KM216910 | 13 | 0.729 | 0.865 | 0.84 | 0.148 |
| H13 | (GT)10 | F: AGAGCTAGGCCAAACTGCTG | 53 | 141–205 | Calcium binding protein 39 | KM236563 | 7 | 0.917 | 0.720 | 0.67 | −0.286 |
| H15 | (CA)15 | F: CTCGACCAGTCCTGAGCTTC | 53 | 209–240 | NF-kappa-B inhibitor beta | KM216912 | 5 | 0.271 | 0.565 | 0.47 | 0.515 |
| H16 | (CA)17 | F: GAGAGACAGCGAGCCTCAGT | 58 | 121–180 | NEDD4–like E3 ubiquitin protein ligase WWP2 | KM216871 | 16 | 1.000 | 0.924 | 0.91 | −0.094 |
| H17 | (TTA)14 | F: ACCACCTCCGAGACACGC | 57 | 110–172 | Hypothetical protein | KM216905 | 7 | 0.500 | 0.815 | 0.78 | 0.380 |
| H20 | (TTA)17 | F: ATGTGTTTCCCACAGTGCAG | 58 | 152–248 | No significant match | KM216903 | 11 | 0.542 | 0.824 | 0.80 | 0.336 |
| H28 | (TGGAGC)6 | F: GGGGCCTCTTGGGTTATTTA | 57 | 153–216 | Gonadal-soma derived growth factor precursor | KM216886 | 7 | 0.375 | 0.725 | 0.68 | 0.477 |
| H29 | (TTTTA)7 | F: GCCCTACAGCAGAGCTGAAC | 57 | 102–132 | Protein regulator of cytokinesis 1a | KM216864 | 4 | 0.417 | 0.550 | 0.47 | 0.234 |
| H32 | (TGATGT)8 | F: TTCGGGTAAAAAGTGATCCG | 58 | 197–345 | Predicted protein | KM216901 | 10 | 0.500 | 0.774 | 0.74 | 0.347 |
| H66 | (AG)7 | F: ATGGGATGACCAGGAGACAG | 59 | 263–300 | cAMP-dependent protein kinase catalytic subunit beta | KM236564 | 3 | 0.083 | 0.120 | 0.12 | 0.299 |
| H77 | (TG)7 | F: AAGCATAGATTTGCGCGTCT | 58 | 264–334 | Glucocorticoid receptor 2 | KM216888 | 3 | 0.354 | 0.298 | 0.26 | −0.201 |
| H78 | (GTAT)9 | F: GACCAAAGTGGATCGGACTC | 62 | 273–378 | Glucocorticoid receptor 2 | KM216909 | 3 | 1.000 | 0.552 | 0.44 | −0.829 |
| H84 | (AC)24 | F: TGTAAAGGGGGAAAACCACA | 58 | 202–284 | Low density lipoprotein receptor | KM216916 | 7 | 1.000 | 0.837 | 0.81 | −0.207 |
| H86 | (TG)11tc(TG)8 | F: CTCCTCCAGAGTGTCTTCGG | 59 | 255–305 | Adenylate cyclase type 5 | KM216892 | 9 | 0.917 | 0.715 | 0.66 | −0.297 |
| H89 | (TGGA)5 | F: AATGACAATAGGGTGCGGAG | 59 | 269–339 | No significant match | KM216896 | 3 | 0.208 | 0.194 | 0.18 | −0.085 |
| H96 | (GAAT)5 | F: GCACTCCGTCCAAGGTGTAT | 59 | 173–181 | No significant match | KM216857 | 2 | 0.292 | 0.252 | 0.22 | −0.171 |
| H106 | (TTCT)5 | F: TGATTTTTGGGACAGAGGAAA | 59 | 202–264 | No significant match | KM216856 | 14 | 0.604 | 0.903 | 0.88 | 0.324 |
| H107 | (TTCT)5 | F: TGATTTTTGGGACAGAGGAAA | 58 | 238–294 | No significant match | KM216891 | 5 | 0.375 | 0.622 | 0.56 | 0.391 |
| H109 | (TTTTG)6 | F: TATTTCCCTGTGGTGCTTCC | 58 | 275–315 | Heterogeneous nuclear ribonucleoprotein U protein 1 | KM216875 | 13 | 0.417 | 0.908 | 0.89 | 0.537 |
| H114 | (TCTGT)5 | F: TGAGGGGGTGCTAACTTTTG | 59 | 215–322 | Probable palmitoyltransferase ZDHHC20–like | KM216914 | 5 | 0.313 | 0.636 | 0.57 | 0.503 |
| H135 | (ATCTA)5 | F: GCATGACAGTGCTCGTTGTT | 59 | 140–225 | No significant match | KM216858 | 9 | 0.563 | 0.737 | 0.69 | 0.229 |
| H139 | (TTAGC)6 | F: GCTAGCGGCATTGTTAGCAT | 58 | 154–204 | Cyclin-dependent kinase 2 associated protein 2 | KM216895 | 4 | 0.042 | 0.609 | 0.52 | 0.931 |
| H147 | (TCTA)25 | F: TTGCCCAATTATACCACTTGC | 58 | 229-264 | Uncharacterized protein LOC101056656, partial | KM216859 | 14 | 0.563 | 0.818 | 0.79 | 0.305 |
| H149 | (ATCT)22 | F: TTGCACTTATTGGGGATGTG | 58 | 210–272 | Hypothetical protein PANDA_009670 | KM216860 | 11 | 0.604 | 0.790 | 0.76 | 0.227 |
| H151 | (TGTT)11 | F: CACTGATGATGGAATTGGGA | 59 | 143–183 | Glycogen phosphorylase, liver form | KM216904 | 5 | 0.438 | 0.711 | 0.65 | 0.378 |
| H152 | (AGTT)15 | F: GAAACGGATATTTAGTGGGGG | 59 | 191–252 | No significant match | KM216879 | 10 | 0.771 | 0.868 | 0.84 | 0.102 |
| H153 | (ACAT)12 | F: TGCCAGTATCTGACAACCCA | 58 | 164–204 | Collagen type IV alpha-3–binding protein-like | KM216898 | 8 | 0.625 | 0.762 | 0.72 | 0.172 |
| H154 | (TTTC)14 | F: GAACTGTCCTTTGCTTTCGC | 58 | 223–283 | E3 ubiquitin-protein ligase MIB2 | KM216861 | 17 | 0.604 | 0.924 | 0.91 | 0.339 |
| H155 | (AATA)15 | F: CCTTTCTATTGTGCGTTGGC | 59 | 232–344 | No significant match | KM216862 | 11 | 0.604 | 0.857 | 0.83 | 0.288 |
| H156 | (AAAT)15 | F: CATAACCGCACTGAATATGTGA | 58 | 211–259 | Family with sequence similarity 222, member B | KM216885 | 7 | 0.521 | 0.801 | 0.77 | 0.343 |
| H158 | (ATTT)16 | F: ATCCATGCATCCTTCACACA | 60 | 223–307 | No significant match | KM216894 | 6 | 0.500 | 0.753 | 0.71 | 0.329 |
| H159 | (ATCT)22 | F: TTCATTGCTTAGTCTAGTTTACATC | 58 | 217–332 | No significant match | KM216893 | 4 | 0.271 | 0.613 | 0.55 | 0.554 |
| H160 | (TTCT)11 | F: CGTTGCACATTGGTGGTTTA | 59 | 217–278 | No significant match | KM216865 | 14 | 0.417 | 0.751 | 0.73 | 0.440 |
| H161 | (CCAT)11 | F: AGCAACAGTCGAGGAGCATA | 59 | 161–202 | Hypothetical protein PANDA_019388 | KM216854 | 8 | 0.792 | 0.779 | 0.74 | −0.027 |
| H163 | (AAAT)11 | F: GCCTTGATCAGCTTTCTTCC | 58 | 286–382 | No significant match | KM216884 | 4 | 0.583 | 0.659 | 0.59 | 0.106 |
| H165 | (CACT)11 | F: GCGGAGACGCTTTCTGTATC | 58 | 171–255 | Muscle creatine kinase | KM216887 | 9 | 0.583 | 0.823 | 0.79 | 0.284 |
| H166 | (TGTT)11 | F: AGCGTTAGCGTTAGCATCGT | 58 | 157–233 | Hypothetical protein ZEAMMB73_428483 | KM216899 | 14 | 0.729 | 0.838 | 0.81 | 0.121 |
| H168 | (ATCC)10 | F: TGATCACGTGACCTCAGAGC | 58 | 258–334 | No significant match | KM216863 | 5 | 0.417 | 0.537 | 0.46 | 0.216 |
| H169 | (CATC)11 | F: CGATCACATGTCACTCCTCC | 58 | 221–292 | Rho GTPase-activating protein 7–like | KM216906 | 7 | 0.563 | 0.805 | 0.77 | 0.294 |
| H171 | (ATAC)10 | F: GATTCACCCAAAATGACATGG | 58 | 173–248 | Tribbles homolog 3 | KM216872 | 10 | 0.271 | 0.492 | 0.48 | 0.444 |
| H172 | (AGAA)10 | F: AGTGGTTCCGTTGAGGGTTT | 58 | 255–328 | No significant match | KM216913 | 6 | 0.500 | 0.762 | 0.72 | 0.337 |
| H176 | (AATA)10 | F: TGAAGGTCAGAAATGCAGAGC | 58 | 118–145 | No significant match | KM216876 | 5 | 0.833 | 0.761 | 0.71 | −0.107 |
| H203 | (TGAT)8 | F: CAGAGCCGGTGTTTCTTTTC | 58 | 131–157 | Protein LBH-like | KM216869 | 9 | 0.521 | 0.786 | 0.75 | 0.330 |
| H216 | (CTTT)8 | F: GATGATGAGTTGCATGACGC | 58 | 113–151 | No significant match | KM216874 | 6 | 0.625 | 0.729 | 0.69 | 0.134 |
| H217 | (ATTT)8 | F: CTCGAATGGAAAAACCATCTG | 58 | 231–257 | No significant match | KM216908 | 5 | 0.458 | 0.656 | 0.59 | 0.294 |
| H228 | (TTTA)8 | F: CGGAGACGCTTAAGGACTTG | 61 | 204–272 | Zgc:63767 protein | KM216915 | 12 | 0.354 | 0.835 | 0.81 | 0.572 |
| H229 | (ATTT)8 | F: TTTTGCAAACGAATATCACCA | 58 | 197–252 | No significant match | KM216907 | 11 | 0.479 | 0.765 | 0.74 | 0.367 |
| H233 | (ATCA)8 | F: CCACTCGGAAAGCTCAGAAC | 58 | 244–286 | No significant match | KM216890 | 8 | 0.229 | 0.497 | 0.47 | 0.534 |
| H237 | (TCTT)8 | F: TGGAGTAGTGCTGGTTCACG | 58 | 248–301 | No significant match | KM216880 | 12 | 0.458 | 0.841 | 0.82 | 0.449 |
| H246 | (ATA)9 | F: GACGCAGCTCGTGAATGTTA | 58 | 223–294 | No significant match | KM216883 | 10 | 0.625 | 0.821 | 0.79 | 0.230 |
| H249 | (ATT)13 | F: GGGGAATAGTTATGAAAATGGG | 58 | 276–326 | No significant match | KM216877 | 9 | 0.229 | 0.684 | 0.62 | 0.662 |
| H251 | (AATG)9 | F: CTGAGATAGGCACAGGCTCC | 58 | 244–324 | C1orf43–like protein | KM216866 | 9 | 0.375 | 0.656 | 0.63 | 0.423 |
| H254 | (ATAA)8 | F: TTCACTCAAATTCGTGTTCAAA | 58 | 282–319 | No significant match | KM216870 | 7 | 0.646 | 0.685 | 0.64 | 0.048 |
| H256 | (GAAT)8 | F: CAATGCACAAGCATGTAGGG | 58 | 212–346 | No significant match | KM216902 | 15 | 0.792 | 0.879 | 0.86 | 0.090 |
| H259 | (ATTT)12 | F: CAGCATGGCCTTTCTTTGTT | 56 | 263–326 | No significant match | KM216853 | 8 | 0.333 | 0.613 | 0.59 | 0.451 |
| H260 | (TCTG)17 | F: GGATGTGGAGAGGCTTTGAA | 58 | 218–248 | No significant match | KM216855 | 6 | 0.208 | 0.620 | 0.55 | 0.660 |