| Literature DB >> 24065109 |
Aletta Bester-Van Der Merwe1, Sonja Blaauw, Jana Du Plessis, Rouvay Roodt-Wilding.
Abstract
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.Entities:
Mesh:
Year: 2013 PMID: 24065109 PMCID: PMC3794836 DOI: 10.3390/ijms140919341
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of the variants of the putative single nucleotide polymorphisms (SNPs) detected in vitro and in silico for H. midae following two bioinformatic pipelines.
| 58 | 256 | |
| 105 | 400 | |
| A/T | 15 (14.3%) | 52 (13.0%) |
| A/C | 7 (6.7%) | 35 (8.8%) |
| C/G | 10 (9.5%) | 15 (3.8%) |
| T/G | 6 (5.7%) | 43 (10.8%) |
| A/G | 35 (33.3%) | 124 (31.0%) |
| T/C | 30 (28.6%) | 129 (32.3%) |
| 2 (1.9%) | 2 (0.5%) | |
Comparison of the genotyping success of two GoldenGate assays (Plex-48 and Plex-192) assembled for H. midae. EST, expressed sequence tag.
| Plex-48 | Plex-192 | |
|---|---|---|
| Functionality score | 0.75 | 0.8 |
| ESTs/Contigs | 35 | 139 |
| Feasible SNPs | 48 | 186 |
| GenTrain score | 0.25 | 0.45 |
| Total | 48 | 186 |
| Failures | 7 (14.58%) | 44 (23.7%) |
| Monomorphic | 10 (20.83%) | 14 (7.5%) |
| Polymorphic | 31 (64.58%) | 128 (68.8%) |
Figure 1Genoplots obtained with the GenomeStudio™ Genotyping Module representative of (a) a failed SNP; (b) a successfully genotyped (polymorphic) SNP; and (c) a monomorphic SNP analyzed in this study.
Annotation, variants and predicted gene location and functional effect of the 48 feasible SNPs validated in the GoldenGate genotyping assay, Plex-48. UTR, untranslated region.
| SNP Name | EST/Contig | Functional annotation | Variant | SNP effect |
|---|---|---|---|---|
| 3B4 | 60s Ribosomal protein L8 | T/C | UTR | |
| A/T | UTR | |||
|
| ||||
| 3D10 | Hemocyanin | A/G | UTR | |
|
| ||||
| 2H9 | Ribosomal protein L22 | A/T | UTR | |
|
| ||||
| HdSNPc148 | Actin | T/C | Synonymous | |
|
| ||||
| HdSNPc106 | Tubulin alpha-1a chain isoform 2 | C/T | Synonymous | |
|
| ||||
| HmSNPc4 | Microsatellite sequence | C/T | Non-synonymous | |
|
| ||||
| HaSNPdw | Microsatellite sequence | C/T | UTR | |
|
| ||||
| HmLCS5M | - | T/A | - | |
|
| ||||
| Opacity protein | C/T | Synonymous | ||
|
| ||||
| HmLCS55T | Microsatellite sequence | G/T | Non-synonymous | |
|
| ||||
| HmRS36T | 5-Formyltetrahydrofolate cyclo-ligase | T/C | Synonymous | |
|
| ||||
| Contig 101 | Myosin heavy chain | A/C | UTR | |
| C/G | UTR | |||
|
| ||||
| Contig 146 | ADP/ATP carrier protein | A/G | Synonymous | |
| T/G | Non-synonymous | |||
|
| ||||
| Contig 149 | Heat shock protein 70 | A/C | UTR | |
| C/G | Non-synonymous | |||
| A/G | Synonymous | |||
| A/G | UTR | |||
| T/C | UTR | |||
|
| ||||
| Contig 210 | - | T/G | - | |
|
| ||||
| Contig 214 | Ribosomal protein L10 | T/C | Non-synonymous | |
| T/C | UTR | |||
|
| ||||
| Contig 342 | Heat shock protein | T/C | UTR | |
|
| ||||
| Contig 449 | s-Adenosylmethionine synthetase isoform type-1 | A/G | Non-synonymous | |
| T/C | Synonymous | |||
|
| ||||
| Contig 1718 | NADH dehydrogenase subunit 1 | A/T | UTR | |
|
| ||||
| Contig 1833 | Alpha tubulin | A/G | UTR | |
|
| ||||
| Contig 1834 | Tubulin alpha-1a chain- partial | A/G | UTR | |
| A/G | Non-synonymous | |||
|
| ||||
| Contig 1949 | Ribosomal protein L3 | A/C | UTR | |
|
| ||||
| Contig 4691 | Heat shock protein 70 | A/G | UTR | |
|
| ||||
| Contig 17550 | Clathrin heavy chain 1 | A/G | Non-synonymous | |
| A/T | UTR | |||
| A/T | UTR | |||
|
| ||||
| Contig 48 | Collagen alpha-4 chain | T/G | UTR | |
|
| ||||
| Contig 67 | Collagen alpha-6 partial | A/G | UTR | |
|
| ||||
| Contig 140 | Na+ K+-ATPase alpha subunit | T/C | Synonymous | |
|
| ||||
| Contig 229 | 14-3-3 Zeta | T/C | UTR | |
|
| ||||
| Contig 300 | - | A/G | - | |
|
| ||||
| Contig 972 | Myosin heavy chain | T/C | Synonymous | |
|
| ||||
| Contig 1001 | Cre-sca-1 protein | T/C | UTR | |
|
| ||||
| Contig 2091 | - | A/C | - | |
|
| ||||
| Contig 3129 | Arginine kinase | A/G | Synonymous | |
|
| ||||
| Contig 5837 | Mucus-associated protein partial | T/C | Non-synonymous | |
|
| ||||
| Contig 13865 | Cathepsin l | T/C | UTR | |
|
| ||||
| Contig 20648 | Filamin-c isoform 4 | A/G | Synonymous | |
Genetic diversity estimates of the polymorphic SNP markers (Plex-48) for H. midae individuals. HW, Hardy-Weinberg.
| SNP Name | Minor allele frequency | Heterozygosity | Inbreeding coefficient | Probability | |
|---|---|---|---|---|---|
| Observed | Expected | ||||
| 0.4549 | 0.592 | 0.465 | −0.274 | 0.01 | |
| 0.1901 | 0.537 | 0.473 | −0.134 | <0.01 | |
| 0.0391 | 0.281 | 0.472 | 0.406 | 0.001 | |
| 0.4564 | 0.183 | 0.172 | −0.066 | 0.000 | |
| 0.0015 | 0.721 | 0.462 | −0.563 | <0.01 | |
| 0.0420 | 0.009 | 0.331 | 0.972 | 0.002 | |
| 0.0557 | 0.069 | 0.073 | 0.049 | 0.530 | |
| 0.0015 | 0.091 | 0.126 | 0.278 | 0.786 | |
| 0.1352 | 0.285 | 0.245 | −0.165 | 0.011 | |
| 0.0651 | 0.025 | 0.031 | 0.189 | 0.799 | |
| 0.4079 | 0.483 | 0.478 | −0.009 | 0.001 | |
| 0.2464 | 0.003 | 0.003 | 0.000 | - | |
| 0.0669 | 0.044 | 0.043 | −0.021 | 1.000 | |
| 0.0000 | 0.041 | 0.040 | −0.019 | 0.875 | |
| 0.1023 | 0.098 | 0.111 | 0.116 | 0.793 | |
| 0.4035 | 0.32 | 0.497 | 0.357 | 0.000 | |
| 0.4781 | 0.013 | 0.012 | −0.005 | - | |
| 0.0014 | 0.788 | 0.489 | −0.614 | <0.01 | |
| 0.1764 | 0.309 | 0.266 | −0.162 | 0.000 | |
| 0.0667 | 0.138 | 0.353 | 0.610 | 0.025 | |
| 0.0189 | 0.013 | 0.013 | −0.005 | - | |
| 0.0841 | 0.119 | 0.112 | −0.062 | 0.081 | |
| 0.0696 | 0.182 | 0.176 | −0.033 | 0.181 | |
| 0.1272 | 0.178 | 0.162 | −0.096 | 0.310 | |
| 0.0338 | 0.099 | 0.128 | 0.228 | 0.880 | |
| 0.0678 | 0.195 | 0.181 | −0.075 | 0.058 | |
| 0.2267 | 0.082 | 0.394 | 0.794 | <0.01 | |
| 0.1893 | 0.280 | 0.250 | −0.119 | 0.000 | |
| 0.2580 | 0.314 | 0.486 | 0.354 | 0.001 | |
| 0.2246 | 0.318 | 0.462 | 0.312 | <0.01 | |
| 0.0889 | 0.164 | 0.166 | 0.015 | 0.063 | |
-no p-value was assigned.
Figure 2Factorial component analysis (FCA) based on 31 SNP loci in six H. midae populations. The respective populations are indicated by color: Family DS_1 (Yellow), Family DS_5 (Blue), Family DS_6 (White), Riet Point (Grey), Saldanha Bay (Pink) and Witsand (Green). The wild populations are encircled.
Sample sizes of H. midae populations genotyped in Plex-48 and Plex-192.
| Sample origin | Number of individuals | |
|---|---|---|
| Plex-48 | Plex-192 | |
| Family DS 1 | 103 | 70 |
| Family DS_2 | 94 | 87 |
| Family DS_5 | 90 | - |
| Family DS_6 | 94 | - |
| Family D | - | 72 |
| Family H | - | 71 |
| Family I | - | 81 |
| Family J | - | 72 |
| Saldanha Bay | 23 | - |
| Witsand | 26 | - |
| Riet Point | 26 | - |
| Positive controls | 2 per plate | 3 per plate |
| Negative controls | - | 1 per plate |