| Literature DB >> 24277739 |
Wenqian Jiao1, Xiaoteng Fu, Jinqin Li, Ling Li, Liying Feng, Jia Lv, Lu Zhang, Xiaojian Wang, Yangping Li, Rui Hou, Lingling Zhang, Xiaoli Hu, Shi Wang, Zhenmin Bao.
Abstract
Mollusca is the second most diverse group of animals in the world. Despite their perceived importance, omics-level studies have seldom been applied to this group of animals largely due to a paucity of genomic resources. Here, we report the first large-scale gene-associated marker development and evaluation for a bivalve mollusc, Chlamys farreri. More than 21,000 putative single-nucleotide polymorphisms (SNPs) were identified from the C. farreri transcriptome. Primers and probes were designed and synthesized for 4500 SNPs, and 1492 polymorphic markers were successfully developed using a high-resolution melting genotyping platform. These markers are particularly suitable for population genomic analysis due to high polymorphism within and across populations, a low frequency of null alleles, and conformation to neutral expectations. Unexpectedly, high cross-species transferability was observed, suggesting that the transferable SNPs may largely represent ancestral genetic variations that have been preserved differentially among subfamilies of Pectinidae. Gene annotations were available for 73% of the markers, and 65% could be anchored to the recently released Pacific oyster genome. Large-scale association analysis revealed key candidate genes responsible for scallop growth regulation, and provided markers for further genetic improvement of C. farreri in breeding programmes.Entities:
Keywords: genome-wide association (GWAS); high resolution melting (HRM); mollusca; single-nucleotide polymorphism (SNP); transcriptome
Mesh:
Substances:
Year: 2013 PMID: 24277739 PMCID: PMC3989488 DOI: 10.1093/dnares/dst048
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Distribution of sequencing coverage for the SNPs detected in the C. farreri transcriptome.
Figure 2.A schematic workflow describing SNP marker development and evaluation in C. farreri.
Overview of the efficiency of HRM-based SNP marker development in C. farreri
| Minor allele frequency ranges | No. of primers designed | No. of primer validated (% of designed) | No. of probe validated (% of designed) | No. of polymorphic loci (% of designed) |
|---|---|---|---|---|
| 0.4–0.5 | 1538 | 1056 (69%) | 761 (49%) | 571 (37%) |
| 0.3–0.4 | 1422 | 954 (67%) | 701 (49%) | 506 (36%) |
| 0.2–0.3 | 832 | 559 (67%) | 422 (51%) | 266 (32%) |
| 0.1–0.2 | 636 | 418 (66%) | 309 (49%) | 133 (21%) |
| 0–0.1 | 72 | 55 (76%) | 38 (53%) | 16 (22%) |
The cross-species transferability of 16 C. farreri SNP markers
| Marker name | Species name | ||||
|---|---|---|---|---|---|
| C10745S115_CG | √ | √ | √ | √ | |
| C12208S460_TC | √ | √ | √ | ||
| C49010S592_AG | √ | √ | √ | ||
| C49829S586_AC | √ | √ | √ | ||
| C30426S483_TC | √ | √ | |||
| C47021S446_CA | √ | √ | |||
| C34305S164_CG | √ | √ | |||
| C15644S268_AG | √ | ||||
| C13134S327_TC | √ | ||||
| C37392S686_GA | √ | ||||
| C20740S340_TC | √ | ||||
| C11312S559_GA | √ | ||||
| C20815S313_AT | √ | ||||
| C19522S376_TG | √ | ||||
| C30287S1286_TG | √ | ||||
| C15676S348_AG | √ | ||||
Non-synonymous SNP makers are indicated in grey.
Figure 3.Schematic representation of the correspondence between eight C. farreri SNP markers and Pacific oyster scaffold419 (genome assembly v9). Oyster proteins and the related C. farreri SNPs are indicated below the scaffold.
Summary of the oyster scaffolds matched by C. farreri SNP markers
| Total marker no. matched to each scaffold | No. of matched oyster scaffolds | Length of matched scaffolds (kb) |
|---|---|---|
| 12 | 1 | 1620 |
| 10 | 1 | 1193 |
| 9 | 1 | 1726 |
| 8 | 7 | 649–1965 |
| 7 | 4 | 969–1120 |
| 6 | 10 | 650–1697 |
| 5 | 15 | 538–1855 |
| 4 | 25 | 63–1372 |
| 3 | 63 | 58–1861 |
| 2 | 108 | 31–1727 |
| 1 | 279 | 2–1100 |
Figure 4.The shell-height distribution of approximately 1000 C. farreri individuals collected from each of two C. farreri populations. For the large body size (LBS) groups, the average trait value was 75.59 ± 2.92 mm for population a and 72.02 ± 2.57 mm for population b, whereas for small body size (SBS) groups, it was 49.93 ± 3.84 and 53.04 ± 3.41 mm for populations a and b, respectively. For each population, 40 individuals were sampled from each group for association analysis.
Figure 5.Example HRM profiles for two growth-related markers, C10973S307_GA and C7493S233_CT, identified by genotyping DNA pools prepared from both the LBS and SBS groups. Clear differences in allele frequencies were observed between groups.
Comparison of the allele and genotype frequency of five growth-related markers between LBS and SBS groups sampled from two C. farreri populations.
| Maker name | LBS group (population 1/population 2) | SBS group (population 1/population 2) | Allele frequency | Genotype frequency | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Allele frequency | Genotype frequency | Allele frequency | Genotype frequency | |||||||||
| C10973S307_GA | 0.51/0.29 (A) | 0.49/0.71 (G) | 0.36/0.07 (AA) | 0.31/0.44 (GA) | 0.33/0.49 (GG) | 0.86/0.74 (A) | 0.14/0.26 (G) | 0.76/0.68 (AA) | 0.18/0.12 (GA) | 0.05/0.20 (GG) | 8.86 | 5.00 |
| C6068S140_CA | 0.49/0.41 (A) | 0.51/0.59 (C) | 0.33/0.24 (AA) | 0.31/0.33 (CA) | 0.36/0.43 (CC) | 0.81/0.80 (A) | 0.19/0.20 (C) | 0.70/0.76 (AA) | 0.22/0.08 (CA) | 0.08/0.16 (CC) | 3.98 | 2.24 |
| C13555S183_TG | 0.45/0.32 (G) | 0.55/0.68 (T) | 0.28/0.13 (GG) | 0.34/0.38 (TG) | 0.38/0.49 (TT) | 0.77/0.74 (G) | 0.23/0.26 (T) | 0.69/0.64 (GG) | 0.17/0.18 (TG) | 0.14/0.18 (TT) | 6.10 | 1.70 |
| C17803S726_GA | 0.55/0.76 (A) | 0.45/0.24 (G) | 0.40/0.72 (AA) | 0.30/0.08 (GA) | 0.30/0.20 (GG) | 0.25/0.47 (A) | 0.75/0.53 (G) | 0.10/0.36 (AA) | 0.29/0.23 (GA) | 0.61/0.41 (GG) | 1.44 | 5.01 |
| C7493S233_CT | 0.96/0.85 (C) | 0.04/0.15 (T) | 0.94/0.77 (CC) | 0.03/0.15 (CT) | 0.03/0.08 (TT) | 0.71/0.53 (C) | 0.29/0.47 (T) | 0.59/0.31 (CC) | 0.23/0.43 (CT) | 0.18/0.26 (TT) | 1.73 | 8.39 |
aSignificant after Bonferroni correction.
For each marker, alleles or genotypes are indicated in parenthesis below their frequencies.