Xuedi Du, Kai Song, Jinpeng Wang, Rihao Cong, Li Li, Guofan Zhang.
Abstract
Carotenoids are commonly deposited in the gonads of marine bivalves but rarely in their adductor muscles. An orange-adductor variant was identified in our breeding program for the bay scallop Argopecten irradians. In the present study, genome survey sequencing of the bay scallop was conducted, followed by a genotyping-by-sequencing (GBS)-based case-control association analysis in a selfing family that segregated for adductor color. K-mer analysis (K = 17) indicated that the bay scallop genome is about 990 Mb in length. De novo assembly produced 217,310 scaffolds, which covered 72.1% of the whole genome and 72,187 transcripts, yielding the most informative sequence resource for bay scallop to date. The average carotenoid content of the orange-adductor progeny was significantly higher than that of the white-adductor progeny, so 20 individuals from each subgroup were sampled for case-control analysis. A total of 15,224 heterozygous loci were identified in the parent, of which 9,280 were genotyped in at least 10 individuals from each of the two subgroups. Association analysis indicated that 126 SNPs were associated with carotenoid accumulation in the adductor muscle and that 88 of these were significantly enriched on 28 scaffolds (FDR-controlled P < 0.05). The SNPs and genes located on these scaffolds can serve as valuable candidates for further research into the mechanisms by which marine bivalves accumulate carotenoids in their adductor muscles.
Keywords: Bay scallop; Carotenoid accumulation; Draft genome; GBS; SNP
Year: 2017 PMID: 28775792 PMCID: PMC5535694 DOI: 10.7150/jgen.19146
Source DB: PubMed Journal: J Genomics
Figure 1. Variation in the color of adductor muscles from a self-fertilized family of Argopecten irradians. (a) White-adductor progeny; (b) orange-adductor progeny.
Summary of genome survey sequencing data for Argopecten irradians
| Libraries | Insert size (bp) | Read length (bp) | Total data (Gb) |
|---|---|---|---|
| L1 | 180 | 2×100 | 52 |
| L2 | 500 | 2×100 | 42 |
| Total | - | - | 94 |
Figure 2. Frequency distribution of 17-mer sequencing reads in the Argopecten irradians draft genome, as a function of sequencing depth. (a) Frequency distribution of all 17-mer reads. (b) Frequency distribution of unique 17-mer reads. The peak at Depth = 40 implies heterozygosity, whereas the peak at Depth = 160 implies repetition.
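A k-mer depth histogram like the one in Figure 2 is commonly turned into a genome size estimate by dividing the total number of k-mers by the depth of the main (homozygous) peak. The sketch below illustrates that general approach only. The record does not state which formula the authors used or the depth of the main peak, so `estimate_genome_size`, its `min_depth` error cutoff, and the toy histogram are all assumptions made for illustration.

```python
def estimate_genome_size(histogram, peak_depth, min_depth=1):
    """Estimate genome size from a k-mer depth histogram.

    histogram  -- dict mapping depth -> number of distinct k-mers at that depth
    peak_depth -- depth of the main (homozygous) peak, supplied by the caller
    min_depth  -- depths below this are treated as sequencing errors (assumption)
    """
    if peak_depth <= 0:
        raise ValueError("peak_depth must be positive")
    # Total k-mer occurrences = sum over depths of depth * distinct k-mers.
    total_kmers = sum(d * n for d, n in histogram.items() if d >= min_depth)
    return total_kmers / peak_depth


# Toy histogram (hypothetical values, not data from the study).
toy_histogram = {1: 1000, 10: 50, 20: 400, 40: 30}
print(estimate_genome_size(toy_histogram, peak_depth=20, min_depth=5))  # 485.0
```

In the toy data the depth-1 k-mers are discarded as likely errors, and the k-mers at depth 40 count twice as heavily as those at the main peak, which is how repeated sequence contributes to the estimate.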
Argopecten irradians genome assembly statistics
| Index | Scaffold size (bp) | Number |
|---|---|---|
| N90 | 1,226 | 120,183 |
| N50 | 6,836 | 21,745 |
| Average length | 3,222 | - |
| Total length | 700,343,378 | - |
| >10k | - | 12,892 |
| 5k-10k | - | 19,421 |
| 2k-5k | - | 51,199 |
| 1k-2k | - | 54,292 |
| <1k | - | 79,506 |
| Total number | - | 217,310 |
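The N50 and N90 rows follow the usual assembly convention: sort scaffolds from longest to shortest and report the length at which the running total first reaches 50% or 90% of the assembly. The Number column appears to be the count of scaffolds needed to reach that point. A minimal sketch of that convention follows; `nxx` is a hypothetical helper name and the scaffold lengths are toy values, not the assembly itself.

```python
def nxx(lengths, fraction=0.5):
    """Return (Nxx length, number of scaffolds needed) for scaffold lengths.

    fraction=0.5 gives N50, fraction=0.9 gives N90.
    """
    if not lengths:
        raise ValueError("no scaffold lengths given")
    ordered = sorted(lengths, reverse=True)
    threshold = fraction * sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= threshold:
            return length, count


# Toy scaffold lengths (hypothetical), total = 30.
toy = [10, 8, 6, 4, 2]
print(nxx(toy, 0.5))  # (8, 2)
print(nxx(toy, 0.9))  # (4, 4)
```

As a quick check on the table itself, 700,343,378 bp divided by 217,310 scaffolds is about 3,222.8 bp, consistent with the reported average length of 3,222 bp.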
Figure 3. Carotenoid contents of adductor muscles from white-adductor, orange-adductor, and market specimens of Argopecten irradians. The carotenoid contents are presented as absorption at 455 nm.
Figure 4. Difference in single nucleotide polymorphism (SNP) allele frequencies between orange- and white-adductor progenies. Blue circles represent carotenoid accumulation-associated SNPs with (1) allele frequency differences of >0.5 (red dashed line) and (2) major allele frequencies of ≥0.9. All the SNPs were genotyped in ≥10 individuals from each of the two subgroups.
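The three criteria in the Figure 4 caption can be read as a per-SNP filter, sketched below. This is an illustration rather than the authors' pipeline: the genotype encoding (alternate-allele counts 0, 1, 2, with `None` for missing calls), the function names, and the reading of "major allele frequency ≥ 0.9" as applying to either subgroup are assumptions.

```python
def alt_allele_freq(genotypes):
    """Return (alt-allele frequency, number of called individuals).

    Genotypes are alt-allele counts (0, 1, 2) or None when missing.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        return None, 0
    return sum(called) / (2 * len(called)), len(called)


def is_associated(orange, white, min_called=10, min_diff=0.5, min_major=0.9):
    """Apply the Figure 4 thresholds to one SNP."""
    f_orange, n_orange = alt_allele_freq(orange)
    f_white, n_white = alt_allele_freq(white)
    # The SNP must be genotyped in at least min_called individuals per subgroup.
    if n_orange < min_called or n_white < min_called:
        return False
    diff = abs(f_orange - f_white)
    # Assumption: the major-allele threshold may be met in either subgroup.
    major = max(f_orange, 1 - f_orange, f_white, 1 - f_white)
    return diff > min_diff and major >= min_major


# Hypothetical genotypes for 20 orange and 20 white progeny.
linked_orange = [2] * 20             # fixed for one allele
linked_white = [1] * 13 + [0] * 7    # mix of heterozygotes and other homozygotes
print(is_associated(linked_orange, linked_white))  # True
```

In this toy case the orange subgroup has an allele frequency of 1.0 and the white subgroup 0.325, so both the difference threshold and the major-allele threshold are met.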
Enrichment of carotenoid accumulation-associated SNPs in the Argopecten irradians draft genome
| Scaffolds | Length (bp) | Identified SNPs | Associated SNPs | P value* |
|---|---|---|---|---|
| scaffold122141 | 40,093 | 7 | 4 | 8.56E-06 |
| scaffold126109 | 67,755 | 3 | 3 | 3.10E-05 |
| scaffold145003 | 106,655 | 5 | 3 | 3.05E-04 |
| scaffold153345 | 24,657 | 5 | 5 | 2.01E-09 |
| scaffold1582 | 22,190 | 3 | 2 | 1.13E-02 |
| scaffold216855 | 22,151 | 3 | 3 | 3.10E-05 |
| scaffold27933 | 188,857 | 20 | 10 | 1.00E-14 |
| scaffold2851 | 49,116 | 5 | 4 | 1.24E-06 |
| scaffold40958 | 9,353 | 4 | 4 | 2.51E-07 |
| scaffold42737 | 11,692 | 3 | 3 | 3.10E-05 |
| scaffold43946 | 88,867 | 3 | 2 | 1.13E-02 |
| scaffold44628 | 128,299 | 2 | 2 | 3.81E-03 |
| scaffold50863 | 78,113 | 4 | 2 | 2.25E-02 |
| scaffold52582 | 35,812 | 2 | 2 | 3.81E-03 |
| scaffold52718 | 42,393 | 2 | 2 | 3.81E-03 |
| scaffold545 | 43,343 | 2 | 2 | 3.81E-03 |
| scaffold56532 | 7,451 | 3 | 2 | 1.13E-02 |
| scaffold5873 | 86,577 | 6 | 6 | 1.60E-11 |
| scaffold62479 | 6,552 | 2 | 2 | 3.81E-03 |
| scaffold63730 | 36,866 | 2 | 2 | 3.81E-03 |
| scaffold67078 | 42,696 | 3 | 2 | 1.13E-02 |
| scaffold73425 | 159,700 | 9 | 4 | 3.03E-05 |
| scaffold79588 | 10,069 | 2 | 2 | 3.81E-03 |
| scaffold81037 | 18,305 | 4 | 4 | 2.51E-07 |
| scaffold8210 | 18,353 | 2 | 2 | 3.81E-03 |
| scaffold83726 | 71,274 | 3 | 3 | 3.10E-05 |
| scaffold86529 | 16,314 | 2 | 2 | 3.81E-03 |
| scaffold95690 | 126,536 | 7 | 4 | 8.56E-06 |
* P values were adjusted using the fdr method in R.
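The footnote names only the adjustment step; R's `p.adjust(method = "fdr")` is the Benjamini-Hochberg procedure. The record does not say which test produced the per-scaffold P values, so the sketch below assumes a one-sided hypergeometric test against the genome-wide totals from the abstract (126 associated SNPs among 9,280 genotyped). The scaffold names and counts in the usage example are hypothetical, and because the number of scaffolds tested is not given, the output is not expected to reproduce the table.

```python
from math import comb


def hypergeom_tail(total, total_assoc, drawn, drawn_assoc):
    """P(X >= drawn_assoc) when `drawn` SNPs are sampled without replacement
    from `total` SNPs, of which `total_assoc` are associated."""
    upper = min(drawn, total_assoc)
    favourable = sum(
        comb(total_assoc, k) * comb(total - total_assoc, drawn - k)
        for k in range(drawn_assoc, upper + 1)
    )
    return favourable / comb(total, drawn)


def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(pvalues)
    # Walk from the largest P value down, enforcing monotonicity.
    order = sorted(range(m), key=lambda i: pvalues[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for offset, i in enumerate(order):
        rank = m - offset  # 1-based rank in ascending order
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Genome-wide counts from the abstract.
TOTAL_SNPS, TOTAL_ASSOCIATED = 9280, 126

# Hypothetical scaffolds: name -> (identified SNPs, associated SNPs).
scaffolds = {"scaffold_a": (2, 2), "scaffold_b": (5, 5), "scaffold_c": (4, 1)}
raw = {
    name: hypergeom_tail(TOTAL_SNPS, TOTAL_ASSOCIATED, n, k)
    for name, (n, k) in scaffolds.items()
}
adjusted = dict(zip(raw, bh_adjust(list(raw.values()))))
for name in scaffolds:
    print(name, f"raw={raw[name]:.3g}", f"adjusted={adjusted[name]:.3g}")
```

Exact integer arithmetic via `math.comb` avoids underflow for the very small tail probabilities that arise when every SNP on a scaffold is associated.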