| Literature DB >> 35260071 |
Sarah Farhat1, Eric Bonnivard2, Emmanuelle Pales Espinosa1, Arnaud Tanguy2, Isabelle Boutet2, Nadège Guiglielmoni3, Jean-François Flot3,4, Bassem Allam5.
Abstract
BACKGROUND: The hard clam Mercenaria mercenaria is a major marine resource along the Atlantic coasts of North America and has been introduced to other continents for resource restoration or aquaculture activities. Significant mortality events have been reported in the species throughout its native range as a result of diseases (microbial infections, leukemia) and acute environmental stress. In this context, the characterization of the hard clam genome can provide highly needed resources to enable basic (e.g., oncogenesis and cancer transmission, adaptation biology) and applied (clam stock enhancement, genomic selection) sciences.Entities:
Keywords: Gene duplication; Genome; Hard clam; Repeats; Steamer elements
Mesh:
Substances:
Year: 2022 PMID: 35260071 PMCID: PMC8905726 DOI: 10.1186/s12864-021-08262-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Contact map of the Mercenaria mercenaria genome assembly. Map generated from Hi-C data showing sequences interaction points in chromosomes (red dots). The color bar indicates the density of contact. The associated karyotype (from Wang and Guo, 2007 [26]) is shown under the plot
Fig. 2Dotplot contrasting the chromosomes from the two Mercenaria mercenaria genome assemblies. All x-axis represents the nucleotides number (in Mb) of M. mercenaria chromosomes (generated in this study) and all y-axis represents the nucleotides of YKG chromosomes [24]. A red point represents a forward match and a blue point a reverse match
Repeated sequences in Mercenaria mercenaria
| Genomic content (%) | Nb of copies | Nb of base pairs | Genomic content (%) | Nb of copies | Nb of base pairs | |
|---|---|---|---|---|---|---|
| 1.98 | 213,809 | 36,728,732 | 1.54 | 179,455 | 27,466,966 | |
| 0.50 | 11,273 | 9,261,821 | 0.36 | 7596 | 6,410,976 | |
| 1.56 | 26,146 | 28,997,539 | 1.41 | 22,581 | 25,216,806 | |
| 0.86 | 12,800 | 15,956,581 | 0.74 | 11,182 | 13,148,040 | |
| 0.57 | 7243 | 10,588,945 | 0.59 | 7013 | 10,529,781 | |
| 0.28 | 4000 | 5,126,704 | 0.32 | 4497 | 5,795,447 | |
| 0.01 | 153 | 168,036 | 0.01 | 164 | 226,620 | |
| 0.01 | 119 | 113,581 | 0.01 | 185 | 172,540 | |
| 0.00 | 112 | 80,085 | 0.00 | 0 | 0 | |
| 0.03 | 579 | 573,523 | 0.10 | 1558 | 1,708,752 | |
| 3.87 | 63,016 | 71,828,997 | 4.27 | 66,510 | 76,423,586 | |
| 0.36 | 5102 | 6,644,022 | 0.55 | 9351 | 9,821,340 | |
| 0.07 | 1425 | 1,289,690 | 0.13 | 2022 | 2,285,303 | |
| 0.08 | 2128 | 1,535,747 | 0.06 | 1277 | 988,401 | |
| 0.00 | 0 | 0 | 0.05 | 742 | 928,874 | |
| 0.14 | 1741 | 2,664,318 | 0.18 | 1801 | 3,287,176 | |
| 0.07 | 1136 | 1,363,973 | 0.10 | 1082 | 1,796,886 | |
| 1.69 | 7091 | 31,368,848 | 1.93 | 6453 | 34,529,082 | |
| 0.90 | 22,496 | 16,783,981 | 0.77 | 18,042 | 13,743,894 | |
| 0.62 | 13,852 | 11,531,215 | 0.71 | 15,005 | 12,617,816 | |
| 0.31 | 10,651 | 5,838,880 | 0.20 | 7691 | 3,611,369 | |
| 0.28 | 9809 | 5,274,930 | 0.46 | 13,654 | 8,303,711 | |
| 0.27 | 2833 | 5,097,100 | 0.31 | 3377 | 5,558,034 | |
| 0.13 | 2744 | 2,455,561 | 0.15 | 4019 | 2,760,579 | |
| 0.10 | 1838 | 1,822,048 | 0.04 | 895 | 662,200 | |
| 0.08 | 2470 | 1,526,495 | 0.11 | 3294 | 1,934,619 | |
| 0.00 | 0 | 0 | 0.01 | 228 | 181,113 | |
| 1.35 | 32,159 | 25,013,810 | 1.59 | 37,236 | 28,495,466 | |
| 4.81 | 389,027 | 89,393,728 | 4.06 | 374,505 | 72,579,061 | |
| 0.19 | 16,252 | 3,490,319 | 0.37 | 26,862 | 6,551,201 | |
| 10.40 | 645,877 | 193,200,141 | 10.68 | 623,224 | 190,956,718 | |
| 0.95 | 422,909 | 17,640,334 | 0.96 | 399,918 | 17,236,842 | |
| 0.17 | 65,697 | 3,083,823 | 0.16 | 58,512 | 2,789,745 | |
Copy number (Nb) and genomic content of the repeated sequences in M. mercenaria genomes generated following the annotation pipeline implemented in this study
Fig. 3Phylogenetic tree of Mollusca species. Tree of Mollusca species considered in this study based on single copy gene clusters (14 OGs). The tree scale is 0.1 and the bootstrap is represented at each node
Fig. 4Phylogenetic tree of TNF domains. TNF domains from all identified genes were extracted and used to generate a multiple alignment and a tree using MAFFT. The tree was drawn using iTOL. Colored branches represent the different orthogroups (the colors correspond to those of Table S8. Label colors also represent OG but highlighting only genes belonging to our Mercenaria mercenaria assembly
Fig. 5C1qDC proteins in Mercenaria mercenaria. Schematic structural domain representation of putative C1qDC proteins from M. mercenaria
Fig. 6Phylogenetic relationships of Steamer retrotransposons and their close groups. This tree is based on Neighbor-Joining analysis [33] of RT/RNaseH domain amino acid sequences. Node statistical support values come from non-parametric bootstrapping using 100 replicates and only those > 75% are shown. The three C-clade branches are indicated in color. Sequences from Lottia gigantea (LG), Pinctada fucata (PF) and Crassostrea gigas (CG) are included as known references of C9 groups [32], and SURL elements included as outgroup. Sequences obtained from TBLASTN searches are labelled by their accession number. For Steamer elements members of different taxa are color coded as shown. Givri: Gigantidas vrijenhoeki; Bapla: Bathymodiolus platifrons; Mygal: Mytilus galloprovincialis; Mycor: Mytilus coruscus; Mophi: Modiolus philippinarum; Lifor: Limnoperna fortunei; Myedu: Mytilus edulis; Anbro: Anadara broughtonii; Memer: Mercenaria mercenaria; Drros: Dreissena rostriformis; Cysin: Cyclina sinensis; Ruphi: Ruditapes philippinarum; Pemax: Pecten maximus; Cravi: Crassostrea virginica; Cragi: Crassostrea gigas; Saglo: Saccostrea glomerata
Steamer elements subfamilies found across Bivalvia
| Host species | Family | Number of copies | Genomic proportion (%) | Base pair masked in genome | Average length | |||
|---|---|---|---|---|---|---|---|---|
| LTRH | RM > 4 kb | RM > 2 kb * | RM | |||||
| SteMemer | 7 | 7 | 8 | 156 | 0.01 | 148,438 | 951 | |
| 1858 Mb | ||||||||
| SteMemer | 5 | 6 | 7 | 166 | 0.01 | 148,344 | 894 | |
| 1788 Mb | ||||||||
| SteRuphi | 0 | 0 | 3 | 22 | 0.00 | 21,751 | 989 | |
| 1123 Mb | ||||||||
| SteCysin | 2 | 2 | 1 | 64 | 0.01 | 54,243 | 848 | |
| 903 Mb | ||||||||
| SteDrros | 5 | 5 | 5 | 65 | 0.01 | 78,925 | 1214 | |
| 1242 Mb | ||||||||
| SteAnbro1 | 5 | 5 | 0 | 176 | 0.01 | 104,324 | 593 | |
| 885 Mb | SteAnbro2 | 5 | 5 | 5 | 153 | 0.01 | 114,234 | 747 |
| SteAnbro3 | 5 | 7 | 15 | 216 | 0.02 | 177,833 | 823 | |
| SteCragi | 1 | 1 | 10 | 99 | 0.01 | 96,188 | 971 | |
| 648 Mb | ||||||||
| SteCavi | 0 | 0 | 1 | 70 | 0.01 | 42,104 | 601 | |
| 685 Mb | ||||||||
| SteSaglo | 4 | 5 | 9 | 284 | 0.03 | 203,715 | 717 | |
| 788 Mb | ||||||||
| SteBapla | 3 | 3 | 8 | 216 | 0.01 | 168,122 | 778 | |
| 1658 Mb | ||||||||
| SteMophi | 2 | 2 | 13 | 244 | 0.01 | 195,023 | 799 | |
| 2630 Mb | ||||||||
| SteLifor | 1 | 5 | 19 | 142 | 0.01 | 161,653 | 1138 | |
| 1673 Mb | ||||||||
| SteMycor1 | 2 | 4 | 3 | 96 | 0.00 | 93,720 | 830 | |
| 1904 Mb | SteMycor2 | 9 | 11 | 6 | 265 | 0.01 | 256,509 | 968 |
| SteMygal1 | 1 | 1 | 10 | 81 | 0.01 | 92,780 | 1145 | |
| 1282 Mb | ||||||||
Number of copies and genomic proportions of Steamer elements subfamilies were estimated in bivalve genomes using LTRHarvest (LTRH) or RepeatMasker (RM). Copy numbers are given according to the estimation procedure. The star indicates between 2 and 4 kb