| Literature DB >> 27507266 |
Alyssa Mitchell1, Davide Guerra1, Donald Stewart2, Sophie Breton3.
Abstract
BACKGROUND: Many species of bivalves exhibit a unique system of mtDNA transmission named Doubly Uniparental Inheritance (DUI). Under this system, species have two distinct, sex-linked mitochondrial genomes: the M-type mtDNA, which is transmitted by males to male offspring and found in spermatozoa, and the F-type mtDNA, which is transmitted by females to all offspring, and found in all tissues of females and in somatic tissues of males. Bivalves with DUI also have sex-specific mitochondrial ORFan genes, (M-orf in the M mtDNA, F-orf in the F mtDNA), which are open reading frames having no detectable homology and no known function. DUI ORFan proteins have previously been characterized in silico in a taxonomically broad array of bivalves including four mytiloid, one veneroid and one unionoid species. However, the large evolutionary distance among these taxa prevented a meaningful comparison of ORFan properties among these divergent lineages. The present in silico study focuses on a suite of more closely-related Unionoid freshwater mussel species to provide more reliably interpretable information on patterns of conservation and properties of DUI ORFans. Unionoid species typically have separate sexes, but hermaphroditism also occurs, and hermaphroditic species lack the M-type mtDNA and possess a highly mutated version of the F-orf in their maternally transmitted mtDNA (named H-orf in these taxa). In this study, H-orfs and their respective proteins are analysed for the first time.Entities:
Keywords: Bivalvia; Doubly uniparental inheritance of mitochondria; Mitochondrial DNA; Mitochondrial ORFans; Mitochondrial inheritance; Unionoida
Mesh:
Substances:
Year: 2016 PMID: 27507266 PMCID: PMC4979158 DOI: 10.1186/s12864-016-2986-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Simplified phylogeny of some gonochoric and hermaphroditic unionoid bivalves redrawn after Breton et al. [16]. The presence of DUI (Doubly Uniparental Inheritance) with F- and M-type mtDNAs in gonochoric species is indicated in black, whereas hermaphroditism with SMI (Strict Maternal Inheritance) is indicated in red. Species in black have F-ORFs containing one predicted transmembrane (TM) helix in their N-terminal portion, whereas hermaphroditic species have macromutated H-ORFs with repeat units and sometimes more than one predicted TM helix [16]
Sequences analyzed in the present study for gonochoric species with DUI and hermaphroditic species with SMI
| Species | mtDNA type | Accession number | ORF names |
|---|---|---|---|
| Subfamiliy Ambleminae | |||
|
| M | FJ809751.1 | Qqu-M |
| M | FJ809751.1 | Qqu-M | |
| M | FJ809751.1 | Qqu-M | |
| F | FJ809750.1 | Qqu-F | |
| F | FJ809750.1 | Qqu-F | |
| F | FJ809750.1 | Qqu-F | |
|
| F | HM849457.1 | Tli-F |
|
| H | KU728097 | Tpa-H |
|
| M | FJ809752.1 | Vel-M |
| M | FJ809752.1 | Vel-M | |
| M | FJ809752.1 | Vel-M | |
| F | FJ809753.1 | Vel-F | |
| F | FJ809753.1 | Vel-F | |
| F | FJ809753.1 | Vel-F | |
| Subfamiliy Anodontinae | |||
|
| M | KF030962.1 | Aan-M |
| F | KF030964.1 | Aan-F | |
| Subfamiliy Gonideinae | |||
|
| M | AB055624.1 | Ija-M |
| M | AB055624.1 | Ija-M | |
| M | AB055624.1 | Ija-M | |
| F | AB055625.1 | Ija-F | |
| F | AB055625.1 | Ija-F | |
| F | AB055625.1 | Ija-F | |
|
| M | KC848655.1 | Sca-M |
| M | KC848655.1 | Sca-M | |
| M | KC848655.1 | Sca-M | |
| F | KC848654.1 | Sca-F | |
| F | KC848654.1 | Sca-F | |
| F | KC848654.1 | Sca-F | |
| Subfamiliy Hyriidae | |||
|
| M | KU728093 | Hme-M |
| M | KU728094 | Hme-M | |
| F | KU728092 | Hme-F | |
| F | AY785394.1 | Hme-F | |
| Subfamiliy Margaritiferinae | |||
|
| M | KU728095 | Cmo-M |
| M | KU728096 | Cmo-M | |
| F | HM849375.1 | Cmo-F | |
| F | KF647374.1 | Cmo-F | |
|
| H | HM849545.1 | Mfa-H |
| H | HM856634.1 | ||
| H | HM849547.1 | ||
| H | HM849548.1 | ||
| H | HM856634.1 | Mfa-H | |
| H | NC_015476.1 | ||
|
| F | HM849399.1 | Mma-F |
| F | HM849095.1 | Mma-F | |
| Subfamiliy Unioninae | |||
|
| F | HM849393.1 | Lco-F |
|
| H | HM849534.1 | Lco-H |
| H | HM849535.1 | ||
| H | HM856638.1 | Lco-H | |
| H | NC_015481.1 | ||
|
| H | HM849542.1 | Lsu-H |
| H | HM849543.1 | ||
|
| M | FJ809755.1 | Pgr-M |
| M | FJ809755.1 | Pgr-M | |
| M | FJ809755.1 | Pgr-M | |
| F | FJ809754.1 | Pgr-F | |
| F | FJ809754.1 | Pgr-F | |
| F | FJ809754.1 | Pgr-F | |
|
| H | HM849591.1 | Uim-H |
| H | HM849595.1 | ||
| H | HM849594.1 | ||
| H | HM849601.1 | ||
| H | HM849606.1 | ||
| H | HM849597.1 | ||
| H | HM849584.1 | ||
| H | NC_015479 | Uim-H | |
| H | HM856637.1 | ||
|
| M | HM856635.1 | Upe-M |
| M | HM856635.1 | Upe-M | |
| M | HM856635.1 | Upe-M | |
| F | HM856636.1 | Upe-F | |
| F | HM856636.1 | Upe-F | |
| F | HM856636.1 | Upe-F | |
Note: M M mtDNA in a DUI gonochoric breeding system, F F mtDNA in a DUI gonochoric breeding system, H H mtDNA in a non-DUI hermaphroditic breeding system
Fig. 2Alignments of M-ORF protein sequences. Global alignments and alignments for each subfamily are shown. a All M-ORF sequences, b M-ORFs from the subfamily Unioninae, c M-ORFs from the subfamily Gonideinae, d M-ORFs from the subfamily Ambleminae. Colour coding is applied to amino acid groups conserved in ≥70 % of sequences. Grey, aliphatic amino acids; orange, aromatic amino acids; yellow, sulfur amino acids; green, amino acids bearing a hydroxyl group; red, basic amino acids; blue, acidic amino acids; brown, amino acids with an amide group; pink, cyclic amino acids
Fig. 3Alignments of F-ORF protein sequences. Global alignments and alignments for each subfamily are shown. a all F-ORF sequences, b F-ORFs from the subfamily Unioninae, c F-ORFs from the subfamily Gonideinae, d F-ORFs from the subfamily Ambleminae (e) F-ORFs from the subfamily Margaritiferidae. Colour coding is applied to amino acid groups conserved in ≥70 % of sequences. Grey, aliphatic amino acids; orange, aromatic amino acids; yellow, sulfur amino acids; green, amino acids bearing a hydroxyl group; red, basic amino acids; blue, acidic amino acids; brown, amino acids with an amide group; pink, cyclic amino acids
p-distances (p-D) and standard error (SE) values for mitochondrial M-orfs, F-orfs, cox1 and atp8 in freshwater mussel subfamilies
| Subfamily | Gene (N) | Nucleotide | Amino acid |
| ||
|---|---|---|---|---|---|---|
| p-D | SE | p-D | SE | |||
| Unioninae | F- | 0.355 | 0.023 | 0.467 | 0.047 | 1.000 |
| F- | 0.103 | 0.007 | 0.014 | 0.005 | 1.000 | |
| F- | 0.300 | 0.011 | 0.333 | 0.015 | 1.000 | |
| M- | 0.350 | 0.018 | 0.502 | 0.034 | 1.000 | |
| M- | 0.165 | 0.010 | 0.094 | 0.012 | 1.000 | |
| M- | 0.250 | 0.010 | 0.267 | 0.013 | 1.000 | |
| Gonideinae | F- | 0.469 | 0.033 | 0.692 | 0.058 | 1.000 |
| F- | 0.132 | 0.008 | 0.033 | 0.008 | 1.000 | |
| F- | 0.400 | 0.025 | 0.222 | 0.010 | 1.000 | |
| M- | 0.384 | 0.025 | 0.552 | 0.044 | 1.000 | |
| M- | 0.175 | 0.009 | 0.130 | 0.015 | 1.000 | |
| M- | 0.301 | 0.019 | 0.421 | 0.039 | 1.000 | |
| Ambleminae | F- | 0.351 | 0.024 | 0.508 | 0.041 | 1.000 |
| F- | 0.128 | 0.009 | 0.033 | 0.007 | 1.000 | |
| F- | 0.278 | 0.018 | 0.370 | 0.031 | 1.000 | |
| M- | 0.421 | 0.027 | 0.687 | 0.047 | 1.000 | |
| M- | 0.179 | 0.010 | 0.145 | 0.015 | 1.000 | |
| M- | 0.211 | 0.012 | 0.233 | 0.017 | 1.000 | |
| Margaritiferinae | F- | 0.393 | 0.029 | 0.705 | 0.050 | 1.000 |
| F- | 0.164 | 0.009 | 0.068 | 0.009 | 1.000 | |
Note: N number of sequences used. The probability of rejecting the null hypothesis of strict-neutrality (d N = d S) in favor of the alternative hypothesis (d N > d S) (in the p column) is shown. d S and d N are the numbers of synonymous and nonsynonymous substitutions per site, respectively
p-distances (p-D) and standard error (SE) values of mitochondrial H-orfs and cox1 in hermaphroditic freshwater mussels
| Species | Gene (N) | Nucleotide | Amino acid |
| ||
|---|---|---|---|---|---|---|
| p-D | SE | p-D | SE | |||
|
| H- | 0.070 | 0.008 | 0.181 | 0.022 | 1.000 |
|
| 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | |
|
| H- | 0.003 | 0.002 | 0.004 | 0.004 | 1.000 |
|
| 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | |
|
| H- | 0.029 | 0.007 | 0.065 | 0.017 | 1.000 |
|
| 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | |
|
| H- | 0.016 | 0.005 | 0.021 | 0.010 | 1.000 |
Note: N number of sequences used. Multiple cox1 sequences were not available for L. subviridis. The probability of rejecting the null hypothesis of strict-neutrality (d N = d S) in favor of the alternative hypothesis (d N > d S) (in the p column) is shown. d S and d N are the numbers of synonymous and nonsynonymous substitutions per site, respectively
p-distances (p-D) and standard error (SE) values of mitochondrial F-orfs vs H-orfs and Fcox1 vs Hcox1 in comparisons between gonochoric vs. closely related hermaphroditic freshwater mussel species
| Species | Genes | Nucleotide | Amino acid | ||
|---|---|---|---|---|---|
| p-D | SE | p-D | SE | ||
|
| |||||
| F | 0.338 | 0.034 | 0.691 | 0.055 | |
| F | 0.310 | 0.032 | 0.721 | 0.054 | |
| F | 0.343 | 0.031 | 0.743 | 0.051 | |
| F | 0.335 | 0.034 | 0.729 | 0.054 | |
| F | 0.333 | 0.031 | 0.714 | 0.052 | |
| F | 0.333 | 0.031 | 0.714 | 0.052 | |
| F | 0.310 | 0.030 | 0.739 | 0.055 | |
|
|
|
|
|
| |
| F | 0.547 | 0.012 | 0.020 | 0.006 | |
| F | 0.547 | 0.012 | 0.020 | 0.006 | |
|
|
|
|
|
| |
|
| |||||
| F | 0.339 | 0.025 | 0.491 | 0.048 | |
| F | 0.336 | 0.026 | 0.491 | 0.049 | |
| F | 0.358 | 0.024 | 0.491 | 0.049 | |
| F | 0.336 | 0.026 | 0.491 | 0.049 | |
|
|
|
|
|
| |
| F | 0.469 | 0.022 | 0.000 | 0.000 | |
| F | 0.469 | 0.021 | 0.000 | 0.000 | |
|
|
|
|
|
| |
|
| |||||
| F | 0.218 | 0.028 | 0.394 | 0.059 | |
| F | 0.255 | 0.027 | 0.395 | 0.055 | |
|
|
|
|
|
| |
|
| |||||
| F | 0.269 | 0.029 | 0.429 | 0.054 | |
| F | 0.295 | 0.029 | 0.442 | 0.055 | |
|
|
|
|
|
| |
|
| F | 0.443 | 0.027 | 0.736 | 0.044 |
Note: Bold numbers indicate mean values
Fig. 4Hydrophobicity profiles of M-ORFs. Boxes indicate predicted TM helices, arrowheads indicate the end of predicted SPs. X-axis is amino acid position, Y-axis is hydrophobicity. Aan, Anodonta anatina; Cmo, Cumberlandia monodonta; Hme, Hyridella menziesii
Fig. 5Hydrophobicity profiles of F-ORFs (top) and H-ORFs vs. F-ORFs (bottom). Boxes indicate predicted TM helices, arrowheads indicate the end of predicted SPs. X-axis is amino acid position, Y-axis is hydrophobicity. Aan, Anodonta anatina; Cmo, Cumberlandia monodonta; Hme, Hyridella menziesii; Lco-HORFs, Lasmigona compressa H-ORFs; Lsu-HORFs, Lasmigona subviridis H-ORFs. For hermaphroditic species, only sequences with different hydrophobicity profiles are shown
Fig. 6Position of motifs frequently recurring in HHpred hits. Protein length in amino acids is indicated in parentheses. One representative sequence was chosen for each hermaphroditic species
Summary of hits to ligand-binding sites in M-ORFs, F-ORFs and H-ORFs
| Protein | DNA | RNA | Protein | Carbohydrate | Ion | Lipid | ATP |
|---|---|---|---|---|---|---|---|
| Vel-MORF | X | X | X | X | X | X | |
| Qqu-MORF | X | X | X | X | |||
| Pgr-MORF | X | X | X | X | X | X | |
| Ija-MORF | X | X | X | X | X | ||
| Upe-MORF | X | X | X | X | X | ||
| Sca-MORF | X | X | X | X | X | ||
| Cmo-MORF | X | X | X | X | X | ||
| Hme-MORF | X | X | X | X | X | X | |
| Aan-MORF | X | X | X | X | X | ||
|
|
|
|
|
|
|
|
|
| Vel-FORF | X | X | X | X | X | X | |
| Qqu-FORF | X | X | X | X | X | X | X |
| Pgr-FORF | X | X | X | X | X | X | |
| Ija-FORF | X | X | X | X | X | ||
| Upe-FORF | X | X | X | X | X | ||
| Sca-FORF | X | X | X | X | X | X | |
| Cmo-FORF | X | X | X | ||||
| Hme-FORF | X | X | X | X | X | ||
| Lco-FORF | X | X | X | X | X | X | |
| Tli-FORF | X | X | X | X | X | X | |
| Mma-FORF | X | X | X | X | |||
| Aan-FORF | X | X | X | X | X | ||
|
|
|
|
|
|
|
|
|
| Uim-HORF1 - 3 | X | X | X | X | X | X | |
| Uim-HORF4 - 7 | X | X | X | X | X | ||
| Mma-HORF1, 2, 4 | X | X | X | X | X | ||
| Mma-HORF3 | X | X | X | X | |||
| Tpa-HORF | X | X | X | X | X | X | X |
| Lco-HORF1 | X | X | X | X | |||
| Lco-HORF2 | X | X | X | X | X | X | |
| Lsu-HORF1 - 2 | X | X | X | X | X | ||
|
|
|
|
|
|
|
|
|
Note: Bold numbers indicate mean values
Hits to viral proteins from structural prediction analyses
| Gene | Hit | Function | Position |
|---|---|---|---|
| Aan-MORF | Nucleoprotein, | Nucleoprotein | NA |
| Regulatory protein MNT, | Gene regulation | NA | |
| Upe-MORF | Uncharacterized protein 56B, | Transcription repressor | NA |
| Pgr-MORF | Matrix protein 1, | Matrix protein | NA |
| Helix-destabilizing protein, | DNA binding protein | NA | |
| Ija-MORF | Nonstructural protein 5A, | Membrane protein | NA |
| Functional anti-apoptotic factor vBCL-2 homolog, | Apoptosis | NA | |
| Sca-MORF | Nonstructural protein 5A, | Membrane protein | NA |
| Vel-MORF | Macrophage galactose N-acetyl-galactosamine specific lectin 2 [Hhpred; 93.40] | C-type lectin | 20–171 |
| RhUL123, | Viral life cycle | NA | |
| Phosphoprotein, | Unknown function | NA | |
| Tail needle protein gp26, | Fibrous protein | NA | |
| Qqu-MORF | Virion RNA polymerase, | Transferase | NA |
| Cmo-MORF | No hits to viral proteins | ||
| Hme-MORF | Nucleoprotein, | Nucleoprotein | NA |
| Aan-FORF | No hits to viral proteins | ||
| Upe-FORF | BM2 protein, | Transport protein | NA |
| Pgr-FORF | V-cyclin, | Cell cycle | NA |
| Lco-FORF | Herpes simplex virus protein ICP47, | Membrane protein | NA |
| Ija-FORF | Non-structural RNA-binding protein 34, | Translation | NA |
| Sca-FORF | Major capsid protein (protein P3), | Capsid protein | NA |
| Tli-FORF | Envelope protein E, | Envelope protein | NA |
| Vel-FORF | V1V2 region of HIV-1 on 1FD6 scaffold, | Immune system | NA |
| Qqu-FORF | HIV-1 matrix protein, | Matrix protein | NA |
| Mma-FORF | ODV-E18: Occlusion-derived virus envelope protein ODV-E18 (2) [Hhpred; 72.05, 62.79] | Envelope protein | 21–62 |
| Adenovirus fibre, | Fibre protein | 23–55 | |
| Fibre protein 2 (receptor-binding domain), | Fibre protein, receptor binding | NA | |
| NA | |||
| Cmo-FORF | Virus attachment protein globular domain (49835) SCOP seed sequence: d1h7za [Hhpred; 21.78] | Viral attachment, entry into host cell | 50–68 |
| Adenovirus fibre protein; cell receptor recognition, receptor, | Fibre protein, Cell receptor recognition | 44–68 | |
| Fibre protein, | NA | ||
| Fibre protein, | NA | ||
| Type 5 fibre protein, | NA | ||
| Fibre protein, | NA | ||
| Hme-FORF | Nucleoprotein, | RNA binding protein | NA |
| Uim-HORFs | HIV-1 capsid, | Capsid protein | NA |
| Gag Polyprotein, | Precursor protein | NA | |
| Capsid protein P24, | Capsid protein | NA | |
| Nucleoprotein, | Nucleoprotein | NA | |
| Protein ICP47, | Membrane protein | NA | |
| LdOrf-129 peptide, | Structual protein | 74–144 | |
| ORF-132 protein, | Unknown | 74–131 | |
| orf-126 protein, | Unknown | 72–140 | |
| Central variable region protein, | Unknown | 60–154 | |
| Central variable region protein, | Unknown | 60–130 | |
| pB602L, | Structural capsid protein, chaperone in capsid assembly | 65–153 | |
| U1, | Spliceosomal RNA | 65–137 | |
| gp7, | DNA transfer protein | NA | |
| Long tail fibre protein p37, | Fibre protein | 88–166 | |
| RhUL123, | Viral life cycle | NA | |
| Nucleoprotein, | Nucleoporin | NA | |
| LdOrf-129 peptide, | Structual protein | NA | |
| ORF-132 protein, | Unknown | NA | |
| DNA stabilization protein, | DNA binding & stabilization | 87–188 | |
| Hexon protein, | Major coat protein | 139–223 | |
| Human T-cell leukemia virus type II matrix protein, | Matrix protein | NA | |
| Herpes simplex virus protein ICP47, | Blocks the major histocompatibility complex class I antigen presentation pathway | NA | |
| Lco-HORFs | Long tail fiber protein P37, | Receptor binding viral protein | NA |
| Capsid protein, | Capsid component | NA | |
| VPU protein, | Regulates degradation of receptor molecule CD4 | NA | |
| Lsu-HORFs | Major capsid protein, | Capsid component | NA |
| RhUL123, | Viral life cycle | 69–195 | |
| Herpes virus major outer envelope glycoprotein (BLLF1) [BLASTP/PSIBLAST; 2.73e-03] | Envelope protein | NA | |
| Short tail fiber protein, | Structural protein | NA | |
| Major capsid protein, | Capsid component | NA | |
| Coat protein, | Coat component | NA | |
| Herpes virus major outer envelope glycoprotein (BLLF1) [BLASTP/PSIBLAST; 4.85e-04] | Envelope protein | NA | |
| Tpa-HORF | VPU protein (Trans-membrane domain), | Regulates degradation of receptor molecule CD4 | NA |
| Mfa-HORFs | ODV-E18: Occlusion-derived virus envelope protein ODV-E18 [Hhpred; 74.97] | Envelope protein | 33–73 |
| Herpes_TK_C: Thymidine kinase from Herpesvirus C-terminal, | ATP binding, thymidine kinase | 33–73 | |
| Adenovirus fibre, | Fibre protein, receptor binding | NA |
Note: I-TASSER: Norm. Z-score > 1 indicates a good alignment; TM-score > 0.5 indicates a similar fold with query [46]; position = amino acid position in the query sequence; NA not applicable
List of BLAST hits for mitochondrial ORFans in freshwater mussels searched against NCBI NRDB mitochondrial proteins
| Species Name | M | F- | H- |
|---|---|---|---|
|
| NAD7 (0.61) | --- | |
| --- |
| ||
|
| ATP8 (0.81) | --- | |
| --- |
| ||
|
| ATP8 (0.61) | NAD2 (0.33) | |
|
|
| ||
|
| --- | ||
|
| |||
|
| F-ORF (4e-05) | ||
|
| |||
|
| F-ORF (6e-09) | ||
|
| |||
|
| |||
|
| ATP8 (0.62) | --- | |
|
|
| ||
|
| |||
|
| |||
|
| COX1 (0.94) | ||
| --- | |||
|
| NAD5 (0.093) | ||
| NAD2 (0.23) | |||
|
| |||
|
| NAD5 (0.046) | --- | |
|
|
| ||
|
| NAD5 (0.026) | NAD5 (0.31) | |
| ATP8 (0.070) |
| ||
|
| |||
|
| COX1 (0.41) | --- | |
| NAD5 (0.99) |
| ||
|
| |||
|
| --- | ||
| --- | |||
|
| F-ORF (0.020) | ||
| --- | |||
|
| --- | ||
|
| |||
|
| NAD5 (0.38) | --- | |
|
|
| ||
|
| NAD4 (0.19) | NAD4 (0.55) | |
| CYTB (0.21) |
| ||
| ATP8 (0.94) | |||
|
|
Note: Protein name and (e-values <1.0) identified using PSI-BLAST and tBLASTx are indicated above in capital letters and below in italics, respectively. Hits to freshwater mussel mitochondrial ORF homologs are not presented, except for the highly divergent H-ORFs
Fig. 7Most frequent categories of hits for biological processes or molecular functions for freshwater mussel mitochondrial ORFans. Categories presented are those returned for over 75 % of all analyzed species for each ‘sex’ (the number of analyzed species for each sex is indicated in parentheses). Blue, M-ORF; pink, F-ORF; green, H-ORF