| Literature DB >> 31053062 |
Han Ming Gan1,2,3,4, Frederic Grandjean5, Tom L Jenkins6, Christopher Mervyn Austin7,8,9,10.
Abstract
BACKGROUND: The recently published complete mitogenome of the European lobster (Homarus gammarus) that was generated using long-range PCR exhibits unusual gene composition (missing nad2) and gene rearrangements among decapod crustaceans with strong implications in crustacean phylogenetics. Such atypical mitochondrial features will benefit greatly from validation with emerging long read sequencing technologies such as Oxford Nanopore that can more accurately identify structural variation.Entities:
Keywords: Homarus gammarus; Lobster; Mitogenome; Nanopore sequencing; PCR bias; Tandem duplication
Mesh:
Substances:
Year: 2019 PMID: 31053062 PMCID: PMC6500004 DOI: 10.1186/s12864-019-5704-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 2Presence of nad2 can be detected in at least five H. gammarus individuals collected from various European regions. a Linear gene organization comparison of the previous reported H. gammarus mitogenome with that reported in this study confirming the absence of nad2 in the original report in addition to substantial structural variation. The direction of arrow indicates transcription orientation. Orange, teal, purple and red arrows correspond to protein-coding genes, putative pseudogenes, rRNAs and tRNAs, respectively. b Circular comparison of the reconstructed mitogenomes showing significant nucleotide alignment across the entire newly assembled H. gammarus mitogenome with minor assembly gaps in the high-AT ribosomal rRNA regions (See Fig. 3e for details). c Sampling map of the fiveH. gammarus individuals with their exact location indicated by a star symbol
Performance of different assemblers in the recovery of Homarus gammarus mitogenome
| Assembler | Contig Length (bait/contigID) | Protein coding gene (number of intact genes) |
|---|---|---|
| MITObim | 14,885 bp (rrnL) | |
| 13,143 bp (cytb) | ||
| 6984 bp (nad2) | ||
| Megahit | 5162 bp (ctg154136) | |
| 4045 bp (ctg252123) | ||
| 10,025 bp (ctg253367) | ||
| NOVOplasty | 22,032 bp |
Internal stop codon Fragmented
Fig. 1Identification and in-silico functional verification of the previously missing mitochondrial nad2 in H. gammarus. a Amino acid alignment of the H. gammarus nad2 protein with other lobsters showing high sequence conservation (residues in black boxes). Numbers above alignment indicate amino acid position. b Interproscan Identification of conserved protein domains in the H. gammarus nad2 protein (c) Maximum likelihood tree of nad2 proteins rooted with Cherax spp. as the outgroup. Node labels indicate IqTree ultra fast bootstrap support values and branch length indicates the number of substitutions per site
Fig. 3Tandem duplication in the H. gammarus mitogenome. a Linear gene organization comparison of the H. gammarus mitogenome with that of H. americanus (tblastx) showing broadly similar gene organization except for the presence of a tandemly duplicated region (cox2-atp8-atp6-cox3-nad3-nad5) in H. gammarus with dissimilar length. The direction of arrow indicates transcription orientation. Green, blue, purple and red arrows correspond to protein-coding genes, putative pseudogenes, rRNAs and tRNAs, respectively. b Minimap2 alignment of nanopore reads to the mitogenome showing broadly even read coverage with some extra long reads spanning the duplicated regions. c Minimap alignment of Megahit Illumina-only assembly indicating the presence of contigs corresponding to both duplicated regions presumably due to the presence of substantial nucleotide divergence. d Bowtie2 mapping of Illumina reads to the mitogenome showing uneven coverage that appears to correlate with the (e) positional (500 bp window size) GC-content of the mitogenome
Fig. 4Evidence for pseudogenization among genes located in the duplicated regions. Identification of (a) Internal stop codon and (b) Frameshift mutation in one of the duplicated cox2 and cox3 genes, respectively, with strong support from multiple Illumina short reads. Labels with bracketed “P” indicate putative pseudogenes (c) Maximum likelihood tree constructed based on the protein alignment of ATP6. Labels above branches are branch lengths representing the number of amino acid substitutions per site