Literature DB >> 32258427

The genome sequence of the Eurasian river otter, Lutra lutra Linnaeus 1758.

Dan Mead1, Frank Hailer2, Elisabeth Chadwick2, Roberto Portela Miguez3, Michelle Smith1, Craig Corton1, Karen Oliver1, Jason Skelton1, Emma Betteridge1, Jale Doulcan Doulcan1, Olga Dudchenko4, Arina Omer4, David Weisz4, Erez Lieberman Aiden4, Shane McCarthy1, Kerstin Howe1, Ying Sims1, James Torrance1, Alan Tracey1, Richard Challis1, Richard Durbin1, Mark Blaxter1.   

Abstract

We present a genome assembly from an individual male Lutra lutra (the Eurasian river otter; Vertebrata; Mammalia; Eutheria; Carnivora; Mustelidae). The genome sequence is 2.44 gigabases in span. The majority of the assembly is scaffolded into 20 chromosomal pseudomolecules, with both X and Y sex chromosomes assembled. Copyright:
© 2020 Mead D et al.

Entities:  

Keywords:  Lutra lutra river otter genome sequence chromosomal

Year:  2020        PMID: 32258427      PMCID: PMC7097881          DOI: 10.12688/wellcomeopenres.15722.1

Source DB:  PubMed          Journal:  Wellcome Open Res        ISSN: 2398-502X


Species taxonomy

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Mustelidae; Lutrinae; Lutra; Lutra lutra Linnaeus 1758 (NCBI txid 9657).

Background

The Eurasian river otter, Lutra lutra, is found along the coasts and inland waters of Europe, Asia, China, Japan, Java, Sri lanka, the Middle East and North Africa. Eurasia. Throughout Europe, populations of L. lutra declined precipitously through the latter half of the 20th century, and the species is of active conservation concern. In Ireland, L. lutra populations have remained relatively stable [1], and in Britain river restoration and active intervention have resulted in increased populations, and recolonisation of watersheds from which otters had been eliminated [2]. There is active research of the continuing impacts of pollutants on otters ( Pountney ), and on the population genetic patterns that have resulted from their near-extinction and subsequent recovery in Britain ( Stanton ). Here we present a chromosomally assembled genome sequence for L. lutra, based on a male specimen from Britain.

Genome sequence report

The genome was sequenced from a naturally deceased single male L. lutra collected by the Cardiff Otter Project from Wincanton, Somerset. A total of 63-fold coverage in Pacific Biosciences single-molecule long reads (N50 24 kb) and 58-fold coverage in 10X Genomics read clouds (from molecules with an estimated N50 of 57 kb) were generated. Primary assembly contigs were scaffolded with chromosome conformation HiC data (17-fold coverage). The final assembly has a total length of 2.44 Gb in 43 sequence scaffolds with a scaffold N50 of 149.0 Mb ( Table 1). The majority, 92.7%, of the assembly sequence was assigned to 20 chromosomal-level scaffolds representing 18 autosomes (numbered by sequence length), and the X and Y sex chromosomes ( Figure 1– Figure 4; Table 2). The assembly has a BUSCO ( Simão ) completeness of 95.8% using the mammalia_odb9 reference set. While not fully phased, the assembly deposited is of one haplotype. Contigs corresponding to the second haplotype have also been deposited.
Table 1.

Genome data for Lutra lutra mLutLut1.

Project accession data
Assembly identifiermLutLut1
Species Lutra lutra
SpecimenNHMUK ZD 2019.215
NCBI taxonomy ID9657
BioProjectPRJEB35340
Biosample IDSAMEA994731
Isolate informationWild casualty; male
Raw data accessions
PacificBiosciences SEQUEL IERR3313238, ERR3313239-ERR3313241, ERR3313246, ERR3313327, ERR3313330, ERR3313333-ERR3313341
10X Genomics IlluminaERR3316145-ERR3316148, ERR3316169-ERR3316171
Hi-C IlluminaSRR10119468
Genome assembly
Assembly accessionGCA_902655055.1
Accession of alternate haplotype GCA_902653095.1
Span (Mb)2,438.00
Number of contigs228
Contig N50 length (Mb)30.40
Number of scaffolds43
Scaffold N50 length (Mb)149.00
Longest scaffold (Mb)223.45
BUSCO * genome scoreC:95.8%[S:94.3%,D:1.5%],F:1.9%,M:2.3%,n:4104

* BUSCO scores based on the mammalia_odb9 BUSCO set using v3.0.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/mLutLut1_1/dataset/mLutLut1_1/busco.

Figure 1.

Genome assembly of Lutra lutra mLutLut1: BlobToolKit Snailplot.

The plot shows N50 metrics for L. lutra assembly mLutLut1 and BUSCO scores for the Euarchontoglires set of orthologues. The interactive version of this figure is hosted here.

Figure 4.

Genome assembly of Lutra lutra mLutLut1: Hi-C contact map.

Hi-C contact map of the L. lutra mLutLut1 assembly, visualized in Juicebox ( Durand ). An interactive version of the map hosted here, powered by Juicebox.js ( Robinson ).

Table 2.

Chromosomal pseudomolecules in the genome assembly of Lutra lutra mLutLut1.

ENA accession Chromosome Size (Mb) GC%
LR738403.11223.4541
LR738404.12210.6539
LR738405.13201.3239.5
LR738406.14197.7141.7
LR738407.15165.8140.3
LR738408.16154.4340.1
LR738409.17149.0141.9
LR738410.18144.7541.3
LR738411.19144.0942.9
LR738412.110114.6642.7
LR738413.111108.7940.6
LR738414.11296.4543
LR738415.11395.7342.7
LR738416.11489.0843.1
LR738417.11569.9942.8
LR738418.11661.4846.9
LR738419.11760.3546.2
LR738420.11840.4348.2
LR738421.1X99.6941.2
LR738422.1Y2.2538.8
* BUSCO scores based on the mammalia_odb9 BUSCO set using v3.0.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/mLutLut1_1/dataset/mLutLut1_1/busco.

Genome assembly of Lutra lutra mLutLut1: BlobToolKit Snailplot.

The plot shows N50 metrics for L. lutra assembly mLutLut1 and BUSCO scores for the Euarchontoglires set of orthologues. The interactive version of this figure is hosted here.

>Genome assembly of Lutra lutra mLutLut1: BlobToolKit GC-coverage plot.

The interactive version of this figure is hosted here.

Genome assembly of Lutra lutra mLutLut1: BlobToolKit Cumulative sequence plot.

The interactive version of this figure is hosted here.

Genome assembly of Lutra lutra mLutLut1: Hi-C contact map.

Hi-C contact map of the L. lutra mLutLut1 assembly, visualized in Juicebox ( Durand ). An interactive version of the map hosted here, powered by Juicebox.js ( Robinson ).

Methods

The river otter specimen was collected from Wincanton, Somerset by the Cardiff Otter Project. A full tissue dissection and preservation in 80% ethanol was undertaken and the specimen accessioned by the Natural History Museum, London. DNA was extracted using an agarose plug extraction from spleen tissue following the Bionano Prep Animal Tissue DNA Isolation Soft Tissue Protocol. Pacific Biosciences CLR long read and 10X Genomics read cloud sequencing libraries were constructed according to the manufacturers’ instructions. Sequencing was performed by the Scientific Operations core at the Wellcome Sanger Institute on Pacific Biosciences SEQUEL I and Illumina HiSeq X instruments. Hi-C data were generated by the Aiden lab using an optimised version of their protocols ( Dudchenko ). Assembly was carried out using Falcon-unzip ( Chin ), haplotypic duplication was identified and removed with purge_dups ( Guan ) and a first round of scaffolding carried out with 10X Genomics read clouds using scaff10x ( https://github.com/wtsi-hpag/Scaff10X). Scaffolding with Hi-C data ( Rao ) was carried out with 3D-DNA ( Dudchenko ), followed by manual curation with Juicebox Assembly Tools ( Dudchenko ; Durand ; Robinson ) and visualisation in HiGlass ( Kerpedjiev ). The Hi-C scaffolded assembly was polished with arrow using the PacBio data, then polished with the 10X Genomics Illumina data by aligning to the assembly with longranger align, calling variants with freebayes ( Garrison & Marth, 2012) and applying homozygous non-reference edits using bcftools consensus ( https://github.com/VGP/vgp-assembly/tree/master/pipeline/freebayes-polish). Two rounds of the Illumina polishing were applied. The assembly was checked for contamination and corrected using the gEVAL system ( Chow ). We removed two low-coverage scaffolds that were likely to have derived from the ribosomal DNA cistron of a Sarcocystis species (most similar to Sarcocystis lutrae). The genome was analysed within the BlobToolKit environment ( Challis ).

Data availability

European Nucleotide Archive: Lutra lutra (Eurasian otter) genome assembly, mLutLut1. BioProject accession number PRJEB35340; https://www.ebi.ac.uk/ena/data/view/PRJEB35340. The genome sequence is released openly for reuse. The L. lutra genome sequencing initiative is part of the Wellcome Sanger Institute’s “25 genomes for 25 years” project [3]. It is also part of the Vertebrate Genome Project (VGP) [4] ordinal references programme, the DNA Zoo Project [5] and the Darwin Tree of Life (DToL) project [6]. The specimen has been preserved in ethanol and deposited with the Natural History Museum, London under registration number NHMUK ZD 2019.215 where it will remain accessible to the research community for posterity. All raw data and the assembly have been deposited in the ENA. The genome will be annotated and presented through the Ensembl pipeline at the European Bioinformatics Institute. Raw data and assembly accession identifiers are reported in Table 1. The manuscript describes the de novo genome assembly of the Eurasian river otter ( Lutra lutra). The authors combine the latest state-of-the art sequencing methods, PacificBioscience and 10X Genomics, to generate primary contigs which are then assembled into 43 scaffolds, having a high N50 value of 149 Mb, using Hi-C data. In addition, 92.7% of the total assembly are assigned to chromosomal pseudomolecules, representing 18 autosomes as well as X and Y sex chromosomes. Together with a high level of completeness as analysed by BUSCO quality assessment, this de novo assembly provides a high-quality draft genome sequence. The genome resources generated in this project will be instrumental to investigate the genetic structure, genetic diversity and phylogeny of Eurasian otters. As monitoring tools are urgently needed for development and evaluation of conservation efforts the genome sequence will provide genetic markers to be used in conservation genetic studies. The manuscript is concise, data is presented comprehensively and the figures are linked to interactive versions Recommendation: The Busco software used for genome quality assessment should to be included in Table 3. I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. This manuscript described the detailed information about the chromosome-level de novo genome assembly of the Eurasian river otter. They used PacBio SMRT and 10X Genomics sequencing techniques to construct the scaffolds of the genome. These two techniques are widely popular sequencing methods that are good at constructing long scaffolds. They assembled as low as 43 scaffolds. Furthermore, they used Hi-C technique to assign the scaffolds on chromosomes. As a result, 92.7% of the assembly sequences were assigned to 20 chromosomal pseudomolecules. The levels of contig N50 and scaffold N50, and BUSCO genome assembly assessment showed that the genome assembly is of high-quality. The Eurasian river otter is listed as Near Threatened under the IUCN red list, and some populations have been decreasing. So, it is important to know the genetic diversity, genetic structure and adaptive evolution mechanisms of Eurasian river otters. The chromosome-level genome assembly will help answer the above questions, which has important conservation implications for this species. The report about the genome assembly was detailed, and the manuscript written well. I just have two comments. I suggest to add a photo of Eurasian river otter to show this species to readers. Table 2: it is better to add the number of scaffolds assigned for each chromosome. I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
Table 3.

Software tools used.

Software toolVersionSource
Falcon-unzipfalcon-kit 1.2.2( Chin et al., 2016)
purge_dups1.0.0( Guan et al., 2020)
3D-DNA180419( Dudchenko et al., 2018)
scaff10x4.2 https://github.com/wtsi-hpag/Scaff10X
arrowGenomicConsensus 2.3.3 https://github.com/PacificBiosciences/GenomicConsensus
longranger align2.2.2 https://support.10xgenomics.com/genome-exome/software/ pipelines/latest/advanced/other-pipelines
freebayesv1.1.0-3-g961e5f3( Garrison & Marth, 2012)
bcftools consensus1.9 http://samtools.github.io/bcftools/bcftools.html
gEVAL2016( Chow et al., 2016)
BlobToolKit1( Challis et al., 2019)
  11 in total

1.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.

Authors:  Felipe A Simão; Robert M Waterhouse; Panagiotis Ioannidis; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Bioinformatics       Date:  2015-06-09       Impact factor: 6.937

2.  Phased diploid genome assembly with single-molecule real-time sequencing.

Authors:  Chen-Shan Chin; Paul Peluso; Fritz J Sedlazeck; Maria Nattestad; Gregory T Concepcion; Alicia Clum; Christopher Dunn; Ronan O'Malley; Rosa Figueroa-Balderas; Abraham Morales-Cruz; Grant R Cramer; Massimo Delledonne; Chongyuan Luo; Joseph R Ecker; Dario Cantu; David R Rank; Michael C Schatz
Journal:  Nat Methods       Date:  2016-10-17       Impact factor: 28.547

3.  De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds.

Authors:  Olga Dudchenko; Sanjit S Batra; Arina D Omer; Sarah K Nyquist; Marie Hoeger; Neva C Durand; Muhammad S Shamim; Ido Machol; Eric S Lander; Aviva Presser Aiden; Erez Lieberman Aiden
Journal:  Science       Date:  2017-03-23       Impact factor: 47.728

4.  A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping.

Authors:  Suhas S P Rao; Miriam H Huntley; Neva C Durand; Elena K Stamenova; Ivan D Bochkov; James T Robinson; Adrian L Sanborn; Ido Machol; Arina D Omer; Eric S Lander; Erez Lieberman Aiden
Journal:  Cell       Date:  2014-12-11       Impact factor: 41.582

5.  Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom.

Authors:  Neva C Durand; James T Robinson; Muhammad S Shamim; Ido Machol; Jill P Mesirov; Eric S Lander; Erez Lieberman Aiden
Journal:  Cell Syst       Date:  2016-07       Impact factor: 10.304

6.  High liver content of polybrominated diphenyl ether (PBDE) in otters (Lutra lutra) from England and Wales.

Authors:  Angela Pountney; Amy L Filby; Gareth O Thomas; Vic R Simpson; Elizabeth A Chadwick; Jamie R Stevens; Charles R Tyler
Journal:  Chemosphere       Date:  2014-07-09       Impact factor: 7.086

7.  gEVAL - a web-based browser for evaluating genome assemblies.

Authors:  William Chow; Kim Brugger; Mario Caccamo; Ian Sealy; James Torrance; Kerstin Howe
Journal:  Bioinformatics       Date:  2016-04-07       Impact factor: 6.937

8.  BlobToolKit - Interactive Quality Assessment of Genome Assemblies.

Authors:  Richard Challis; Edward Richards; Jeena Rajan; Guy Cochrane; Mark Blaxter
Journal:  G3 (Bethesda)       Date:  2020-04-09       Impact factor: 3.154

9.  Identifying and removing haplotypic duplication in primary genome assemblies.

Authors:  Dengfeng Guan; Shane A McCarthy; Jonathan Wood; Kerstin Howe; Yadong Wang; Richard Durbin
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

10.  HiGlass: web-based visual exploration and analysis of genome interaction maps.

Authors:  Peter Kerpedjiev; Nezar Abdennur; Fritz Lekschas; Chuck McCallum; Kasper Dinkla; Hendrik Strobelt; Jacob M Luber; Scott B Ouellette; Alaleh Azhir; Nikhil Kumar; Jeewon Hwang; Soohyun Lee; Burak H Alver; Hanspeter Pfister; Leonid A Mirny; Peter J Park; Nils Gehlenborg
Journal:  Genome Biol       Date:  2018-08-24       Impact factor: 13.583

View more
  1 in total

1.  Sequence locally, think globally: The Darwin Tree of Life Project.

Authors: 
Journal:  Proc Natl Acad Sci U S A       Date:  2022-01-25       Impact factor: 12.779

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.