Literature DB >> 30953569

A draft nuclear-genome assembly of the acoel flatworm Praesagittifera naikaiensis.

Asuka Arimoto1, Tomoe Hikosaka-Katayama2, Akira Hikosaka3, Kuni Tagawa4, Toyoshige Inoue4, Tatsuya Ueki4,5, Masa-Aki Yoshida6, Miyuki Kanda7, Eiichi Shoguchi1, Kanako Hisata1, Noriyuki Satoh1.   

Abstract

BACKGROUND: Acoels are primitive bilaterians with very simple soft bodies, in which many organs, including the gut, are not developed. They provide platforms for studying molecular and developmental mechanisms involved in the formation of the basic bilaterian body plan, whole-body regeneration, and symbiosis with photosynthetic microalgae. Because genomic information is essential for future research on acoel biology, we sequenced and assembled the nuclear genome of an acoel, Praesagittifera naikaiensis.
FINDINGS: To avoid sequence contamination derived from symbiotic microalgae, DNA was extracted from embryos that were free of algae. More than 290x sequencing coverage was achieved using a combination of Illumina (paired-end and mate-pair libraries) and PacBio sequencing. RNA sequencing and Iso-Seq data from embryos, larvae, and adults were also obtained. First, a preliminary ∼17-kilobase pair (kb) mitochondrial genome was assembled, which was deleted from the nuclear sequence assembly. As a result, a draft nuclear genome assembly was ∼656 Mb in length, with a scaffold N50 of 117 kb and a contig N50 of 57 kb. Although ∼70% of the assembled sequences were likely composed of repetitive sequences that include DNA transposons and retrotransposons, the draft genome was estimated to contain 22,143 protein-coding genes, ∼99% of which were substantiated by corresponding transcripts. We could not find horizontally transferred microalgal genes in the acoel genome. Benchmarking Universal Single-Copy Orthologs analyses indicated that 77% of the conserved single-copy genes were complete. Pfam domain analyses provided a basic set of gene families for transcription factors and signaling molecules.
CONCLUSIONS: Our present sequencing and assembly of the P. naikaiensis nuclear genome are comparable to those of other metazoan genomes, providing basic information for future studies of genic and genomic attributes of this animal group. Such studies may shed light on the origins and evolution of simple bilaterians.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Keywords:  zzm321990 Praesagittifera naikaiensiszzm321990 ; acoel; draft nuclear genome

Mesh:

Year:  2019        PMID: 30953569      PMCID: PMC6451197          DOI: 10.1093/gigascience/giz023

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Background

Acoels are small, very simple, planaria-like animals lacking a coelom, a gut, and a circulatory system. Traditional taxonomy positioned the Acoela as the most basal order of the phylum Platyhelminthes [1]. Recent analyses using molecular data, however, have suggested that acoels are members of a new phylum, the Xenacoelomorpha, together with nemertodermatids and xenoturbellids [2-4]. However, whether Xenacoelomorpha is a monophyletic taxon, whether xenacoelomorphs are basal to all other bilaterians, and whether they have close affinity to ambulacrarians are matters of debate [2-4]. Nonetheless, acoels are pivotal to understanding the origins and evolution of bilaterians. Acoels also provide a platform for molecular studies of whole-body regeneration [5] and symbiosis with photosynthetic microalgae. Although mitochondrial genomes of 4 acoel species have been reported [6-8], their nuclear genomes have not been explored yet. Because nuclear genome information is essential to investigate biological questions regarding acoels, we sequenced and assembled a draft nuclear genome of the acoel Praesagittifera naikaiensis (urn:lsid:marinespecies.org:taxname:379972).

Sampling and Sequencing

Biological materials

The marine acoel worm P. naikaiensis is 2–3 mm in length (Fig. 1A) [9]. Members of this species are easily found at seashores of the Seto Inland Sea, Japan, especially during the early summer season (Fig. 1B). Adults contain symbiotic microalgae, Tetraselmis species, which are integrated during juvenile growth (Fig. 1C). Adults were collected at the seashore near the Marine Biological Laboratory of Hiroshima University and maintained in aquaria in the laboratory on a 12-h light/12-h dark photoperiod. Naturally laid eggs were collected and cultured for embryogenesis (Fig. 1C). Embryos were free of symbiotic microalgae. After washing embryos with filtered seawater, genomic DNA was extracted from them using the phenol/chloroform extraction method.
Figure 1:

The acoel worm P. naikaiensis. (A) An adult, dorsal view. Anterior, top and posterior, bottom. Green dots throughout the entire body are symbiotic green algae. Two eggs are seen in the center of the worm. (B) An enormous number of adults gathering on the sandy seashore, resembling dark masses. (C) Embryogenesis. A newly laid egg within the eggshell (i), a 4-cell stage embryo (ii), a gastrula (iii), a flattened-stage embryo (iv), newly hatched aposymbiotic algae (v), and a symbiotic juvenile with symbiotic algae (vi). (D) A peripheral region of an adult worm showing symbiotic microalgae, Tetraselmis species. Scale = 50 μm in (A), (C), (D); 10 cm in (B).

The acoel worm P. naikaiensis. (A) An adult, dorsal view. Anterior, top and posterior, bottom. Green dots throughout the entire body are symbiotic green algae. Two eggs are seen in the center of the worm. (B) An enormous number of adults gathering on the sandy seashore, resembling dark masses. (C) Embryogenesis. A newly laid egg within the eggshell (i), a 4-cell stage embryo (ii), a gastrula (iii), a flattened-stage embryo (iv), newly hatched aposymbiotic algae (v), and a symbiotic juvenile with symbiotic algae (vi). (D) A peripheral region of an adult worm showing symbiotic microalgae, Tetraselmis species. Scale = 50 μm in (A), (C), (D); 10 cm in (B). Embryos, juveniles, and adults were sampled for RNA sequencing. Total RNA extraction was performed using TRIzol Reagent (ThermoFisher, MA, USA, 15596-026) and an RNeasy mini Kit (Qiagen, Hilden, Germany, 74104).

Library preparation and sequencing

DNA

All sequencing libraries were constructed according to the manufacturers’ standard protocols. Briefly, for the Illumina platform, polymerase chain reaction (PCR)-free, paired-end libraries were prepared using an Illumina TruSeq DNA PCR-Free LT Library Prep Kit (Illumina, CA, USA, FC-121-3001). Four mate-pair libraries were prepared using a Nextera Mate Pair Library Prep Kit (Illumina, FC-132-1001) (Supplementary Table 1). For the PacBio platform, a DNA library was prepared using the manufacturer's 10-kb (kilobase pair) template preparation protocol. A SMRTbell Template Prep Kit 1.0 (Pacific Biosciences, CA, USA, 100-259-100) was used for PacBio library preparation. The long-read DNA library was sequenced using a PacBio RSII sequencer employing P6-C4 chemistry (Pacific Biosciences, 100-372-700) with 360-min movie lengths. A total of 52 SMRT Cells were sequenced for long-read DNA library.

RNA

An RNA sequencing (RNA-seq) library was prepared using a TruSeq Stranded mRNA LT Sample Prep Kit (Illumina, RS-122-2101). The library was sequenced using the Illumina HiSeq 2500 platform (Supplementary Table 1). Complementary DNAs for Iso-Seq libraries were prepared using a SMARTer PCR cDNA Synthesis Kit (Clontech, CA, USA, 634925). The SageELF size selection system (Sage Science, MA, USA) was used following the manufacturer's standard protocol (Supplementary Table 1). A SMRTbell Template Prep Kit 1.0 (Pacific Biosciences, 100-259-100) was used for Iso-Seq library preparation. The library was sequenced using a PacBio RSII sequencer employing P6-C4 chemistry (Pacific Biosciences, 100-372-700) with 360-min movie lengths. A total of 8 SMRT Cells were sequenced for the Iso-Seq RNA library.

Assembly of mitochondrial and nuclear genomes

Adapter sequences in PCR-free and mate-pair Illumina reads were removed with Trimmomatic 0.36 (Trimmomatic, RRID:SCR_011848) [10] and NextClip 1.3.1 (NextClip, RRID:SCR_005465) [11], respectively. Low-quality (12] after adapter cleanup. Reads that lacked a corresponding pair were discarded.

Mitochondrial genome assembly

To distinguish mitochondrial genome sequences from the nuclear genome assembly, we first assembled the mitochondrial genome. To this end, a mitochondrial 16S ribosomal RNA of Symsagittifera roscoffensis (accession No. NC_014578) [7] was used to collect PacBio long reads of P. naikaiensis sequences using BLAST+ 2.3.0 [13] with the “dc-megablast” option. Collected reads longer than 1 kb and shorter than 12 kb were assembled using sprai 0.9.9.19 [14] with default settings. Circularity of assembled contigs was checked automatically in the sprai assembly pipeline.

Nuclear genome assembly

P. naikaiensis mitochondrial sequences were mapped onto trimmed reads using BWA 0.7.12 (BWA, RRID:SCR_010910) [15], and those read pairs that mapped onto the mitochondrial sequences were excluded from the data set. PacBio long reads were also mapped against the mitochondrial genome using BLASR (BLASR, RRID:SCR_000764) (commit version: 5.3.574e1c2) [16]. Only unmapped or cleaned-up Illumina and PacBio reads were assembled using the MaSuRCA (MaSuRCA, RRID:SCR_010691) assembler 3.2.2 [17]. Putative heterozygous and/or polymorphic sequences that remained in the assembled genome were merged as homozygous sequences using redundans 0.13c [18]. Gaps in the homozygous genome were filled using PBJelly (PBJelly, RRID:SCR_012091) in PBSuite 15.8.24 [19]. After gap closing, BESST 2.2.6 [20] and LINKS 1.8.5 [21] were used to perform scaffolding with Illumina and PacBio reads, respectively. Scaffolds were polished using Racon (commit version: 083444) [22] with PacBio long reads. PacBio Iso-Seq reads were mapped onto scaffolds using GMAP (GMAP, RRID:SCR_008992) version 2017-08-15 [23], and then L_RNA_scaffolder [24] was used to concatenate scaffolds based on the results of Iso-Seq read mapping. Scaffolds were polished using Pilon 1.22 (Pilon, RRID:SCR_014731) [25] with PCR-free Illumina reads used for the MaSuRCA assembly. BUSCO 3.0.2 (Benchmarking Universal Single-Copy Orthologs, RRID:SCR_015008) [26] with a metazoan data set was used to evaluate the polished final genome assembly.

Genome size estimation

PCR-free, paired-end reads used for genome assembly were analyzed. K-mers in the data set were counted with Jellyfish 2.2.3 (Jellyfish, RRID:SCR_005491) [27] (Supplementary Figure 1). The genome size of P. naikaiensis was estimated from obtained k-mer frequencies using GenomeScope web tools [28].

Repeat analysis

Repetitive sequences in the assembled genome were identified using RepeatModeler 1.0.11 (RepeatModeler, RRID:SCR_015027) [29] and RepeatMasker 4.0.7 (RepeatMasker, RRID:SCR_012954) [30].

Transcriptome assembly, gene prediction, and gene annotation

Adapter sequences and low-quality (31] with default settings and the strand-specific option. In addition, genome-guided transcriptome assembly was performed. RNA-seq reads were mapped onto the genome using STAR 2.5.2a (STAR, RRID:SCR_015899) [32] and then mapped reads were assembled using Trinity. De novo assembled Illumina transcriptome and PacBio Iso-Seq sequences were mapped onto the genome using minimap-2 version 2.6 [33] with the “-ax splice” option. These mapping results and the genome-guided assembly of Illumina RNA-seq reads were integrated based on genome sequences using PASA 2.2.0 (PASA, RRID:SCR_014656) [34]. Putative full-length transcripts having both a 5′ and a 3′ untranslated region were detected using TransDecoder 5.0.2 [35]. These full-length transcripts were used as a training set for gene prediction. De novo transcriptome assembly of a data set containing 15 xenacoelomorphs (Supplementary Table 2) was also performed, following the procedure described above to create similarity hints for gene prediction. Assembled sequences of other acoels were translated into protein sequences using TransDecoder and then mapped against the P. naikaiensis genome using Exonerate 2.2.0 (Exonerate, RRID:SCR_016088) [36]. A final set of gene models reflecting hint information was generated with AUGUSTUS 3.2.1 (Augustus, RRID:SCR_008417) [37]. Gene models were annotated using BLAST searches (E-value cutoff of 10−5) against the NCBI RefSeq protein database release 88. Protein domains in gene models were detected using HMMER 3.1b2 (Hmmer, RRID:SCR_005305) [38] and Pfam-A 31.0 under default settings, except for an E-value cutoff of 10−5.

A draft assembly

Mitochondrial genome

The complete, closed circular mitochondrial genome of P. naikaiensis was recovered from genome-sequencing data. The mitochondrial genome is 17,787 base-pairs long and contains 12 protein-coding genes, small and large ribosomal RNAs, and 22 predicted transfer RNAs (Supplementary Figure 2). When cox1 was positioned at the start of the genome on the “positive” strand, 8 protein-coding genes were found in the same strand while nad2, cytb, and nad5 were found on the “negative” strand (Supplementary Figure 2). Both ribosomal RNAs were found separately on the positive strand. Although the number of mitochondrial genes of P. naikaiensis is comparable to that of the previously reported mitochondrial genes of Archaphanostoma ylvae [8], the order of genes in the genomes was quite different between them.

Nuclear genome

K-mer analysis showed that the P. naikaiensis genome constitutes ∼654 Mb (Table 1; Supplementary Figure 1). Illumina paired-end and mate-pair, and PacBio reads provided 204x and 221x, and 73x coverage of the estimated genome, respectively (Supplementary Table 1 and Supplementary Figure 1). The assembly appeared to plateau during both scaffolding and contig formation (Supplementary Figure 3). As a result, the draft assembly comprised 656 Mb (Table   1), very close to the estimated genome size. The scaffold N50 reached 117 kb, and 12 scaffolds were >500 kb in length (Table 1; Supplementary Table 3). Inserted gaps made up only 1.7% of the total scaffold assembly (Supplementary Table 3). The contig N50 was 57 kb, and 41 contigs exceeded 250 kb (Table   1; Supplementary Table 3). The guanine + cytosine content of the genome was estimated at 39.1% (Table   1; Supplementary Figure 4).
Table 1:

Genome assembly statistics for P. naikaiensis

Genome featureValue
Estimated genome size*654.1 Mb
Assembled genome size656.1 Mb
Scaffolds (≥500 bp)
 No.12,072
 N50117 kb
Contigs (≥500 bp)
 No.24,071
 N5057 kb
Gaps1.66%
Repetitive sequences69.8%
Guanine + cytosine content39.1%
Predicted protein-coding genes (loci)22,143
Genes with transcript support99%
Mean transcript length2447 nucleotides
Mean exon frequency per gene5.7
BUSCO analysis
 Complete748/978 (76.5%)
 Fragmented37/978 (3.8%)

*Estimated by k-mer analysis of Illumina PCR-free reads as shown in Supplementary Figure 1.

Genome assembly statistics for P. naikaiensis *Estimated by k-mer analysis of Illumina PCR-free reads as shown in Supplementary Figure 1. Analysis of repetitive sequences showed that ∼69.8% of the genome consists of repetitive sequences (Tables 1 and 2; this value was estimated by deletion of overlapped sequences from the total data). DNA transposons (mutator-like transposable elements [MULEs], Marverick, hAT, and others), retrotransposons (long terminal repeats [LTRs], long and short interspersed nuclear elements), and other repetitive sequences (including low-complexity repeats and simple repeats) represented 12.2%, 41.5%, and 2.2% of the assembly, respectively (Table 2). The most prominent family was Gypsy of LTR, occupying 28.2% of the genome. In addition, the genome contains unclassified repetitive elements that accounted for 20.4% of it (Table 2). Thus, the P. naikaiensis genome contains a comparatively high percentage of repetitive sequences.
Table 2:

Repetitive sequences in the P. naikaiensis genome assembly

Class%
DNA transposons12.2
  MULE5.7
  Maverick4.5
  hAT0.9
  Others1.1
Retrotransposons41.5
 LTR35.7
  Gypsy28.2
  Copia1.5
  Pao1.1
  Others4.9
 Long interspersed nuclear elements4.8
  CR11.7
  CRE1.0
  L20.9
  Others1.2
 Short interspersed nuclear elements1.0
Others2.2
 RNA0.03
 Low complexity0.07
 Satellite0.4
 Simple repeat1.7
Unclassified20.4
Total (excluding overlapped sequences)69.8
Repetitive sequences in the P. naikaiensis genome assembly An interesting question concerns the locations of transposable elements (TEs) in introns. We found that 32,110 TEs are present in intron regions; 29%, 28%, and 12% of them correspond to “uncharacterized,” “LTR (Gypsy),” and “DNA transposon (MULE),” respectively. On the other hand, we failed to find introns that are composed of only TEs.

Transcriptomes

Transcriptome data, especially those from PacBio Iso-Seq long reads, provided a set of high-quality RNA data (Supplementary Table 1). The mean length of transcriptomes was 2,447 nucleotides, and the mean number of exons per gene was 5.7 (Table 1).

Gene modeling

Gene modeling of the P. naikaiensis genome produced 22,143 protein-coding genes (Table 1). As mentioned above, we obtained a set of high-quality RNA data. As a result, 99% of gene models were substantiated by the transcriptomes (Table 1). BUSCO analysis indicated that 76.5% and 3.8% of them were supported as complete and fragmented genes, respectively (Table 1).

Gene annotation

Gene families predicted by RefSeq (BLAST), Pfam (HMMER), and PANTHER (HMMER) were 15,294, 13,225, and 17,384 in number, respectively (Supplementary Figure 5). Using Pfam-supported families, we examined the number of gene families. Table   3 shows numbers of putative transcription regulator genes in the P. naikaiensis genome. The 2 most abundant families were zinc finger (C2H2 type) and homeobox domain-containing genes, with 73 and 62 members, respectively. Twenty each were annotated to the helix-loop-helix and zinc finger (C4 types) families. Although more detailed analysis is required, the P. naikaiensis genome appears to contain numbers of transcription regulator genes comparable to those of other bilaterian genomes.
Table 3:

Numbers of putative transcriptional regulator genes in the P. naikaiensis genome

AccessionIDDescriptionNo. of genes
PF00010HLHHelix-loop-helix DNA-binding domain20
PF00046HomeoboxHomeobox domain62
PF00096zf-C2H2Zinc finger, C2H2 type73
PF00104Hormone_recepLigand-binding domain of nuclear hormone14
PF00105zf-C4Zinc finger, C4 type20
PF00157PouPou domain3
PF00170bZIP_1bZIP transcription factor13
PF00178EtsEts domain13
PF00250Fork_headFork head domain11
PF00292PAX“Paired box” domain5
PF00319SRF-TFSRF-type transcription factor2
PF00320GATAGATA zinc finger7
PF00505HMG_boxHMG (high mobility group) box12
PF00554RHDRel homology domain (RHD)1
PF00853RuntRunt domain1
PF00870P53P53 DNA-binding domain1
PF00907T-boxT-box4
PF01388ARIDARID/BRIGHT DNA-binding domain4
PF01530zf-C2HCZinc finger, C2HC type2
PF02376CUTCUT domain3
PF03299TF_AP-2Transcription factor AP-21
PF05044Prox1Homeo-prospero domain1
PF07527Hairy_orangeHairy orange1
PF07716bZIP_2Basic region leucine zipper11
Numbers of putative transcriptional regulator genes in the P. naikaiensis genome A similar analysis was carried out on putative signaling molecule genes (Table 4). The largest gene family was tyrosine kinase, represented by 316 genes. In addition, epidermal growth factor (EGF)-like domain genes, G-protein α subunit genes, and regulator of G-protein signaling genes numbered 28, 31, and 16, respectively (Table 4).
Table 4:

Numbers of genes encoding putative signaling molecules in the P. naikaiensis genome

AccessionIDDescriptionNo. of genes
PF00008EGFEGF-like domain28
PF00019TGF_betaTransforming growth factor β like5
PF00110wntwnt family4
PF00167FGFFibroblast growth factor3
PF00219IGFBPInsulin-like growth factor binding protein1
PF00503G-alphaG-protein α subunit31
PF00615RGSRegulator of G protein signaling16
PF00631G-gammaGGL domain5
PF00688TGFb_propeptideTransforming growth factor β propeptide3
PF00778DIXDIX domain5
PF01017STAT_alphaSTAT protein, all-α domain2
PF01534FrizzledFrizzled/Smoothened family membrane region6
PF02262Cbl_NCBL proto-oncogene N-terminal domain 12
PF02377DishevelledDishevelled specific domain1
PF02761Cbl_N2CBL proto-oncogene N-terminus, EF hand-like2
PF02762Cbl_N3CBL proto-oncogene N-terminus, SH2-like domain2
PF02864STAT_bindSTAT protein, DNA binding domain2
PF02865STAT_intSTAT protein, protein interaction domain2
PF07714Pkinase_TyrTyrosine kinase316
Numbers of genes encoding putative signaling molecules in the P. naikaiensis genome

Genome browser

A genome browser was established for the assembled sequences using the JBrowser 1.12.3 [39]. Its URL is [40] (Fig. 2). The gene annotations from the protein domain search and BLAST search have similarly been shown on the site.
Figure 2:

A screen shot of the genome browser of Praesagittifera naikaiensis[40].

A screen shot of the genome browser of Praesagittifera naikaiensis[40].

Availability of supporting data and materials

Genomic and transcriptomic sequence reads have been deposited in the DDBJ sequence read archive under accession No. PRJDB7329. All data are also available from the GigaScience GigaDB repository [41].

Additional files

Supplementary Table 1: Sequence data summary. Supplementary Figure 1: K-mer analysis and genome size estimation of Praesagittifera naikaiensis genomic DNA reads. Supplementary Table 2: Xenacoelomorph dataset used for gene prediction. Supplementary Figure 2: A preliminary circular assembly of the mitochondria genome of Praesagittifera naikaiensis. Supplementary Figure 3: Accumulation of assembled sequences (contigs, blue and scaffolds, red) reaching over 600 Mb. Supplementary Table 3: Summary of the Praesagittifera naikaiensis genome assembly. Supplementary Figure 4: GC content of the Praesagittifera naikaiensis genome. Supplementary Figure 5: Praesagittifera naikaiensis gene annotation.

Abbreviations

BUSCO: Benchmarking Universal Single-Copy Orthologs; EGF: epidermal growth factor; kb: kilobase pair; LTR: long terminal repeat; Mb: megabase pair; MULE: mutator-like transposable element; PCR: polymerase chain reaction; RNA-seq: RNA sequencing; TE: transposable element.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was funded by Okinawa Institute of Science and Technology Internal Funds to the Marine Genomics Unit (N.S.). This work was also supported by a JSPS grant (No. 17K07535) and a research grant from the Research Institute of Marine Invertebrates (Japan) to A.H.

Author contributions

N.S., T.H.K., A.H., K.T., A.A., M.A.Y., and T.U. conceived and supervised the project. T.H.K. and T.I. collected the majority of samples. M.K. and A.A. performed sequencing. A.A., K.H., E.S., and T.H.K. performed analyses. N.S., A.A., and T.H.K. prepared the manuscript, and all authors approved the final manuscript. Click here for additional data file. Click here for additional data file. Click here for additional data file. 11/10/2018 Reviewed Click here for additional data file. 11/22/2018 Reviewed Click here for additional data file. 1/29/2019 Reviewed Click here for additional data file. Click here for additional data file.
  31 in total

1.  Acoel flatworms: earliest extant bilaterian Metazoans, not members of Platyhelminthes.

Authors:  I Ruiz-Trillo; M Riutort; D T Littlewood; E A Herniou; J Baguña
Journal:  Science       Date:  1999-03-19       Impact factor: 47.728

2.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.

Authors:  Guillaume Marçais; Carl Kingsford
Journal:  Bioinformatics       Date:  2011-01-07       Impact factor: 6.937

3.  Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors:  Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal:  Bioinformatics       Date:  2008-01-24       Impact factor: 6.937

4.  The MaSuRCA genome assembler.

Authors:  Aleksey V Zimin; Guillaume Marçais; Daniela Puiu; Michael Roberts; Steven L Salzberg; James A Yorke
Journal:  Bioinformatics       Date:  2013-08-29       Impact factor: 6.937

5.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

6.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

7.  Assembly scaffolding with PE-contaminated mate-pair libraries.

Authors:  Kristoffer Sahlin; Rayan Chikhi; Lars Arvestad
Journal:  Bioinformatics       Date:  2016-03-02       Impact factor: 6.937

8.  Acoelomorph flatworms are deuterostomes related to Xenoturbella.

Authors:  Hervé Philippe; Henner Brinkmann; Richard R Copley; Leonid L Moroz; Hiroaki Nakano; Albert J Poustka; Andreas Wallberg; Kevin J Peterson; Maximilian J Telford
Journal:  Nature       Date:  2011-02-10       Impact factor: 49.962

9.  GenomeScope: fast reference-free genome profiling from short reads.

Authors:  Gregory W Vurture; Fritz J Sedlazeck; Maria Nattestad; Charles J Underwood; Han Fang; James Gurtowski; Michael C Schatz
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

10.  NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries.

Authors:  Richard M Leggett; Bernardo J Clavijo; Leah Clissold; Matthew D Clark; Mario Caccamo
Journal:  Bioinformatics       Date:  2013-12-02       Impact factor: 6.937

View more
  3 in total

1.  A draft nuclear-genome assembly of the acoel flatworm Praesagittifera naikaiensis.

Authors:  Asuka Arimoto; Tomoe Hikosaka-Katayama; Akira Hikosaka; Kuni Tagawa; Toyoshige Inoue; Tatsuya Ueki; Masa-Aki Yoshida; Miyuki Kanda; Eiichi Shoguchi; Kanako Hisata; Noriyuki Satoh
Journal:  Gigascience       Date:  2019-04-01       Impact factor: 6.524

2.  Extensive loss of Wnt genes in Tardigrada.

Authors:  Raul A Chavarria; Mandy Game; Briana Arbelaez; Chloe Ramnarine; Zachary K Snow; Frank W Smith
Journal:  BMC Ecol Evol       Date:  2021-12-27

3.  Structural and Functional Characterization of the FGF Signaling Pathway in Regeneration of the Polychaete Worm Alitta virens (Annelida, Errantia).

Authors:  Alexandra Y Shalaeva; Roman P Kostyuchenko; Vitaly V Kozin
Journal:  Genes (Basel)       Date:  2021-05-21       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.