Literature DB >> 23433242

Orthology Guided Assembly in highly heterozygous crops: creating a reference transcriptome to uncover genetic diversity in Lolium perenne.

Tom Ruttink1, Lieven Sterck, Antje Rohde, Christian Bendixen, Pierre Rouzé, Torben Asp, Yves Van de Peer, Isabel Roldan-Ruiz.   

Abstract

Despite current advances in next-generation sequencing data analysis procedures, de novo assembly of a reference sequence required for SNP discovery and expression analysis is still a major challenge in genetically uncharacterized, highly heterozygous species. High levels of polymorphism inherent to outbreeding crop species hamper De Bruijn Graph-based de novo assembly algorithms, causing transcript fragmentation and the redundant assembly of allelic contigs. If multiple genotypes are sequenced to study genetic diversity, primary de novo assembly is best performed per genotype to limit the level of polymorphism and avoid transcript fragmentation. Here, we propose an Orthology Guided Assembly procedure that first uses sequence similarity (tBLASTn) to proteins of a model species to select allelic and fragmented contigs from all genotypes and then performs CAP3 clustering on a gene-by-gene basis. Thus, we simultaneously annotate putative orthologues for each protein of the model species, resolve allelic redundancy and fragmentation and create a de novo transcript sequence representing the consensus of all alleles present in the sequenced genotypes. We demonstrate the procedure using RNA-seq data from 14 genotypes of Lolium perenne to generate a reference transcriptome for gene discovery and translational research, to reveal the transcriptome-wide distribution and density of SNPs in an outbreeding crop and to illustrate the effect of polymorphisms on the assembly procedure. The results presented here illustrate that constructing a non-redundant reference sequence is essential for comparative genomics, orthology-based annotation and candidate gene selection but also for read mapping and subsequent polymorphism discovery and/or read count-based gene expression analysis.
© 2013 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

Entities:  

Mesh:

Year:  2013        PMID: 23433242     DOI: 10.1111/pbi.12051

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


  11 in total

1.  An ultra-high density genetic linkage map of perennial ryegrass (Lolium perenne) using genotyping by sequencing (GBS) based on a reference shotgun genome assembly.

Authors:  Janaki Velmurugan; Ewan Mollison; Susanne Barth; David Marshall; Linda Milne; Christopher J Creevey; Bridget Lynch; Helena Meally; Matthew McCabe; Dan Milbourne
Journal:  Ann Bot       Date:  2016-06-06       Impact factor: 4.357

2.  Towards an improved apple reference transcriptome using RNA-seq.

Authors:  Yang Bai; Laura Dougherty; Kenong Xu
Journal:  Mol Genet Genomics       Date:  2014-02-16       Impact factor: 3.291

3.  Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

Authors:  Carole F S Koning-Boucoiran; G Danny Esselink; Mirjana Vukosavljev; Wendy P C van 't Westende; Virginia W Gitonga; Frans A Krens; Roeland E Voorrips; W Eric van de Weg; Dietmar Schulz; Thomas Debener; Chris Maliepaard; Paul Arens; Marinus J M Smulders
Journal:  Front Plant Sci       Date:  2015-04-21       Impact factor: 5.753

4.  De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.

Authors:  Jacqueline D Farrell; Stephen Byrne; Cristiana Paina; Torben Asp
Journal:  PLoS One       Date:  2014-08-15       Impact factor: 3.240

5.  Comparative transcriptome analysis within the Lolium/Festuca species complex reveals high sequence conservation.

Authors:  Adrian Czaban; Sapna Sharma; Stephen L Byrne; Manuel Spannagl; Klaus F X Mayer; Torben Asp
Journal:  BMC Genomics       Date:  2015-03-28       Impact factor: 3.969

6.  Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants.

Authors:  Zhen Li; Amanda R De La Torre; Lieven Sterck; Francisco M Cánovas; Concepción Avila; Irene Merino; José Antonio Cabezas; María Teresa Cervera; Pär K Ingvarsson; Yves Van de Peer
Journal:  Genome Biol Evol       Date:  2017-05-01       Impact factor: 3.416

7.  Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.

Authors:  Alix Armero; Luc Baudouin; Stéphanie Bocs; Dominique This
Journal:  PLoS One       Date:  2017-03-23       Impact factor: 3.240

8.  Overcoming challenges in variant calling: exploring sequence diversity in candidate genes for plant development in perennial ryegrass (Lolium perenne).

Authors:  Elisabeth Veeckman; Sabine Van Glabeke; Annelies Haegeman; Hilde Muylle; Frederik R D van Parijs; Stephen L Byrne; Torben Asp; Bruno Studer; Antje Rohde; Isabel Roldán-Ruiz; Klaas Vandepoele; Tom Ruttink
Journal:  DNA Res       Date:  2019-02-01       Impact factor: 4.458

9.  Genetic-geographic correlation revealed across a broad European ecotypic sample of perennial ryegrass (Lolium perenne) using array-based SNP genotyping.

Authors:  T Blackmore; I Thomas; R McMahon; W Powell; M Hegarty
Journal:  Theor Appl Genet       Date:  2015-06-21       Impact factor: 5.699

10.  In Silico Identification of Candidate Genes for Fertility Restoration in Cytoplasmic Male Sterile Perennial Ryegrass (Lolium perenne L.).

Authors:  Timothy Sykes; Steven Yates; Istvan Nagy; Torben Asp; Ian Small; Bruno Studer
Journal:  Genome Biol Evol       Date:  2017-02-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.