Lucas Costa1, André Marques2, Chris Buddenhagen3, William Wayt Thomas4, Bruno Huettel5, Veit Schubert6, Steven Dodsworth7, Andreas Houben6, Gustavo Souza1, Andrea Pedrosa-Harand1. 1. Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Recife-PE, Brazil. 2. Max Planck Institute for Plant Breeding Research, Cologne, Germany. 3. AgResearch, Plant Functional Biology, Ruakura, New Zealand. 4. New York Botanical Garden, Bronx, New York, NY, USA. 5. Max Planck Genome Centre Cologne, Max Planck Institute for Plant Breeding Research, Cologne, Germany. 6. Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany. 7. School of Life Sciences, University of Bedfordshire, Luton, UK.
Abstract
BACKGROUND AND AIMS: With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data. METHODS: Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree. KEY RESULTS: All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. CONCLUSIONS: Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
BACKGROUND AND AIMS: With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data. METHODS: Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree. KEY RESULTS: All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. CONCLUSIONS: Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.
Authors: Roswitha Schmickl; Aaron Liston; Vojtěch Zeisek; Kenneth Oberlander; Kevin Weitemier; Shannon C K Straub; Richard C Cronn; Léanne L Dreyer; Jan Suda Journal: Mol Ecol Resour Date: 2015-12-15 Impact factor: 7.090
Authors: Thomas J Albert; Michael N Molla; Donna M Muzny; Lynne Nazareth; David Wheeler; Xingzhi Song; Todd A Richmond; Chris M Middle; Matthew J Rodesch; Charles J Packard; George M Weinstock; Richard A Gibbs Journal: Nat Methods Date: 2007-10-14 Impact factor: 28.547
Authors: Nadezhda L Bolsheva; Nataliya V Melnikova; Ilya V Kirov; Alexey A Dmitriev; George S Krasnov; Аlexandra V Amosova; Tatiana E Samatadze; Olga Yu Yurkevich; Svyatoslav A Zoshchuk; Anna V Kudryavtseva; Olga V Muravenko Journal: BMC Evol Biol Date: 2019-02-26 Impact factor: 3.260
Authors: Steven Dodsworth; Mark W Chase; Laura J Kelly; Ilia J Leitch; Jiří Macas; Petr Novák; Mathieu Piednoël; Hanna Weiss-Schneeweiss; Andrew R Leitch Journal: Syst Biol Date: 2014-09-25 Impact factor: 15.683
Authors: Maïté S Guignard; Richard A Nichols; Robert J Knell; Andy Macdonald; Catalina-Andreea Romila; Mark Trimmer; Ilia J Leitch; Andrew R Leitch Journal: New Phytol Date: 2016-02-15 Impact factor: 10.151
Authors: Jiří Macas; Petr Novák; Jaume Pellicer; Jana Čížková; Andrea Koblížková; Pavel Neumann; Iva Fuková; Jaroslav Doležel; Laura J Kelly; Ilia J Leitch Journal: PLoS One Date: 2015-11-25 Impact factor: 3.240
Authors: Tobias Andermann; Maria Fernanda Torres Jiménez; Pável Matos-Maraví; Romina Batista; José L Blanco-Pastor; A Lovisa S Gustafsson; Logan Kistler; Isabel M Liberal; Bengt Oxelman; Christine D Bacon; Alexandre Antonelli Journal: Front Genet Date: 2020-02-21 Impact factor: 4.599
Authors: Luise Schulte; Stefano Meucci; Kathleen R Stoof-Leichsenring; Tony Heitkam; Nicola Schmidt; Barbara von Hippel; Andrei A Andreev; Bernhard Diekmann; Boris K Biskaborn; Bernd Wagner; Martin Melles; Lyudmila A Pestryakova; Inger G Alsos; Charlotte Clarke; Konstantin V Krutovsky; Ulrike Herzschuh Journal: Commun Biol Date: 2022-06-09