Boas Pucker1,2, Nils Kleinbölting3, Bernd Weisshaar4. 1. Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec), Bielefeld University, Sequenz 1, 33615, Bielefeld, Germany. 2. Evolution and Diversity, Department of Plant Sciences, University of Cambridge, Cambridge, UK. 3. Bioinformatics Resource Facility, Center for Biotechnology (CeBiTec, Bielefeld University, Sequenz 1, 33615, Bielefeld, Germany. 4. Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec), Bielefeld University, Sequenz 1, 33615, Bielefeld, Germany. bernd.weisshaar@uni-bielefeld.de.
Abstract
BACKGROUND: Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades. RESULTS: We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days. CONCLUSIONS: Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations.
BACKGROUND: Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades. RESULTS: We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days. CONCLUSIONS: Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations.
Authors: Friedrich Fauser; Nadine Roth; Michael Pacher; Gabriele Ilg; Rocío Sánchez-Fernández; Christian Biesgen; Holger Puchta Journal: Proc Natl Acad Sci U S A Date: 2012-04-23 Impact factor: 11.205
Authors: Bekir Ulker; Edgar Peiter; David P Dixon; Caroline Moffat; Richard Capper; Nicolas Bouché; Robert Edwards; Dale Sanders; Heather Knight; Marc R Knight Journal: Plant J Date: 2008-07-04 Impact factor: 6.417
Authors: Maartje van Kregten; Sylvia de Pater; Ron Romeijn; Robin van Schendel; Paul J J Hooykaas; Marcel Tijsterman Journal: Nat Plants Date: 2016-10-31 Impact factor: 15.793
Authors: Florian Jupe; Angeline C Rivkin; Todd P Michael; Mark Zander; S Timothy Motley; Justin P Sandoval; R Keith Slotkin; Huaming Chen; Rosa Castanon; Joseph R Nery; Joseph R Ecker Journal: PLoS Genet Date: 2019-01-18 Impact factor: 5.917
Authors: Lejon E M Kralemann; Sylvia de Pater; Hexi Shen; Susan L Kloet; Robin van Schendel; Paul J J Hooykaas; Marcel Tijsterman Journal: Nat Plants Date: 2022-05-09 Impact factor: 17.352
Authors: Natalya V Permyakova; Tatyana V Marenkova; Pavel A Belavin; Alla A Zagorskaya; Yuriy V Sidorchuk; Elena V Deineko Journal: Int J Mol Sci Date: 2022-08-03 Impact factor: 6.208
Authors: Laura S Lopez; Carsten Völkner; Philip M Day; Chance M Lewis; Chase L Lewis; Dominik Schneider; Viviana Correa Galvis; Jeffrey A Cruz; Ute Armbruster; David M Kramer; Hans-Henning Kunz Journal: Plant Direct Date: 2022-07-20