Adam Frankish, Mark Diekhans, Irwin Jungreis, Julien Lagarde, Jane E Loveland, Jonathan M Mudge, Cristina Sisu, James C Wright, Joel Armstrong, If Barnes, Andrew Berry, Alexandra Bignell, Carles Boix, Silvia Carbonell Sala, Fiona Cunningham, Tomás Di Domenico, Sarah Donaldson, Ian T Fiddes, Carlos García Girón, Jose Manuel Gonzalez, Tiago Grego, Matthew Hardy, Thibaut Hourlier, Kevin L Howe, Toby Hunt, Osagie G Izuogu, Rory Johnson, Fergal J Martin, Laura Martínez, Shamika Mohanan, Paul Muir, Fabio C P Navarro, Anne Parker, Baikang Pei, Fernando Pozo, Ferriol Calvet Riera, Magali Ruffier, Bianca M Schmitt, Eloise Stapleton, Marie-Marthe Suner, Irina Sycheva, Barbara Uszczynska-Ratajczak, Maxim Y Wolf, Jinuri Xu, Yucheng T Yang, Andrew Yates, Daniel Zerbino, Yan Zhang, Jyoti S Choudhary, Mark Gerstein, Roderic Guigó, Tim J P Hubbard, Manolis Kellis, Benedict Paten, Michael L Tress, Paul Flicek.
Abstract
The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.
Year: 2021 PMID: 33270111 PMCID: PMC7778937 DOI: 10.1093/nar/gkaa1087
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971