| Literature DB >> 30611188 |
Zhong-Tao Yin1, Feng Zhu1, Fang-Bin Lin1, Ting Jia2, Zhen Wang1, Dong-Ting Sun2, Guang-Shen Li1, Cheng-Lin Zhang2, Jacqueline Smith3, Ning Yang1, Zhuo-Cheng Hou4.
Abstract
BACKGROUND: Argument remains as to whether birds have lost genes compared with mammals and non-avian vertebrates during speciation. High quality-reference gene sets are necessary for precisely evaluating gene gain and loss. It is essential to explore new reference transcripts from large-scale de novo assembled transcriptomes to recover the potential hidden genes in avian genomes.Entities:
Keywords: Avian genome; Evolution; Missing gene; de novo assembly
Mesh:
Year: 2019 PMID: 30611188 PMCID: PMC6321700 DOI: 10.1186/s12864-018-5407-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Analysis pipeline showing how to get high quality transcriptome datasets and identify “missing genes”
Summary of RNA-Seq samples and de novo assembly statistics
| Species | Total Tissue Numbers | Total Clean Reads(M) | Assembled Transcripts Numbers | Assembled Transcripts N50 (bp) |
|---|---|---|---|---|
| Chicken | 26 | 7353 | 2,048,631 | 596 |
| Duck | 24 | 2282 | 2,012,592 | 656 |
| Pigeon | 11 | 904 | 1,491,614 | 1533 |
| Goose | 8 | 708 | 1,264,301 | 1004 |
| Zebra finch | 22 | 1372 | 2,479,109 | 965 |
| Total |
|
|
|
Total clean reads (M): millions of paired-end reads for each tissue
Fig. 2Venn diagram of recovered ‘missing’ genes in each species. High-confidence recovered ‘missing’ genes from five species (chicken, duck, goose, pigeon and zebra finch). Most of the ‘missing’ genes were recovered from all five species
Fig. 3GC-content and GC-repeats in high-confidence genes. a GC content of high-confidence genes in five bird species (chicken, duck, pigeon, goose, zebra finch). This figure shows GC content distribution of missing genes in five bird species and representatives of non-avian animals (human, mouse and anole lizard). b This figure shows the relationship of GC content and GC repeats in the five birds
Fig. 4Expression pattern of high-confidence genes and annotated genes. a Tissue-specific expression index (TSI) of high-confidence genes in chicken. TSI of high-confidence genes is significantly higher than genome background. b The percentage of genes that expressed most highly in each tissue and the percentage of expressed genes in each tissue both in high-confidence genes and annotated genes. In all chicken tissues, the percentage of expressed genes in high-confidence genes is significantly lower than in annotated genes
Fig. 5The relationship between GC percentage and Ka/Ks in high-confidence genes and genome-wide genes. a Ka/Ks comparisons of high-confidence genes and genome-wide genes. b This figure shows a higher dispersion pattern as the GC-content increases while average Ka/Ks values decrease as the GC-content increases