| Literature DB >> 30678108 |
Keyur Talsania1, Monika Mehta2, Castle Raley3, Yuliya Kriga4, Sujatha Gowda5, Carissa Grose6, Matthew Drew7, Veronica Roberts8, Kwong Tai Cheng9, Sandra Burkett10, Steffen Oeser11, Robert Stephens12, Daniel Soppet13, Xiongfeng Chen14, Parimal Kumar15, Oksana German16, Tatyana Smirnova17, Christopher Hautman18, Jyoti Shetty19, Bao Tran20, Yongmei Zhao21, Dominic Esposito22.
Abstract
BACKGROUND: Trichoplusia ni derived cell lines are commonly used to enable recombinant protein expression via baculovirus infection to generate materials approved for clinical use and in clinical trials. In order to develop systems biology and genome engineering tools to improve protein expression in this host, we performed de novo genome assembly of the Trichoplusia ni-derived cell line Tni-FNL.Entities:
Keywords: PacBio single molecule real-time sequencing; Tricoplusia ni; de novo assembly; insect genome; next generation sequencing; optical mapping
Mesh:
Substances:
Year: 2019 PMID: 30678108 PMCID: PMC6409714 DOI: 10.3390/genes10020079
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Comparison of Tni-FNL de novo assembly statistics.
| Types | HGAP Contigs | Canu Contigs | Supernova Contigs |
|---|---|---|---|
| Total contigs | 1428 | 2101 | 18,923 |
| Contigs (≥1000 bp) | 1418 | 2101 | 18,196 |
| Contigs (≥10,000 bp) | 1323 | 2097 | 6467 |
| Contigs (≥25,000 bp) | 1101 | 1780 | 3592 |
| Contigs (≥50,000 bp) | 706 | 1041 | 1828 |
| Largest contig (bp) | 4,352,893 | 6,104,320 | 445,812 |
| Total length (bp) | 366,261,337 | 408,408,011 | 316,721,011 |
| GC (%) | 35.66 | 35.74 | 35.33 |
| N50 (bp) | 939,843 | 737,233 | 54,240 |
| N75 (bp) | 421,565 | 250,244 | 22,505 |
| L50 (bp) | 115 | 158 | 1653 |
| L75 (bp) | 259 | 399 | 3894 |
Figure 1Genome assembly and optical maps hybrid scaffold workflow. The workflow steps for de novo assembly, hybrid scaffold, genome assembly error correction and polishing.
Comparison of Tni-FNL genome hybrid scaffold statistics.
| Types | Supernova Scaffolds | Hybrid Scaffolds (V1) | Polished Hybrid Scaffolds (V2) |
|---|---|---|---|
| Total scaffolds | 12,875 | 301 | 280 |
| Scaffolds (≥1000 bp) | 12,875 | 301 | 280 |
| Scaffolds (≥50,000 bp) | 355 | 301 | 280 |
| Largest scaffold (bp) | 7,830,761 | 10,389,188 | 12,760,714 |
| Total scaffold length (bp) | 375,813,451 | 328,208,105 | 359,075,955 |
| GC (%) | 35.33 | 35.5 | 35.56 |
| N50 (bp) | 1,628,260 | 1,737,254 | 2,326,860 |
| N75 (bp) | 449,524 | 1,019,983 | 1,198,934 |
| L50 (bp) | 66 | 57 | 44 |
| L75 (bp) | 172 | 120 | 98 |
| # N’s per 100 kb | 15,724 | 2919 | 5453 |
Comparison of Tni-FNL genome assembly with other lepidopteran genome assemblies.
| Types | Tni-FNL |
|
|
|---|---|---|---|
| Total length (bp) | 359,075,955 | 481,803,763 | 358,050,723 |
| Total scaffolds | 280 | 43,462 | 37,243 |
| Ungapped length (bp) | 339,494,557 | 431,707,935 | 332,569,779 |
| Scaffold N50 (bp) | 2,326,860 | 4,008,358 | 53,779 |
| Total contigs | 2,043 | 88,672 | >49,244 |
| Contig N50 (bp) | 893,993 | 15,508 | 7,851 |
| Largest Scaffold (bp) | 12,760,714 | 14,496,184 | 641,448 |
| Largest Contig (bp) | 6,104,547 | 139,031 | 234,570 |
| GC (%) | 35.56 | 37.70 | 32.97 |
| Gap size (bp) | 19,581,398 | 50,095,828 | 25,480,944 |
Figure 2Gene ontology classification of the genes predicted. Gene Ontology (GO) classification of the predicted genes. Only the most abundance ones are displayed. (a) biological processes, (b) cellular components, (c) molecule functions.
Figure 3Functional annotation of the Tni-FNL (Trichoplusia ni), B. mori and D. melanogaster results comparison. The circos plot describes the shared cellular components, molecular functions and biological processes among the three species.
Figure 4BUSCO assessment results of orthologs among 10 species. Colors refer to the percentage of the complete single-copy orthologs (blue), complete duplicated orthologs (green), fragmented or incomplete orthologs (orange), and missing orthologs (red).
Figure 5Phylogenetic analysis of Trichoplusia ni and closely related insect genomes. Phylogenetic analyses from 10 species including 6 lepidopterans. It depicts the relationship of Trichoplusia ni with the other nine insects. Maximum Likelihood tree based on a genome-wide one-to-one orthologs from 10 species. The scale bar denotes substitutions per site.