| Literature DB >> 25625010 |
Naim Matasci1, Ling-Hong Hung2, Zhixiang Yan3, Eric J Carpenter4, Norman J Wickett5, Siavash Mirarab6, Nam Nguyen6, Tandy Warnow6, Saravanaraj Ayyampalayam7, Michael Barker8, J Gordon Burleigh9, Matthew A Gitzendanner9, Eric Wafula10, Joshua P Der10, Claude W dePamphilis10, Béatrice Roure11, Hervé Philippe12, Brad R Ruhfel13, Nicholas W Miles14, Sean W Graham15, Sarah Mathews16, Barbara Surek17, Michael Melkonian17, Douglas E Soltis18, Pamela S Soltis18, Carl Rothfels19, Lisa Pokorny20, Jonathan A Shaw21, Lisa DeGironimo22, Dennis W Stevenson22, Juan Carlos Villarreal23, Tao Chen24, Toni M Kutchan25, Megan Rolf25, Regina S Baucom26, Michael K Deyholos4, Ram Samudrala2, Zhijian Tian3, Xiaolei Wu3, Xiao Sun3, Yong Zhang3, Jun Wang3, Jim Leebens-Mack7, Gane Ka-Shu Wong27.
Abstract
The 1,000 plants (1KP) project is an international multi-disciplinary consortium that has generated transcriptome data from over 1,000 plant species, with exemplars for all of the major lineages across the Viridiplantae (green plants) clade. Here, we describe how to access the data used in a phylogenomics analysis of the first 85 species, and how to visualize our gene and species trees. Users can develop computational pipelines to analyse these data, in conjunction with data of their own that they can upload. Computationally estimated protein-protein interactions and biochemical pathways can be visualized at another site. Finally, we comment on our future plans and how they fit within this scalable system for the dissemination, visualization, and analysis of large multi-species data sets.Entities:
Keywords: Biodiversity; Interactions; Pathways; Phylogenomics; Transcriptomes; Viridiplantae
Year: 2014 PMID: 25625010 PMCID: PMC4306014 DOI: 10.1186/2047-217X-3-17
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Data sources for phylogenomics analyses
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| genome | n/a | n/a | |
| meta-assembly | PlantGDB | AQUI | |
| meta-assembly | SRX022306, SRX022215 | CYCA | |
| meta-assembly | PRJNA46857 | LIRI | |
| meta-assembly | PRJNA46857 | PERS | |
| meta-assembly | PRJNA79733 | PINU | |
| meta-assembly | PRJNA48473 | PTER | |
| meta-assembly | PRJNA46857 | ZAMI | |
| OneKP meta-assembly | ERR364395, PRJNA46857 | ACOR | |
| OneKP meta-assembly | ERR364329, PRJNA46857 | AMBO | |
| OneKP meta-assembly | ERR364390, PRJNA79951, PRJNA236160 | CATH | |
| OneKP meta-assembly | ERR364338, ERR364335, ERR364336, ERR364337, ERR364334, SRX002988, SRX002987, PlantGDB | ESCH | |
| OneKP meta-assembly | ERR364401, PlantGDB | GINK | |
| OneKP meta-assembly | ERR364330, PRJNA46857 | NUPH | |
| OneKP meta-assembly | ERR364410, SRX666586 | OPHI | |
| OneKP meta-assembly | ERR364383, PRJNA46857 | SARU | |
| OneKP meta-assembly | ERR364404, PRJNA46857 | WELW | |
| OneKP | ERR364389 | MGVU | |
| OneKP | ERR364409 | NHCM | |
| OneKP | ERR364349 | QMWB | |
| OneKP | ERR364415 | WZYK | |
| OneKP | ERR364387 | ACFP | |
| OneKP | ERR364348 | JMXW | |
| OneKP | ERR364342 | GGEA | |
| OneKP | ERR364350 | FFPD | |
| OneKP | ERR364369 | DRGY | |
| OneKP | ERR364366 | CHAR | |
| OneKP | ERR364371 | AZZW | |
| OneKP | ERR364397 | NHIX | |
| OneKP | ERR364367 | QPDY | |
| OneKP | ERR364368 | VQBJ | |
| OneKP | ERR364376 | STKJ | |
| OneKP | ERR364340 | OUOI | |
| OneKP | ERR364412 | GANB | |
| OneKP | ERR364405 | XZUY | |
| OneKP | ERR364378 | YOXI | |
| OneKP | ERR364373 | JOJQ | |
| OneKP | ERR364346 | XNXF | |
| OneKP | ERR364396 | OCWZ | |
| OneKP | ERR364339 | KVFU | |
| OneKP | ERR364372 | BFIK | |
| OneKP | ERR364402 | VDAO | |
| OneKP | ERR364408 | CAPN | |
| OneKP | ERR364403 | GTHK | |
| OneKP | ERR364352 | YWNF | |
| OneKP | ERR364388 | OLXF | |
| OneKP | ERR364332 | CSSK | |
| OneKP | ERR364407 | GAON | |
| OneKP | ERR364393 | AFQQ | |
| OneKP | ERR364392 | VXKB | |
| OneKP | ERR364341 | XMGP | |
| OneKP | ERR364331 | NWMY | |
| OneKP | ERR364370 | FQLP | |
| OneKP | ERR364385 | WGET | |
| OneKP | ERR364386 | UDUT | |
| OneKP | ERR364353 | ZACW | |
| OneKP | ERR364417 | TFYI | |
| OneKP | ERR364416 | JPYU | |
| OneKP | ERR364365 | KYIO | |
| OneKP | ERR364377 | WDCW | |
| OneKP | ERR364359 | NRWZ | |
| OneKP | ERR364362 | BTFM | |
| OneKP | ERR364374 | ZRMT | |
| OneKP | ERR364363 | ISIM | |
| OneKP | ERR364379 | FFGR | |
| OneKP | ERR364356 | DXOU | |
| OneKP | ERR364357 | TCBC | |
| OneKP | ERR364382 | AEKF | |
| OneKP | ERR364384 | WFBF | |
| OneKP | ERR364413 | SZYG | |
| OneKP | ERR364343 | EGLZ | |
| OneKP | ERR364345 | UPMJ | |
| OneKP | ERR364411 | QVMR | |
| OneKP | ERR364361 | TNAW | |
| OneKP | ERR364355 | JADL | |
| OneKP | ERR364358 | WJLO | |
| OneKP | ERR364391 | FDMM | |
| OneKP | ERR364351 | XWHK | |
| OneKP | ERR364380 | XRTZ | |
| OneKP | ERR364400 | HWUP | |
| OneKP | ERR364333 | OSHQ | |
| OneKP | ERR364344 | YFZK | |
| OneKP | ERR364347 | ZZOL | |
| OneKP | ERR364398 | MWYQ | |
| OneKP | ERR364360 | HERT | |
| OneKP | ERR364414 | GOWD | |
| OneKP | ERR364375 | HAOX | |
| OneKP | ERR364381 | NNHQ | |
| OneKP | ERR364394 | DUQG | |
| OneKP | ERR364406 | WWSS | |
| OneKP | ERR364354 | EEMJ | |
| OneKP | ERR364364 | ISGT | |
| OneKP | ERR364399 | ICNN |
Meta-assembly refers to a transcriptome assembled from more than one sequenced sample. Some of these were a combination of 1KP and other data; some were entirely non-1KP. Accession numbers (SRA or otherwise) are given for all of the transcriptomes that we used.
Number and size of data files on websites
| 68,253 | 0.0 | 0.3 | 481.1 | 23,116.6 | | onekp_pilot |
| 48,053 | 0.0 | 0.3 | 481.1 | 14,956.7 | | onekp_pilot/orthogroups |
| 19,220 | 0.1 | 0.7 | 243.8 | 13,276.5 | | onekp_pilot/orthogroups/alignments |
| 9,610 | 0.1 | 0.3 | 79.8 | 3,289.6 | | onekp_pilot/orthogroups/alignments/FAA |
| 9,610 | 0.2 | 1.0 | 243.8 | 9,986.9 | | onekp_pilot/orthogroups/alignments/FNA |
| 28,833 | 0.0 | 0.1 | 481.1 | 1,680.2 | | onekp_pilot/orthogroups/gene_trees |
| 9,611 | 0.0 | 0.1 | 481.1 | 583.3 | | onekp_pilot/orthogroups/gene_trees/FAA |
| 9,610 | 0.0 | 0.0 | 0.5 | 102.2 | | onekp_pilot/orthogroups/gene_trees/FAA/trees |
| 19,222 | 0.0 | 0.1 | 458.0 | 1,096.8 | | onekp_pilot/orthogroups/gene_trees/FNA |
| 9,611 | 0.0 | 0.1 | 458.0 | 556.6 | | onekp_pilot/orthogroups/gene_trees/FNA/12_codon |
| 9,610 | 0.0 | 0.0 | 0.5 | 98.5 | | onekp_pilot/orthogroups/gene_trees/FNA/12_codon/trees |
| 9,611 | 0.0 | 0.1 | 439.1 | 540.3 | | onekp_pilot/orthogroups/gene_trees/FNA/all_codon |
| 9,610 | 0.0 | 0.0 | 0.5 | 101.2 | | onekp_pilot/orthogroups/gene_trees/FNA/all_codon/dna_tree |
| 19,919 | 0.0 | 0.2 | 175.2 | 3,468.8 | | onekp_pilot/phylogenetic_analysis |
| 2,556 | 0.1 | 0.1 | 1.0 | 292.7 | | onekp_pilot/phylogenetic_analysis/alignments |
| 852 | 0.0 | 0.0 | 0.3 | 41.8 | | onekp_pilot/phylogenetic_analysis/alignments/FAA |
| 852 | 0.1 | 0.1 | 1.0 | 125.5 | | onekp_pilot/phylogenetic_analysis/alignments/FNA |
| 852 | 0.1 | 0.1 | 0.9 | 125.4 | | onekp_pilot/phylogenetic_analysis/alignments/FNA2AA |
| 17,197 | 0.0 | 0.1 | 0.4 | 1,827.3 | | onekp_pilot/phylogenetic_analysis/gene_trees |
| 1,704 | 0.0 | 0.1 | 0.4 | 238.3 | | onekp_pilot/phylogenetic_analysis/gene_trees/FAA |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/FAA/raxmlboot.#### |
| 1,704 | 0.0 | 0.1 | 0.4 | 238.3 | | onekp_pilot/phylogenetic_analysis/gene_trees/FNA |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/FNA/raxmlboot.#### |
| 3,408 | 0.0 | 0.1 | 0.4 | 476.7 | | onekp_pilot/phylogenetic_analysis/gene_trees/FNA2AA |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/FNA2AA/raxmlboot.#### |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/FNA2AA/raxmlboot.####.c1c2 |
| 10,381 | 0.0 | 0.1 | 0.4 | 874.0 | | onekp_pilot/phylogenetic_analysis/gene_trees/filtered |
| 2,548 | 0.0 | 0.1 | 0.4 | 169.3 | | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FAA |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FAA/raxmlboot.####.f25 |
| 1 | 0.2 | 0.1 | 0.4 | 0.2 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FAA/raxmlboot.####.filterlen33 |
| 852 | 0.0 | 0.0 | 0.0 | 3.8 | | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA/raxmlboot.####.f25 |
| 6,980 | 0.0 | 0.1 | 0.4 | 700.9 | | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.GAMMA.2 |
| 2 | 0.3 | 0.1 | 0.4 | 0.3 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.c1c2.GAMMA.2 |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.c1c2.f25 |
| 1 | 0.0 | 0.0 | 0.0 | 0.0 | 852 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.f25 |
| 2 | 0.2 | 0.1 | 0.4 | 0.2 | 844 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.filterlen33 |
| 1 | 0.3 | 0.3 | 0.4 | 0.3 | 180 | onekp_pilot/phylogenetic_analysis/gene_trees/filtered/FNA2AA/raxmlboot.####.filtered25.GAMMA.2 |
| 166 | 0.0 | 8.1 | 175.2 | 1,348.8 | | onekp_pilot/phylogenetic_analysis/species_level |
| 50 | 15.0 | 27.0 | 175.2 | 1,348.1 | | onekp_pilot/phylogenetic_analysis/species_level/alignments |
| 15 | 14.7 | 14.3 | 58.3 | 214.2 | | onekp_pilot/phylogenetic_analysis/species_level/alignments/FAA |
| 35 | 29.4 | 32.4 | 175.2 | 1,133.9 | | onekp_pilot/phylogenetic_analysis/species_level/alignments/FNA |
| 116 | 0.0 | 0.0 | 0.0 | 0.6 | | onekp_pilot/phylogenetic_analysis/species_level/trees |
| 276 | 10.0 | 17.0 | 157.4 | 4,691.1 | | onekp_pilot/taxa |
| 3 | 9.7 | 17.0 | 157.4 | 51.0 | 92 | onekp_pilot/taxa/####-############ |
| 1 | 30.8 | 17.0 | 157.4 | 36.0 | 92 | onekp_pilot/taxa/####-############/assemblies |
| 2 | 9.7 | 7.5 | 45.2 | 15.0 | 92 | onekp_pilot/taxa/####-############/translations |
| 5 | 0.0 | 0.0 | 0.0 | 0.1 | | onekp_pilot/tools |
| 178 | 1,915.0 | 2,045.5 | 3,371.0 | 364,100.0 | | total of all short reads -- uncompressed, but downloads are compressed to a quarter of these sizes |
| 2 | 1,915.0 | 2,045.5 | 3,371.0 | 4,091.0 | 89 | expecting per sample -- uncompressed, but downloads are compressed to a quarter of these sizes |
In some instances, users will find many directories with similar names, as indicated in this table by hash (#) marks. The total number of directories is given in the preceding column.
Figure 1iPlant DE data window.
Figure 2Bulk download window if Java is disabled. Click on the circled link to access the instructions to install and configure an iDrop desktop.
Figure 3Realigning a group of sequences using
Figure 4display of 1KP species phylogeny.
Figure 5Phenylpropanoid synthesis pathway for Labelled rectangles are proteins. Small circles are metabolites. Black lines show the KEGG pathway. Red lines show the BioGRID interactions emanating from protein (K12355), which was interactively selected. A right-click on the protein will display the inferred function and a link to the sequence(s).