| Literature DB >> 28185558 |
Geneviève C Vallée1, Daniella Santos Muñoz1, David Sankoff2.
Abstract
BACKGROUND: Of the approximately two hundred sequenced plant genomes, how many and which ones were sequenced motivated by strictly or largely scientific considerations, and how many by chiefly economic, in a wide sense, incentives? And how large a role does publication opportunity play?Entities:
Keywords: Crop plants; Genome sequencing; Model organisms
Mesh:
Year: 2016 PMID: 28185558 PMCID: PMC5123250 DOI: 10.1186/s12864-016-3100-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fragment of data on species, family, order and year sequenced
| Species | Common Name | Family | Order | Year |
|---|---|---|---|---|
| ⋮ | ||||
| Azadirachta indica | Neem | Meliaceae | Sapindales | 2012 |
| Beta vulgaris | Sugar Beet | Amaranthaceae | Caryophyllales | 2014 |
| Betula nana | Alpine Birch | Betulaceae | Fagales | 2013 |
| Brachypodium distachyon | Brachypodium | Poaceae | Poales | 2010 |
| Brassica napus | Rape | Brassicaceae | Brassicales | 2003 |
| Brassica oleracea | Cabbage/Cauliflower | Brassicaceae | Brassicales | 2011 |
| Brassica rapa | Field Mustard | Brassicaceae | Brassicales | 2011 |
| Cajanus cajan | Pigeon Pea | Fabaceae | Fabales | 2011 |
| Camelina sativa | False Flax | Brassicaceae | Brassicales | 2013 |
| Cannabis sativa | Hemp | Cannabaceae | Rosales | 2011 |
| Capsella rubella | Caspella | Brassicaceae | Brassicales | 2013 |
| Capsicum annuum | Cayenne Pepper | Solanaceae | Solanales | 2014 |
| Carica papaya | Papaya | Caricaceae | Brassicales | 2008 |
| Carthamus tinctorius | Safflower | Asteraceae | Asterales | 2016 |
| Castanea mollissima | Chinese Chestnut | Fagaceae | Fagales | 2011 |
| Catharanthus roseus | Madagascar Periwinkle | Apocynaceae | Gentianales | 2013 |
| ⋮ | ||||
Data set on families, including species abundance, economic value, and number of sequenced genomes
| Total value | Total | |||||||
|---|---|---|---|---|---|---|---|---|
| Family | Species | (Million $) | Seqs. | Family | Species | Value | Seqs. | |
| Poaceae | 11,554 | 963,585 | 31 | Ericaceae | 3,554 | 1,371 | 1 | |
| Solanaceae | 2,678 | 280,810 | 14 | Grossulariaceae | 195 | 1,247 | 0 | |
| Fabaceae | 24,505 | 214,599 | 15 | Linaceae | 213 | 848 | 1 | |
| Rosaceae | 4,828 | 158,890 | 10 | Actinidiaceae | 176 | 788 | 1 | |
| Malvaceae | 4,465 | 112,394 | 3 | Polygonaceae | 1,384 | 693 | 0 | |
| Cucurbitaceae | 965 | 102,053 | 4 | Aquifoliaceae | 480 | 690 | 0 | |
| Arecaceae | 2,522 | 89,828 | 3 | Cannabaceae | 102 | 528 | 2 | |
| Brassicaceae | 4,060 | 79,650 | 19 | Salicaceae | 1,269 | 372 | 2 | |
| Euphorbiaceae | 6,547 | 69,650 | 4 | Canellaceae | 21 | 344 | 0 | |
| Vitaceae | 985 | 68,942 | 3 | Sapotaceae | 1,343 | 221 | 0 | |
| Rutaceae | 1,730 | 64,431 | 2 | Papaveraceae | 920 | 132 | 0 | |
| Amaryllidaceae | 2,258 | 63,376 | 0 | Myrtaceae | 5,970 | 111 | 2 | |
| Anacardiaceae | 701 | 45,283 | 0 | Urticaceae | 1,465 | 99 | 0 | |
| Musaceae | 78 | 44,859 | 3 | Lecythidaceae | 341 | 67 | 0 | |
| Asteraceae | 23,600 | 37,734 | 4 | Orchidaceae | 27,801 | 9 | 2 | |
| Convolvulaceae | 1,296 | 26,797 | 1 | Lamiaceae | 7,886 | 0 | 1 | |
| Amaranthaceae | 2,052 | 25,548 | 4 | Apocynaceae | 5,556 | 0 | 1 | |
| Dioscoreaceae | 653 | 20,858 | 0 | Araceae | 3,368 | 0 | 1 | |
| Oleaceae | 688 | 19,467 | 1 | Gesneriaceae | 3,122 | 0 | 1 | |
| Pinaceae | 255 | 19,268 | 5 | Primulaceae | 2,788 | 0 | 2 | |
| Rubiaceae | 13,673 | 16,060 | 1 | Caryophyllaceae | 2,456 | 0 | 2 | |
| Juglandaceae | 89 | 15,650 | 1 | Plantaginaceae | 1,614 | 0 | 6 | |
| Theaceae | 370 | 12,871 | 0 | Moraceae | 1,217 | 0 | 1 | |
| Bromeliaceae | 2,929 | 11,618 | 1 | Thymelaeaceae | 938 | 0 | 1 | |
| Asparagaceae | 200 | 11,453 | 0 | Rhamnaceae | 839 | 0 | 1 | |
| Apiaceae | 3,257 | 8,666 | 1 | Meliaceae | 669 | 0 | 1 | |
| Fagaceae | 1,101 | 7,805 | 2 | Capparaceae | 449 | 0 | 1 | |
| Pedaliaceae | 67 | 4,642 | 1 | Lentibulariaceae | 312 | 0 | 2 | |
| Caricaceae | 47 | 4,054 | 1 | Phrymaceae | 199 | 0 | 1 | |
| Ebenaceae | 751 | 2,811 | 1 | Zosteraceae | 23 | 0 | 1 | |
| Betulaceae | 234 | 2,667 | 1 | Nelumbonaceae | 2 | 0 | 1 | |
| Piperaceae | 2,658 | 2,478 | 0 | Amborellaceae | 1 | 0 | 1 | |
| Zingiberaceae | 1,587 | 2,430 | 0 |
Data set on order, including species abundance, economic value, and number of sequenced genomes
| Total value | Total | |||||||
|---|---|---|---|---|---|---|---|---|
| Order | Species | (Million $) | Seqs. | Order | Species | Value | Seqs. | |
| Poales | 18,000 | 975,203 | 32 | Lamiales | 24,000 | 24,109 | 13 | |
| Solanales | 4,080 | 307,607 | 15 | Dioscoreales | 1,040 | 20,858 | 0 | |
| Fabales | 25,794 | 214,599 | 17 | Pinales | 550 | 19,268 | 5 | |
| Rosales | 7,700 | 159,517 | 14 | Ericales | 8,000 | 18,128 | 5 | |
| Malvales | 6,000 | 112,394 | 5 | Gentianales | 17,000 | 16,060 | 2 | |
| Sapindales | 5,700 | 109,714 | 3 | Apiales | 5,489 | 8,666 | 1 | |
| Cucurbitales | 2,600 | 102,053 | 4 | Alismatales | 4,500 | 4,408 | 2 | |
| Arecales | 2,600 | 89,828 | 3 | Piperales | 4,090 | 2,478 | 0 | |
| Brassicales | 4,450 | 89,705 | 21 | Saxifragales | 2,500 | 1,247 | 0 | |
| Asparagales | 26,000 | 74,838 | 2 | Aquifoliales | 536 | 690 | 0 | |
| Malpighiales | 16,000 | 70,871 | 7 | Canellales | 136 | 344 | 0 | |
| Vitales | 850 | 68,942 | 3 | Rannunculales | 2,830 | 132 | 0 | |
| Zingiberales | 2,100 | 47,288 | 3 | Myrtales | 11,000 | 111 | 2 | |
| Asterales | 27,500 | 37,876 | 4 | Proteales | 1,060 | 0 | 1 | |
| Caryophyllales | 11,155 | 26,241 | 6 | Amborellales | 1 | 0 | 1 | |
| Fagales | 1,900 | 26,123 | 4 |
Fig. 1Total value (USD Millions) line fit plot, for family data
Descriptive statistics for families and orders
| Families | |||||
|---|---|---|---|---|---|
| Mean | Median | Std dev | Min | Max | |
| Genomes sequenced | 2.65 | 1 | 5.02 | 0 | 31 |
| Value (USD millions) | 40,288 | 2,429 | 127,325 | 0 | 963,585 |
| No. of species | 3,077 | 1,217 | 5,577 | 1 | 27,801 |
| Orders | |||||
| Mean | Median | Std dev | Min | Max | |
| Genomes sequenced | 5.65 | 3 | 7.35 | 0 | 32 |
| Value (USD millions) | 84,816 | 26,122 | 179,613 | 0 | 975,203 |
| No. of species | 7,908 | 5,450 | 8,531 | 1 | 27,500 |
Regressions of number of sequenced genomes in a taxon as a function of total value of species in that taxon and the number of species in the taxon
| Data set | ||||
|---|---|---|---|---|
| Family | Order | |||
| Intercept | 0.9656 |
| 1.7453 |
|
| Value (USD millions) | 0.0000327 |
| 0.0000313 |
|
| Abundance | 0.000119 |
| 0.000158 |
|
|
| 0.77 | 0.70 | ||
| No. of observations | 65 | 31 | ||
Fig. 2Total value (USD Millions) line fit plot, without Poaceae
Fig. 3Distribution of sequenced genomes among families and orders as a function of species abundance
Regressions in Table 5 repeated without family Poaceae and order Poales
| Data set | ||||
|---|---|---|---|---|
| Family | Order | |||
| Intercept | 0.6818 |
| 1.7453 |
|
| Value (USD millions) | 0.0000489 |
| 0.0000507 |
|
| Abundance | 0.00008719 |
| 0.000141 |
|
|
| 0.52 | 0.52 | ||
| No. of observations | 64 | 30 | ||
Fig. 4Line fit plot of year of first sequenced genome in a family versus total value (USD Millions)
Regressions of year of first sequenced genome in a family as a function of total value of species in that taxon and the number of species in the taxon
| Intercept | 2012.7 |
|
| Value (USD millions) | -0.0000236 |
|
| Abundance | -0.0000202 |
|
|
| 0.15 | |
| No. of observations | 50 |