| Literature DB >> 29528046 |
Andrew Spriggs1, Steven T Henderson2, Melanie L Hand2,3, Susan D Johnson2, Jennifer M Taylor1, Anna Koltunow2.
Abstract
Cowpea ( Vigna unguiculata (L.) Walp) is an important legume crop for food security in areas of low-input and smallholder farming throughout Africa and Asia. Genetic improvements are required to increase yield and resilience to biotic and abiotic stress and to enhance cowpea crop performance. An integrated cowpea genomic and gene expression data resource has the potential to greatly accelerate breeding and the delivery of novel genetic traits for cowpea. Extensive genomic resources for cowpea have been absent from the public domain; however, a recent early release reference genome for IT97K-499-35 ( Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/) has now been established in a collaboration between the Joint Genome Institute (JGI) and University California (UC) Riverside. Here we release supporting genomic and transcriptomic data for two transformable cowpea varieties, IT97K-499-35 and IT86D-1010. The transcriptome resource includes six tissue-specific datasets for each variety, with particular emphasis on reproductive tissues that extend and support the V. unguiculata v1.0 reference. Annotations have been included in our resource to allow direct mapping to the v1.0 cowpea reference. The resource described here is supported by downloadable raw and assembled sequence data.Entities:
Keywords: cowpea; genome; male and female gametogenesis; seed; transcriptome
Year: 2018 PMID: 29528046 PMCID: PMC5841572 DOI: 10.12688/gatesopenres.12777.2
Source DB: PubMed Journal: Gates Open Res ISSN: 2572-4754
Details of IT97K-499-35 and IT86D-1010 genomic DNA contigs generated and assembled in this study.
Contigs of less than 1000 base pairs were excluded in this summary. Comparison to the V. unguiculata genome v1.0 of IT97K-499-35 is provided.
| IT97K-499-35
| IT86D-1010
| V.Ung v1.0
[ | |
|---|---|---|---|
|
| 57,690 | 39,123 | 686 |
|
| 568,059,011 | 609,523,031 | 519,435,864 |
|
| 1,000 | 1,000 | 2,922 |
|
| 9,847 | 15,580 | 757,195 |
|
| 17,952 | 36,693 | 41,684,185 |
|
| 150,032 | 347,074 | 65,292,630 |
1. Genomic DNA assembled contigs (gDNA)
2. Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/
3. Sequence lengths are in basepairs (bp)
Proportion of filtered DNA paired-end reads that uniquely align to the assembled genomic DNA sequence sets from IT97K-499-35 and IT86D-1010, and to the Vigna unguiculata genome v1.0 assembly of IT97K-499-35.
IT86D-1010 and IT97K-499-35 are the genome contig assemblies generated in this resource. Alignments were accepted if they were unique pair-end alignments within 1 kilobase of each other, with auto end-trimming of reads where required, and up to 3 mismatches per 100 base pairs.
| Raw read set | IT86D-1010 | IT97K-499-35 | V.Ung v1.0
[ |
|---|---|---|---|
|
| 72.6% | 64.8% | 64.8% |
|
| 62.5% | 68.1% | 65.9% |
1. Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/
2. Genomic DNA assembled contigs (gDNA)
Proportion of assembled DNA and tissue-specific transcript contigs that align to the Vigna unguiculata v1.0 reference genome at three thresholds of overlap.
| Query
| 50%
[ | 70%
[ | 90%
[ | |
|---|---|---|---|---|
|
| 131,241 | 98.0% | 91.1% | 65.4% |
|
| 57,690 | 98.2% | 87.8% | 60.1% |
|
| 73,278 | 87.7% | 80.3% | 56.8% |
|
| 36,179 | 90.7% | 83.9% | 59.8% |
|
| 36,058 | 91.8% | 84.5% | 60.1% |
|
| 40,158 | 92.2% | 87.0% | 66.2% |
|
| 37,710 | 91.4% | 86.6% | 65.5% |
|
| 38,623 | 91.5% | 86.6% | 65.8% |
|
| 73,967 | 88.8% | 81.9% | 59.9% |
|
| 35,503 | 91.8% | 85.3% | 61.9% |
|
| 41,783 | 92.5% | 86.0% | 64.7% |
|
| 41,580 | 92.0% | 85.7% | 64.1% |
|
| 36,592 | 92.4% | 87.8% | 68.0% |
|
| 37,470 | 92.9% | 88.0% | 67.8% |
1. Minimum overlap of query contig required within the target reference genome Vigna unguiculata v1.0.
2. Genomic DNA contigs (gDNA)
3. Pooled male gametophyte (PMG)
4. Transcript contigs (tr)
5. Megaspore mother cell stage (MMC)
6. Female meiotic tetrads (FMT)
7. Mature female gametophyte (MFG)
8. Early seeds (ES)
Details of predicted coding gene sequences with 300bp minimum length predicted by Augustus within the assembled genomic DNA contig sets from IT97K-499-35 and IT86D-1010.
Matches to NCBI’s ‘nr’ protein sequence database found through ‘blastp’ of translated predicted genes, with an e-value threshold of 1e-50.
| Augustus
[ | IT97K-499-35
| IT86D-1010
|
|---|---|---|
|
| 61,195 | 62,963 |
|
| 81,479,968 | 87,223,042 |
|
| 300 | 300 |
|
| 1,331 | 1,385 |
|
| 1,791 | 1,887 |
|
| 14,583 | 15,909 |
|
| 41,874 | 43,253 |
|
| 68% | 69% |
1. Augustus in-silico gene prediction ( bioinf.uni-greifswald.de/augustus/; Stanke & Waack, 2003)
2. Genomic DNA assembled contigs (gDNA)
3. Coding DNA Sequence (CDS)
4. Sequence lengths are in basepairs (bp)
5. NCBI ‘nr’ database downloaded 8 th August 2017
Details of assembled tissue-specific polyA RNA sequence sets from IT86D-1010.
Assembled contigs of less than 300 base pairs were excluded in this analysis.
| IT86D-1010 | Leaf-tr
[ | PMG
[ | MMC
[ | FMT
[ | MFG
[ | ES
[ |
|---|---|---|---|---|---|---|
|
| 73,278 | 36,179 | 36,058 | 40,158 | 37,710 | 38,623 |
|
| 68,247,480 | 40,853,458 | 42,326,934 | 43,555,218 | 41,562,341 | 41,760,972 |
|
| 300 | 300 | 300 | 300 | 300 | 300 |
|
| 931 | 1,129 | 1,174 | 1,085 | 1,102 | 1,081 |
|
| 1,208 | 1,602 | 1,660 | 1,494 | 1,538 | 1,501 |
|
| 14,930 | 12,310 | 12,441 | 12,276 | 11,392 | 12,272 |
1. Transcript contigs (tr)
2. Pooled male gametophyte (PMG)
3. Megaspore mother cell stage (MMC)
4. Female meiotic tetrads (FMT)
5. Mature female gametophyte (MFG)
6. Early seeds (ES)
7. Sequence lengths are in base pairs
Details of assembled tissue-specific polyA RNA sequence sets from IT97K-499-35.
Assembled contigs of less than 300 base pairs were excluded in this analysis.
| IT97K-499-35 | Leaf-tr
[ | PMG
[ | MMC
[ | FMT
[ | MFG
[ | Seed
[ |
|---|---|---|---|---|---|---|
|
| 73,967 | 35,503 | 41,783 | 41,580 | 36,592 | 37,470 |
|
| 69,053,233 | 40,224,171 | 46,244,665 | 45,725,500 | 39,970,331 | 41,525,557 |
|
| 300 | 300 | 300 | 300 | 300 | 300 |
|
| 934 | 1,133 | 1,107 | 1,100 | 1,092 | 1,108 |
|
| 1,223 | 1,619 | 1,565 | 1,557 | 1,528 | 1,547 |
|
| 13,965 | 12,238 | 12,960 | 13,799 | 12,605 | 16,435 |
1. Transcript contigs (tr)
2. Pooled male gametophyte (PMG)
3. Megaspore mother cell stage (MMC)
4. Female meiotic tetrads (FMT)
5. Mature female gametophyte (MFG)
6. Early seeds (ES)
7. Sequence lengths are in base pairs
Proportion of filtered raw RNASeq paired-end reads that uniquely align to the assembled genomic DNA sequence sets from IT97K-499-35 and IT86D-1010, and to the Vigna unguiculata genome v1.0 assembly of IT97K-499-35.
Alignments by ‘biokanga align’, with up to 3 substitutions per 100 base pairs, paired-ends retained within 1 kilobase of each other and auto end-trimming of reads where required.
| Raw read set | IT86D-1010 | IT97K-499-35 | V.Ung v1.0
[ |
|---|---|---|---|
|
| 69.2% | 68.6% | 68.2% |
|
| 72.6% | 72.1% | 70.7% |
|
| 73.5% | 72.7% | 72.3% |
|
| 71.4% | 70.7% | 70.1% |
|
| 73.3% | 72.7% | 72.3% |
|
| 71.3% | 70.7% | 70.3% |
|
| 66.6% | 67.2% | 66.7% |
|
| 69.2% | 70.0% | 68.5% |
|
| 69.9% | 70.4% | 69.9% |
|
| 69.3% | 69.8% | 69.3% |
|
| 69.7% | 70.1% | 69.6% |
|
| 68.4% | 68.8% | 68.1% |
1. Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/
2. Transcript contigs (tr)
3. Pooled male gametophyte (PMG)
4. Megaspore mother cell stage (MMC)
5. Female meiotic tetrads (FMT)
6. Mature female gametophyte (MFG)
7. Early seeds (ES)
Proportion of predicted gene models that accrue RNA sequencing reads.
Counts shown for gene models with more than 20 uniquely aligning RNASeq reads.
| Transcriptome | August Gene Models
[ | Proportion of total
|
|---|---|---|
|
| 21,024 | 31% |
|
| 21,315 | 31% |
|
| 20,672 | 31% |
|
| 21,356 | 32% |
|
| 20,290 | 30% |
|
| 20,486 | 30% |
|
| 21,088 | 31% |
|
| 20,953 | 31% |
|
| 20,905 | 31% |
|
| 20,274 | 30% |
|
| 20,005 | 30% |
|
| 20,871 | 31% |
1. Augustus in-silico gene prediction on IT86D genomic contigs ( bioinf.uni-greifswald.de/augustus/; Stanke & Waacke, 2003)
2. Transcript contigs (tr)
3. Pooled male gametophyte (PMG)
4. Megaspore mother cell stage (MMC)
5. Female meiotic tetrads (FMT)
6. Mature female gametophyte (MFG)
7. Early seeds (ES)