Literature DB >> 26744602

SorGSD: a sorghum genome SNP database.

Hong Luo1,2, Wenming Zhao3, Yanqing Wang3, Yan Xia1, Xiaoyuan Wu1, Limin Zhang1, Bixia Tang3, Junwei Zhu3, Lu Fang3, Zhenglin Du3, Wubishet A Bekele4, Shuaishuai Tai5, David R Jordan6, Ian D Godwin7, Rod J Snowdon4, Emma S Mace8, Hai-Chun Jing1, Jingchu Luo9.   

Abstract

BACKGROUND: Sorghum (Sorghum bicolor) is one of the most important cereal crops globally and a potential energy plant for biofuel production. In order to explore genetic gain for a range of important quantitative traits, such as drought and heat tolerance, grain yield, stem sugar accumulation, and biomass production, via the use of molecular breeding and genomic selection strategies, knowledge of the available genetic variation and the underlying sequence polymorphisms, is required.
RESULTS: Based on the assembled and annotated genome sequences of Sorghum bicolor (v2.1) and the recently published sorghum re-sequencing data, ~62.9 M SNPs were identified among 48 sorghum accessions and included in a newly developed sorghum genome SNP database SorGSD (http://sorgsd.big.ac.cn). The diverse panel of 48 sorghum lines can be classified into four groups, improved varieties, landraces, wild and weedy sorghums, and a wild relative Sorghum propinquum. SorGSD has a web-based query interface to search or browse SNPs from individual accessions, or to compare SNPs among several lines. The query results can be visualized as text format in tables, or rendered as graphics in a genome browser. Users may find useful annotation from query results including type of SNPs such as synonymous or non-synonymous SNPs, start, stop of splice variants, chromosome locations, and links to the annotation on Phytozome (www.phytozome.net) sorghum genome database. In addition, general information related to sorghum research such as online sorghum resources and literature references can also be found on the website. All the SNP data and annotations can be freely download from the website.
CONCLUSIONS: SorGSD is a comprehensive web-portal providing a database of large-scale genome variation across all racial types of cultivated sorghum and wild relatives. It can serve as a bioinformatics platform for a range of genomics and molecular breeding activities for sorghum and for other C4 grasses.

Entities:  

Keywords:  Bio-energy plant; Database curation; Genome variation; SNPs; Sorghum

Year:  2016        PMID: 26744602      PMCID: PMC4704391          DOI: 10.1186/s13068-015-0415-8

Source DB:  PubMed          Journal:  Biotechnol Biofuels        ISSN: 1754-6834            Impact factor:   6.040


Background

Sorghum (Sorghum bicolor) originated from Africa and became an important cereal crop after a long period of domestication and selective breeding [1]. Nowadays, it feeds over 500 million people in 98 countries [2], with an estimation of 42 million hectares of cultivated area and 62 million tons of yield per year (FAOSTAT data 2013, http://faostat3.fao.org). In contrast to C3 crops such as rice and wheat, sorghum has the C4 photosynthetic pathway, which leads to higher photosynthetic efficiency under circumstances of intense light, high temperature and low water supply [2-4]. As such, sorghum has remarkable drought and heat tolerance, and can produce high yield and biomass in areas of harsh conditions with low inputs. Sorghum is not only used for food, but also cultivated with other important economic impacts for forage, sugars and biomass. Furthermore, in recent years sorghum has been regarded as a promising bioenergy feedstock [5], which is comparable to other important biofuel grasses such as maize, sugarcane, Miscanthus and switch grass [6, 7]. Moreover, the compact genome and high degree of genetic synteny to other C4 grasses make sorghum a potential genetic model for the design of bioenergy crops [8, 9]. Sorghum’s genome is relatively small (~730 M) and simple (10 chromosomes, diploid) compared to other C4 crops in the Poaceae subfamily, such as maize and sugarcane. The recent completion and availability of a whole genome reference sequence, based on the elite line BTx623, has accelerated the pace of genetic and genomic research in sorghum [10]. The genetic basis of a range of important agronomic traits in sorghum has been elucidated, including drought tolerance and maturity [2]. Nevertheless, to better understand the genetic basis for the considerable phenotypic variation observed in many more agronomic and bioenergy traits of different sorghum accessions, it is necessary to have insight into genomic variation including single nucleotide polymorphisms (SNPs), insertions/deletions (INDELs) and structure variation (SV). Recently, various high throughput strategies have been developed for genome re-sequencing [11-13], resulting in a large amount of SNP data being generated for sorghum [14-18]. These SNP data, representing high density biomarkers, are a valuable resource for researchers to perform genetic and breeding studies, such as genotyping by sequencing (GBS) [19-21], bulked segregant analysis (BSA) [22], and genome-wide association studies (GWAS) [18, 23, 24]. These studies will not only lead to the highly efficient discovery of key QTLs or genes relevant to important traits, but also contribute to the understanding of the evolutionary relationship of cultivated and wild Sorghum species and subspecies. To enhance the utility of sorghum SNP data, we developed a web-based large-scale genome variation database (SorGSD, http://sorgsd.big.ac.cn). SorGSD contains ~62.9 million SNPs from a diverse panel of 48 sorghum accessions divided into four groups, including improved inbreds, landraces, wild/weedy sorghums, and accessions of the wild relative Sorghum propinquum. These SNP data have been annotated and an easy-to-use web interface has been designed for users to browse, search and analyze the SNPs efficiently. SorGSD allows users to query the SNP information and their relevant annotations for individual samples. The search results can be visualized graphically in a genome browser or displayed in formatted tables. Users can also compare SNP data between two and more sorghum accessions. The output of query results can be downloaded for further investigation, or users can bulk download the entire SNP dataset of 48 accessions. SorGSD also manages additional sorghum related information, such as general descriptions of sorghum and its genome, sorghum research institutions around the world, and lists of sorghum literature references.

Result and discussion

Database content

SorGSD contains ~62.9 million SNPs identified from the re-sequencing data of 48 sorghum lines mapped to the reference genome BTx623. These sorghum lines represent major cultivated races grouped into landraces or improved varieties, and weedy or wild subspecies. Figure 1 shows the phylogenetic relationship among these sorghum lines [16], with the genotype name and group indicated. Racial type and geographic origin are also included. Additionally, the total number of SNPs identified per sample is indicated. The two margaritiferum cultivars (PI525695 M Margaritiferum Mali 1964025 and PI586430 M Margaritiferum Sierra Leone 1938008) are separated into a distinct group since they are highly divergent from other S. bicolor races (Fig. 1). Two samples of the allopatric Asian species Sorghum propinquum are clustered within a distant group as the outgroup.
Fig. 1

A dendrogram showing the phylogenetic relationships among the diverse set of sorghum lines. Each sample is labelled as follows; the genotype name, sample type (coded, as detailed below), racial type, geographic origin, and total number of SNPs identified. Sample type codes: I improved variety, L landrace, W weedy or wild, M margaritiferum, P Sorghum propinquum. The sorghum reference genome BTx623 is shown in bold, sweet sorghums are in italic. (Adapted from Mace et al. [16] and redrawn using the tool “Display Newick Trees” under MEGA 6.0, SS79 was added based on the output results of the SNPhylo program [34] using the SNP data.)

A dendrogram showing the phylogenetic relationships among the diverse set of sorghum lines. Each sample is labelled as follows; the genotype name, sample type (coded, as detailed below), racial type, geographic origin, and total number of SNPs identified. Sample type codes: I improved variety, L landrace, W weedy or wild, M margaritiferum, P Sorghum propinquum. The sorghum reference genome BTx623 is shown in bold, sweet sorghums are in italic. (Adapted from Mace et al. [16] and redrawn using the tool “Display Newick Trees” under MEGA 6.0, SS79 was added based on the output results of the SNPhylo program [34] using the SNP data.) The SNP numbers of each sample give an overview of the genomic difference between the reference genome BTx623 and individual genomes. Detailed information about distribution of SNPs in different genomic regions, including genic, intergenic, and intronic regions is provided (Table 1). For genic regions, SNPs found in specific positions such as start and stop codons, splice donator and acceptor sites are listed (Table 2).
Table 1

Distribution of SNPs in different genomic regions in 48 sorghum accessions

GenotypeTypeRacial typeGeographic originTotal SNP numbers
AllIntergenic5′ UTRIntronicNon-synSyn3′ UTR
BTx623IComplexUSA0000000
Malisor84-7IComplexMali347707284944207936036783471758261
ICSV745IComplexIndia906800762772616681888174761594319300
EarlyHegariLCaudatumSudan910090748915689390114203751926720719
CherekitLCaudatumEthiopia932505763491717996419197651898922799
SC103-14ELGuinea-CaudatumSouth Africa800708657087558982356172161589919275
MaciaICaudatumMozambique776904632772569884057161031519919937
SC108-14ELCaudatumEthiopia738736600093596978205166471592618758
SC237-14ELCaudatumSudan865874708878729987805189471817421154
B923296IComplexAustralia652758537078431266567132191259116395
B963676IComplexAustralia646947521677509571404142771400717587
M35-1LDurraIndia799673659631562981167157271523919067
R931945-2-2IComplexAustralia1240958104524310365109678229042198926306
SC170-6-8LCaudatumEthiopia834928698227610777492162241549918180
SC326-6LCaudatum-BicolorEthiopia-USA862389702263686991952184101752921739
SC56-14ELCaudatum-NigricansSudan963728788668783198457214511978323541
SC62-14ELComplexKenya9960818032348304108872231862151626629
E-TianIKafirChina434744323422866645605196832002314334
RioIKafirUSA824373660153741092526197511889021916
SS79IKafirEthiopia1291250104875216348122623340793258630973
IS8525LKafirEthiopia941482777365792692487195271821022188
KellerIComplexUSA335625238622409650617131431256014148
RTx7000IKafir-CaudatumUSA11254229431429873102492210751984624795
IS3614-2LGuineaNigeria131306811027248066123657229312174929188
Karper669IComplexUSA-Sudan11217809353937839106347220612073825193
PI563516IDurra-CaudatumMali10145308353828632100645209992010124679
QL12IComplexAustralia10372528609487376101401202971893324245
IS9710LCaudatumSudan9618667832996937102930205842007124013
KS115IDurra-CaudatumUSA9374497675524773102892178301645424350
Ji2731LCaudatumChina5389893950201024660269248472565219250
AI4IComplexChina11601619634947757112978227222219326553
LR9198IComplexChina125317010393619778121486244832356529609
BTx642 LDurraEthiopia1524769128787612862132322275412602132749
SC35C-14ELDurraEthiopia122881410280727766115689231082214327565
SC23-14ELDurraEthiopia136209811463779130123135249862368029949
Yik.solateLDurraEthiopia11180669330125181116030170591538027540
IBC/E-38432LDurraEthiopia1715354143019311247167795303532906140485
PI585749LDurra-BicolorMali1446371121009711321133531274492591732590
PI330272WDrummondiiEthiopia1501312124239410899147465304482904835194
ZengadaWWeedyEthiopia1581684131547810824155882286242724737720
KiloWWeedyEthiopia126776010474675627137344214731990931449
GreenleafWWeedyUSA1522107126828710468145993292472813034204
PI226096WWeedyKenya1956801164144416255179268352503373043508
PI525695MMargaritiferumMali1964025162845512730197292362023556946286
PI586430MMargaritiferumSierra Leone1938008159434813766198477388943843146271
PI300119WVerticilliflorumSouth Africa2995879248229426648290919562135661771315
AusTRCF317961WVerticilliflorumAustralia2003360162541912596226288389533928352566
Sorpr369-1PPropinquum5200279397168558105713492124517141591163430
Sorpr369-2PPropinquum4993948379452453315704812118631135432160696

I improved variety, L landrace, W wild/weedy, M margaritiferum, P Sorghum propinquum

Table 2

Distribution of major effect SNPs in different genic sites and regions in 48 sorghum accessions

GenotypeTypeRacial typeGeographic originStart codonStop codonSplice sites
GainLostVariantGainLostRetainDonorAcceptorRegion
BTx623IComplexUSA000000000
Malisor84-7IComplexMali38016113039102832742
ICSV745IComplexIndia1084459246642169741643
EarlyHegariLCaudatumSudan12813512282811869691960
CherekitLCaudatumEthiopia12834212291681866722011
SC103-14ELGuinea-CaudatumSouth Africa1014398244722051751763
MaciaICaudatumMozambique996385239622651601661
SC108-14ELCaudatumEthiopia1054429230602051681604
SC237-14ELCaudatumSudan1242506282792257721807
B923296IComplexAustralia770269199531953721395
B963676IComplexAustralia962305192621857641510
M35-1LDurraIndia1048395246682166741646
R931945-2-2IComplexAustralia170374153318129881102042
SC170-6-8LCaudatumEthiopia1065445245692755581631
SC326-6LCaudatum-BicolorEthiopia-USA12203410266822662961831
SC56-14ELCaudatum-NigricansSudan14334010328782458801946
SC62-14ELComplexKenya14553810295992871942250
E-TianIKafirChina14304114228652557661085
RioIKafirUSA12734716259722165751899
SS79IKafirEthiopia26027821419116371021302384
IS8525LKafirEthiopia13534617261692464871858
KellerIComplexUSA750277212481236451302
RTx7000IKafir-CaudatumUSA16054715284933278971948
IS3614-2LGuineaNigeria142148133589537831122586
Karper669IComplexUSA-Sudan13625211301932463942209
PI563516IDurra-CaudatumMali14275313298862762942032
QL12IComplexAustralia13214510313792564982097
IS9710LCaudatumSudan12653810301852173782161
KS115IDurra-CaudatumUSA9003512270773371892111
Ji2731LCaudatumChina16665213265762374621474
AI4IComplexChina14164512291782490992409
LR9198IComplexChina1735471033195271031072433
BTx642 LDurraEthiopia211475233631073893992486
SC35C-14ELDurraEthiopia14024816317893284942389
SC23-14ELDurraEthiopia1587551438410731871082468
Yik.solateLDurraEthiopia990258249672879902328
IBC/E-38432LDurraEthiopia19657014442113451081213342
PI585749LDurra-BicolorMali1930651738810943951302689
PI330272WDrummondiiEthiopia18655611458123491001483054
ZengadaWWeedyEthiopia1864591341311145951473162
KiloWWeedyEthiopia10583552947332851062803
GreenleafWWeedyUSA18716016411122341101163038
PI226096WWeedyKenya27677616495145481481483503
PI525695MMargaritiferumMali23187315524135461361624082
PI586430MMargaritiferumSierra Leone25258215562144471381754133
PI300119WVerticilliflorumSouth Africa444113229786204902112245756
AusTRCF317961WVerticilliflorumAustralia22788016521145531631854814
Sorpr369-1PPropinquum985924942151937823640748114288
Sorpr369-2PPropinquum916924141143735924040546514181

I improved variety; L landrace; W wild/weedy; M margaritiferum; P Sorghum propinquum

Distribution of SNPs in different genomic regions in 48 sorghum accessions I improved variety, L landrace, W wild/weedy, M margaritiferum, P Sorghum propinquum Distribution of major effect SNPs in different genic sites and regions in 48 sorghum accessions I improved variety; L landrace; W wild/weedy; M margaritiferum; P Sorghum propinquum All the SNP data shown in the two tables can be easily accessed either as statistical information through the Help page of the database, or through the user interface. The original data of sequencing short reads, the assembled sequence and the SNP data of each accession can be downloaded.

User interface

SorGSD offers three main functions (search, compare and browse), for users to search, display and retrieve the SNPs and their annotations. The search function provides a user-friendly web interface to query SNP information. Users can search SNPs by specifying chromosomal co-ordinates or the locus ID. Users can also query SNPs based on their genotypes, and predicted variant effects. In addition, users can compare the SNPs between two and more sorghum lines. The query results can be shown as a formatted table which contains the information of ID, chromosome position, genomic location and predicted coding effects, 5′ and 3′ flanking sequences, reference and derived alleles, respectively. SNPs from the stringent set identified by both pipelines (see description in “Methods” and Fig. 2 for details) are highlighted with a green background in the result page. The output of the query results can be downloaded as flat text or formatted tables for further investigation.
Fig. 2

Venn diagram of SNPs identified by two pipelines. A SNPs called by the GATK-based pipeline. B SNPs called by the SOAPsnp- and realSFS-based pipeline. C The set of highly reliable SNPs those were identified by both pipelines

Venn diagram of SNPs identified by two pipelines. A SNPs called by the GATK-based pipeline. B SNPs called by the SOAPsnp- and realSFS-based pipeline. C The set of highly reliable SNPs those were identified by both pipelines SorGSD also provides several data browsing functionalities under the “Browse” pull-down menu. The “Total SNPs” tab lists the SNP numbers on 10 chromosomes of all 48 accessions. Users can select a group, e.g. Landraces, to display the SNP numbers of these accessions within this group. Mouse-clicking these SNP numbers will bring up the list of SNPs of a specific accession. Given that the different location in genes such as coding regions, as well as the non-synonymous information are often of great interest for further study, the “Genic SNP” tab lists several submenus including “Coding SNP”, “Synonymous SNP”, and “Non-synonymous SNP” so that information can be tailored to user requirements. The “Browse on Chromosome” tab leads to an interactive graphic window to visualize SNPs in a genome browser. Users can customize the visualization interface by selecting different data types, including SNPs, genes, transcripts, allele frequencies, and the SNP density information. Users can obtain a pie chart showing the allele frequency, SNP density in 300 kb windows size, related gene and transcript information.

Help information

SorGSD provides a help resource for users to better access the SNP data, as well as proving links to additional sorghum research related resources. The help menu provides a “How to” page, which gives a number of examples for users to learn how to search and compare target SNPs. For example, a step-by-step user-guide shows how to obtain non-synonymous SNPs in chromosome 1 of sweet sorghum E-Tian, and how to compare SNPs between sweet sorghum E-Tian and two grain sorghum Ji2731 and Keller. An FAQs page provides answers to a range of frequently asked questions not only about the content and usage of SorGSD but more broadly about sorghum genomics. Detailed information including software tools, parameters and data sources is presented in the “Pipeline” page. The “Statistics” page shows the SNP numbers distributed in different genomic regions (Table 1) and specific genic sites (Table 2). The “Data source” page shows the general information of 48 sorghum lines, including their geographic origins, and links to the US Germplasm Resources Information Network (http://www.ars-grin.gov). The “About” tab contains several pages related to sorghum research. The Sorghum Genome page provides a brief introduction to the reference genome BTx623, including genome size and gene number. The Resource page provides links to online databases, research institutions, sorghum producers and handbooks. The reference page lists selected recently published papers in the fields of sorghum genomics, genetics, QTLs, etc., with links to full lists in PubMed.

Conclusions and future directions

High coverage resequencing data from two previous sorghum studies [15, 16] were used to identify SNPs among 48 sorghum genotypes by combining three SNP calling tools and updating the SNPs datasets using the sorghum reference annotation (Version 2.1). In addition, we annotated the effect of SNP variants on genes of each sorghum accession. SorGSD has already received over two thousands of visits from more than 30 countries around the world since it went online a few months ago. During the review process of this manuscript, we were happy to know that a new website Sorghum Genomics (https://www.purdue.edu/sorghumgenomics) developed at Purdue University became available as a functional gene discovery platform. We will improve the SNP calling pipeline and the annotation procedure to obtain more accurate SNP data and upload them into the database. Furthermore, we will include additional types of genome variation data detected by newly developed pipelines, including INDELs and copy number variations (CNVs). At the same time, we will improve the web interface especially in the search function and give more examples in the user guide to help novice users to access the database easily. We will add more analytical functionalities so that users can perform more analyses such as Blast search, sequence alignment and phylogenetic analysis. SorGSD can serve as a bioinformatics platform to inform wet-lab experiments including biomarker development, allele mining and gene function assessment. In addition to the collaboration among research groups involving in this work, we will collaborate with other domestic and international laboratories in the sorghum research community to sequence and annotate more sorghum accessions in the future. We will update the database regularly and add SNP datasets with newly available re-sequenced sorghum accessions. We hope that the high density of these SNP data at genomic level collected from the major races of cultivated sorghum as well as other subspecies is a rich repository for a broader research community working in biomarker identification, genetic analysis and molecular breeding, especially for energy plant sweet sorghum cultivation.

Methods

The construction of SorGSD was a multi-step process. Firstly, the sorghum re-sequencing paired-end raw reads reported in the previously published works were downloaded [15, 16]. In addition, the paired-end raw reads generated in-house for a sweet sorghum line SS79 were included [unpublished data]. Secondly, the raw reads were mapped to the reference sorghum genome (BTx623) [10] using the BWA program [25]. SNPs were identified using the software GATK [26, 27], realSFS (http://popgen.dk/angsd/index.php/RealSFS) and SOAPsnp [28] and annotated using SnpEff [29]. With the SNP matrix finalized, a web interface was designed for users to browse and search the SNPs and related annotations. Details for the database construction are described as follows and are also available on the designated website.

Data source

The raw reads of sequencing data were from three original datasets. The largest dataset [16] contains 44 sorghum accessions and represent the major races of cultivated sorghum as well as their wild relatives. The second dataset [15] contains three accessions of cultivated sorghums. The raw reads of these two datasets can be downloaded from the NCBI sequence read archive (SRA) (accessions SRS378430-SRS378473, and accessions SRX100115-SRX100138). The third dataset contains the paired-end reads of sorghum line SS79, a cultivated sweet sorghum inbred. These data were recently generated in our laboratory using an Illumina HiSeq 2000 platform with insert size of 500 bp and have not been submitted to NCBI. The average sequencing depth of all sorghum accessions is about 20×, ranging from 12 to 54×.

SNP calling pipeline

After trimming adapters, the clean reads were mapped to version 2.1 of the reference genome (available via http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Sbicolor) using the BWA program [25], allowing a maximum of five mismatches and disabling long gaps in the mapping procedure. The average counts of the mapping rate, the unique mapping rate and the mapping coverage were 0.957, 0.681 and 0.881 respectively, excluding the two S. propinquum accessions. The SAM tools package [30] was used to convert mapping results to BAM format, and then the Picard program (http://picard.sourceforge.net) was applied to eliminate duplicated reads generated during the process of library construction. Subsequently, the GATK tools [26, 27] were used to recalibrate the base quality score to obtain more accurate quality scores for each base and realign reads around known INDELs. The refined data from all individuals were jointly used to call a raw SNPs set by GATK HaplotypeCaller. Finally, a set of SNPs were identified, using the variant quality score to recalibrate the procedure in GATK. In total, we identified 62,888,582 SNPs across all 48 sorghum lines, corresponding to 15,357,261 sites in the reference genome. The GATK based SNP calling pipeline is similar to that reported in a recent publication [31]. SNPs were additionally identified using the pipeline described previously using realSFS (http://popgen.dk/angsd/index.php/RealSFS) and SOAPsnp [28], described by Mace et al. [16]. Approximately 28 million highly stringent SNPs were in common between the two SNP identification pipelines (Fig. 2) with the GATK-based pipeline identifying more SNPs than the SOAPsnp-based pipeline. The total number of SNPs called by the GATK based pipeline was found to be comparable to the study by Evans et al. [32], which employed the CLC Workbench software (CLC Bio-Qiagen, Aarhus, Denmark). All the SNPs identified by the GATK pipeline were stored in SorGSD, with the subset of 28 million highly stringent SNPs highlighted in the results page. Finally, the effect of variants on all the v2.1 predicted gene models for each sorghum accession were predicted and annotated using the SnpEff program (version 4.0e) [29].

Database implementation

The SNP data and their related annotations were formatted into tables and stored in SorGSD using the MySQL database management system (version 5). The web interface of SorGSD was designed by JAVA/JSP (JDK 1.6) under the Apache/Tomcat web server (version 2.0) running under a Linux operation system (CentOS 6). We installed the generic genome browser GBrowse [33] as a chromosome-based visualization tool to display these genomic SNPs and annotations.
  32 in total

1.  The generic genome browser: a building block for a model organism system database.

Authors:  Lincoln D Stein; Christopher Mungall; ShengQiang Shu; Michael Caudy; Marco Mangone; Allen Day; Elizabeth Nickerson; Jason E Stajich; Todd W Harris; Adrian Arva; Suzanna Lewis
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

Review 2.  Survey of genomics approaches to improve bioenergy traits in maize, sorghum and sugarcane.

Authors:  Wilfred Vermerris
Journal:  J Integr Plant Biol       Date:  2011-02       Impact factor: 7.061

Review 3.  Nature's green revolution: the remarkable evolutionary rise of C4 plants.

Authors:  Colin P Osborne; David J Beerling
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-01-29       Impact factor: 6.237

4.  Plant genetics. How sorghum withstands heat and drought.

Authors:  Elizabeth Pennisi
Journal:  Science       Date:  2009-01-30       Impact factor: 47.728

Review 5.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing.

Authors:  John W Davey; Paul A Hohenlohe; Paul D Etter; Jason Q Boone; Julian M Catchen; Mark L Blaxter
Journal:  Nat Rev Genet       Date:  2011-06-17       Impact factor: 53.242

6.  Population genomic and genome-wide association studies of agroclimatic traits in sorghum.

Authors:  Geoffrey P Morris; Punna Ramu; Santosh P Deshpande; C Thomas Hash; Trushar Shah; Hari D Upadhyaya; Oscar Riera-Lizarazu; Patrick J Brown; Charlotte B Acharya; Sharon E Mitchell; James Harriman; Jeffrey C Glaubitz; Edward S Buckler; Stephen Kresovich
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-24       Impact factor: 11.205

7.  Genome-wide patterns of genetic variation in sweet and grain sorghum (Sorghum bicolor).

Authors:  Lei-Ying Zheng; Xiao-Sen Guo; Bing He; Lian-Jun Sun; Yao Peng; Shan-Shan Dong; Teng-Fei Liu; Shuye Jiang; Srinivasan Ramachandran; Chun-Ming Liu; Hai-Chun Jing
Journal:  Genome Biol       Date:  2011-11-21       Impact factor: 13.583

8.  RIG: Recalibration and interrelation of genomic sequence data with the GATK.

Authors:  Ryan F McCormick; Sandra K Truong; John E Mullet
Journal:  G3 (Bethesda)       Date:  2015-02-13       Impact factor: 3.154

9.  Combining Next Generation Sequencing with Bulked Segregant Analysis to Fine Map a Stem Moisture Locus in Sorghum (Sorghum bicolor L. Moench).

Authors:  Yucui Han; Peng Lv; Shenglin Hou; Suying Li; Guisu Ji; Xue Ma; Ruiheng Du; Guoqing Liu
Journal:  PLoS One       Date:  2015-05-18       Impact factor: 3.240

10.  Digital genotyping of sorghum - a diverse plant species with a large repeat-rich genome.

Authors:  Daryl T Morishige; Patricia E Klein; Josie L Hilley; Sayed Mohammad Ebrahim Sahraeian; Arun Sharma; John E Mullet
Journal:  BMC Genomics       Date:  2013-07-05       Impact factor: 3.969

View more
  12 in total

1.  Sweet Sorghum Originated through Selection of Dry, a Plant-Specific NAC Transcription Factor Gene.

Authors:  Li-Min Zhang; Chuan-Yuan Leng; Hong Luo; Xiao-Yuan Wu; Zhi-Quan Liu; Yu-Miao Zhang; Hong Zhang; Yan Xia; Li Shang; Chun-Ming Liu; Dong-Yun Hao; Yi-Hua Zhou; Cheng-Cai Chu; Hong-Wei Cai; Hai-Chun Jing
Journal:  Plant Cell       Date:  2018-10-11       Impact factor: 11.277

2.  Validation of QTL mapping and transcriptome profiling for identification of candidate genes associated with nitrogen stress tolerance in sorghum.

Authors:  Malleswari Gelli; Anji Reddy Konda; Kan Liu; Chi Zhang; Thomas E Clemente; David R Holding; Ismail M Dweikat
Journal:  BMC Plant Biol       Date:  2017-07-11       Impact factor: 4.215

3.  Characterization and Regulation of Aquaporin Genes of Sorghum [Sorghum bicolor (L.) Moench] in Response to Waterlogging Stress.

Authors:  Suhas Kadam; Alejandra Abril; Arun P Dhanapal; Robert P Koester; Wilfred Vermerris; Shibu Jose; Felix B Fritschi
Journal:  Front Plant Sci       Date:  2017-05-30       Impact factor: 5.753

4.  The BIG Data Center: from deposition to integration to translation.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

Review 5.  A comprehensive review of web-based resources of non-coding RNAs for plant science research.

Authors:  Peiran Liao; Shipeng Li; Xiuming Cui; Yun Zheng
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

6.  QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants.

Authors:  Fan Lin; Elena Z Lazarus; Seung Y Rhee
Journal:  G3 (Bethesda)       Date:  2020-07-07       Impact factor: 3.154

7.  A Fast and Scalable Workflow for SNPs Detection in Genome Sequences Using Hadoop Map-Reduce.

Authors:  Muhammad Tahir; Muhammad Sardaraz
Journal:  Genes (Basel)       Date:  2020-02-05       Impact factor: 4.096

Review 8.  Sorghum breeding in the genomic era: opportunities and challenges.

Authors:  Huaiqing Hao; Zhigang Li; Chuanyuan Leng; Cheng Lu; Hong Luo; Yuanming Liu; Xiaoyuan Wu; Zhiquan Liu; Li Shang; Hai-Chun Jing
Journal:  Theor Appl Genet       Date:  2021-03-02       Impact factor: 5.574

Review 9.  Progress and challenges in sorghum biotechnology, a multipurpose feedstock for the bioeconomy.

Authors:  Tallyta N Silva; Jason B Thomas; Jeff Dahlberg; Seung Y Rhee; Jenny C Mortimer
Journal:  J Exp Bot       Date:  2022-01-27       Impact factor: 6.992

10.  Erratum to: SorGSD: a sorghum genome SNP database.

Authors:  Hong Luo; Wenming Zhao; Yanqing Wang; Yan Xia; Xiaoyuan Wu; Limin Zhang; Bixia Tang; Junwei Zhu; Lu Fang; Zhenglin Du; Wubishet A Bekele; Shuaishuai Tai; David R Jordan; Ian D Godwin; Rod J Snowdon; Emma S Mace; Jingchu Luo; Hai-Chun Jing
Journal:  Biotechnol Biofuels       Date:  2016-02-15       Impact factor: 6.040

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.