| Literature DB >> 35605195 |
Fei Zhao1, Shilong Tian1, Qiuhong Wu2, Zijuan Li1, Luhuan Ye1, Yili Zhuang1, Meiyue Wang3, Yilin Xie1, Shenghao Zou4, Wan Teng5, Yiping Tong5, Dingzhong Tang6, Ajay Kumar Mahato7, Moussa Benhamed8, Zhiyong Liu9, Yijing Zhang10.
Abstract
Triticeae species, including wheat, barley, and rye, are critical for global food security. Mapping agronomically important genes is crucial for elucidating molecular mechanisms and improving crops. However, Triticeae includes many wild relatives with desirable agronomic traits, and frequent introgressions occurred during Triticeae evolution and domestication. Thus, Triticeae genomes are generally large and complex, making the localization of genes or functional elements that control agronomic traits challenging. Here, we developed Triti-Map, which contains a suite of user-friendly computational packages specifically designed and optimized to overcome the obstacles of gene mapping in Triticeae, as well as a web interface integrating multi-omics data from Triticeae for the efficient mining of genes or functional elements that control particular traits. The Triti-Map pipeline accepts both DNA and RNA bulk-segregated sequencing data as well as traditional QTL data as inputs for locating genes and elucidating their functions. We illustrate the usage of Triti-Map with a combination of bulk-segregated ChIP-seq data to detect a wheat disease-resistance gene with its promoter sequence that is absent from the reference genome and clarify its evolutionary process. We hope that Triti-Map will facilitate gene isolation and accelerate Triticeae breeding.Entities:
Keywords: Triti-Map; Triticeae; agronomic gene mapping; bulk-segregated ChIP-seq; wheat
Mesh:
Year: 2022 PMID: 35605195 PMCID: PMC9284283 DOI: 10.1016/j.xplc.2022.100304
Source DB: PubMed Journal: Plant Commun ISSN: 2590-3462
Pros and cons of different sequencing strategies for identifying molecular markers
| Data type | Sequencing cost | Library construction | Genomic coverage | SNP identification accuracy | Hardware requirements | References |
|---|---|---|---|---|---|---|
| WGS based | high | Simple | whole genome | no obvious bias | high | |
| RNA-seq based | low | Simple | expressed gene | affected by gene expression level and alternative splicing | low | |
| Exome capture | low | complicated | exons designed in probe | affected by reference genome, gene annotation, and probe design | low | |
| ChIP-seq based | low | medium | core-genome, including gene and regulatory elements | no obvious bias | medium |
Figure 1Triti-Map workflow
Triti-Map, which accepts raw sequencing data (ChIP-seq, RNA-seq, or WGS data) from bulks with different traits, comprises the interval mapping module (blue) for locating genomic regions associated with a target trait, the de novo assembly module (orange) for assembling trait-related sequences, and the web-based annotation module (green) for locating causal variants, candidate genes, or regulatory elements based on integrated multi-omics data and information on Triticeae species.
Figure 2Diagram of the web-based annotation module function
(A) Data integrated with the web-based annotation module.
(B) To locate causal variants and candidate genes or regulatory elements, Triti-Map integrates multi-omics data and provides different levels of analysis, including a collinearity analysis of target regions among Triticeae species, as well as a functional and evolutionary characterization of SNPs, genes, or other sequences related to a target trait.
Figure 3Optimization to address specific challenges of Triticeae gene mapping and annotation
The major steps that were optimized are marked by the following numbers: (1) steps that use specific tools or parameters that shorten the analysis time; (2) steps that split genomes for parallel analyses; (3) steps in which candidate sequences are filtered according to the colinear regions of candidate intervals across Triticeae species; (4) steps that use APIs from public databases to ensure timely updates and minimize local data storage. In each module, nodes with a colored background represent important result files, whereas nodes without a colored background represent the main analysis steps and the tools used.
Figure 4Triti-Map case study results
(A) Interval mapping module results. Upper panel: causal interval detected using the ΔSNP-index method. Lower panel: enlarged candidate region.
(B) Web-based annotation module results. From top to bottom: SNP annotation, SNP localization, and epigenome tracks of related regions.
(C) Collinearity analysis results. The regions in Triticeae species that are colinear with the detected candidate region are listed.
(D) Assembly module results. Top: Two newly assembled sequences (purple) highly similar to Pm60. Bottom: Phylogenetic tree presenting the evolutionary distance between Pm60 and homologous genes in wheat species with a different ploidy level.
Comparison of features between Triti-Map and published databases
| GrainGenes | Wheat-SnpHub-Portal | GeneTribe | WheatGmap | WheatOmics | Triti-Map | |
|---|---|---|---|---|---|---|
| Description | an improved resource for the small-grains community | SnpHub, an easy-to-set up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat | Triticeae GeneTribe, a collinearity-incorporating homology inference strategy for connecting emerging assemblies in the Triticeae Tribe as a pilot practice in the plant pangenomic era | WheatGmap, which integrates multiple BSA mapping models and large amounts of public data to accelerate gene cloning and functional research and facilitate resource sharing | WheatOmics, a platform combining multiple omics data to accelerate functional genomics studies in wheat | Triti-Map is composed of both gene mapping scripts and downstream analysis tools for efficient mapping of both candidate gene and intergenic regulatory elements |
| Year founded | 2005 | 2020 | 2020 | 2020 | 2018 | 2021 |
| URL | ||||||
| Applicable platform | web | web and Linux | web | web | web | |
| Input data type | / | / | gene, gene list, or fasta file | bulk-sequencing VCF | gene, gene list, or fasta file | |
| gene, gene list, or fasta file | bulk-sequencing VCF | |||||
| gene, gene list, or fasta file | ||||||
| Data in website | genome, molecular, and phenotypic information for 4 wheat relative species | genomic variation datasets of 7 wheats and their progenitors | homology inference information for 12 Triticeae and 3 outgroup species | high-throughput BSA sequencing datasets of hexaploid wheat (>3,500 groups) | multi-omics data, including genomes, transcriptomes, variomes, and epigenomes of multiple Triticeae species | multi-omics data, including genomes, transcriptomes, genetic variation, and epigenomes of 7 major Triticeae species |
| Website main function module | blast function | raw variation data and genomic sequence retrieval | collinear information and analysis | gene mapping | multi-omics data information | |
| gene and SNP annotations | transcriptional analysis | gene and SNP annotations | ||||
| transcriptional analysis | regulatory element analysis | collinear information and analysis | ||||
| genome browsers | blast function | functional analysis | epigenetic features and transcription factor binding motifs | |||
| genome browsers | gene identification | |||||
| gene identification | ||||||
| Gene mapping tool | no | No | no | yes | no |
/, no data available. Bold indicates the uniqueness of Triti-Map across multiple feature comparisons.