Jiaxuan Li, Shuai Yang, Xiaojie Yang, Hui Wu, Heng Tang, Long Yang.
Abstract
Gene families consist of genes that descend from a common ancestor and share similar sequences and structures. Their members perform specific functions within and across species. Currently, there is no complete workflow or platform for the rapid analysis of plant gene families. In this study, a comprehensive query and analysis platform for plant gene families, the Plant Gene Family Platform (PlantGF), was constructed. The platform is composed of four main parts: Search, Tools, Statistics and Auxiliary. A total of 2 909 580 gene family members were identified from 138 plant species in PlantGF. These data can be queried in the Search section through a user-friendly interface. The Tools section provides a general nine-step process for gene family analysis, along with four online tools (HMM-Search, BLAST, MAFFT and HMMER) for additional analyses. Statistical analyses of the gene families are shown on the Statistics page. Auxiliary pages support data downloading: the protein sequences and gene families of all 138 plant species can be obtained from the Download page. A user's manual and useful links are available on the Manual and Links pages, respectively. To the best of our knowledge, PlantGF is the first comprehensive platform for studying plant gene families, and it is expected to contribute to plant gene family-related research. Database URL: http://biodb.sdau.edu.cn/PGF/index.html
Year: 2022 PMID: 35134149 PMCID: PMC9278324 DOI: 10.1093/database/baab088
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 4.462
Figure 1. Main PlantGF web pages. (A) PlantGF homepage: provides quick entry points to all main parts. (B) Search: contains 2 909 580 gene family members and their detailed annotations. (C) Tools: presents the nine steps of gene family analysis, together with four online tools (HMM-Search, BLAST, MAFFT and HMMER). (D) Statistics: statistics for the gene families. (E) Datasets Download: provides download links for the protein sequences and gene families of the 138 plant species.
Figure 2. Statistics of plant gene families. (A) Composition of the genes of the 138 plant species. (B) Statistics of the largest number of gene families in each species. (C) Number of gene families in each species.
Nine steps of gene family analysis
| Steps | Name | Tools |
|---|---|---|
| 1 | Data Acquisition | Expression Patterns; Homologous Families (Genera) Database; Single species Database |
| 2 | Family Identification | HMM-Search ( |
| 3 | Physicochemical Properties | Compute pI/Mw tool ( |
| 4 | Structural Analysis | GSDS ( |
| 5 | Phylogenetic Analysis | MEGA ( |
| 6 | Collinearity Analysis | Circos ( |
| 7 | Annotation Analysis | Gene Ontology ( |
| 8 | Gene Location | MapChart ( |
| 9 | Expression Patterns | ArrayExpress ( |
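As an illustration of step 2 (Family Identification), the following is a minimal sketch of how candidate family members might be selected from HMMER `hmmsearch --tblout` output by an E-value cutoff. It is not PlantGF's own implementation: the sample table, the gene and family names, the helper functions `parse_tblout` and `filter_members`, and the `1e-5` threshold are all assumptions chosen for the example.

```python
from typing import Dict, List

# Hypothetical hmmsearch --tblout content; all names and values are made up.
SAMPLE_TBLOUT = """\
# target name  accession  query name  accession  E-value  score  bias  E-value  score  bias  exp reg clu ov env dom rep inc description
geneA.1        -          DemoFam     -          1.2e-30  105.3  0.1   2.0e-30  104.6  0.1   1.0 1   0   0  1   1   1   1   hypothetical protein
geneB.1        -          DemoFam     -          3.4e-02  12.1   0.0   5.0e-02  11.6   0.0   1.1 1   0   0  1   1   0   0   hypothetical protein
geneC.2        -          DemoFam     -          7.5e-12  44.8   0.3   9.1e-12  44.5   0.3   1.0 1   0   0  1   1   1   1   hypothetical protein
"""


def parse_tblout(text: str) -> List[Dict[str, object]]:
    """Parse per-sequence hits from hmmsearch tabular output."""
    hits = []
    for line in text.splitlines():
        # Skip blank lines and '#' comment/header lines.
        if not line.strip() or line.startswith("#"):
            continue
        # 18 whitespace-separated columns, then a free-text description.
        fields = line.split(maxsplit=18)
        hits.append({
            "target": fields[0],         # sequence (gene/protein) identifier
            "query": fields[2],          # profile HMM (family) name
            "evalue": float(fields[4]),  # full-sequence E-value
            "score": float(fields[5]),   # full-sequence bit score
        })
    return hits


def filter_members(hits: List[Dict[str, object]], max_evalue: float = 1e-5) -> List[str]:
    """Return sorted, de-duplicated target IDs at or below the E-value cutoff."""
    return sorted({h["target"] for h in hits if h["evalue"] <= max_evalue})


if __name__ == "__main__":
    members = filter_members(parse_tblout(SAMPLE_TBLOUT))
    print(members)  # ['geneA.1', 'geneC.2']
```

In practice the cutoff would be tuned per family, and candidates are often checked further (for example, by confirming the conserved domain) before being accepted as family members.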