| Literature DB >> 29144398 |
Xiangxiang Zeng1, Shuting Jin2, Jin Jiang3, Kunhuang Han4, Xiaoping Min5, Xiangrong Liu6.
Abstract
The importance of a gene's impact on traits is well appreciated. Gene expression will affect the growth, immunity, reproduction and environmental resistance of some fish, and then affect the economic performance of fish-related business. Studying the connection between gene and character can help elucidate the growth of fishes. Thus far, a collected database containing large yellow croaker (Larimichthys crocea) genes does not exist. The gene having to do with the growth efficiency of fish will have a huge impact on research. For example, the protein encoded by the IFIH1 gene is associated with the function of viral infection in the immune system, which affects the survival rate of large yellow croakers. Thus, we collected data through the published literature and combined them with a biological genetic database related to the large yellow croaker. Based on the data, we can predict new gene-trait associations which have not yet been discovered. This work will contribute to research on the growth of large yellow croakers.Entities:
Keywords: KATZ; large yellow croaker; trait gene prediction
Mesh:
Substances:
Year: 2017 PMID: 29144398 PMCID: PMC6150269 DOI: 10.3390/molecules22111978
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Performance comparison among different parameter settings (k = 2, 3, 4) in the framework of LOOCV.
| K = 2 | K = 3 | K = 4 |
|---|---|---|
| 0.7564 | 0.7766 | 0.7760 |
Figure 1Prediction performance of KATZ-YC with different parameter settings (k = 2, 3 and 4) in terms of the receiver–operating characteristics (ROC) curve and area under the curve (AUC), based on K-fold cross validation: (a) based on Leave One Out Cross Valication (LOOCV); (b) based on 2-fold cross validation; and (c) based on 5-fold cross validation.
Performance comparison among different parameter settings (k = 2, 3, 4) in the framework of 2-fold cross validation.
| K = 2 | K = 3 | K = 4 |
|---|---|---|
| 0.7036 ± 0.03 | 0.7183 ± 0.03 | 0.7132 ± 0.05 |
Performance comparison among different parameter settings (k = 2, 3, 4) in the framework of 5-fold cross validation.
| K = 2 | K = 3 | K = 4 |
|---|---|---|
| 0.7338 ± 0.03 | 0.7476 ± 0.03 | 0.7357 ± 0.04 |
Figure 2Description the use of a known gene–trait network to obtain trait interaction profiles, gene interaction profiles and the corresponding binary adjacency matrix A. The orange node represents the trait, and the blue node represents the gene in the network.
matrix A:
Figure 3The similarity network between traits, where the nodes represent traits, the weighted edges represent the similarity associations between nodes, the size of a node corresponds to the number of neighbors of the node, and nodes in the network with identical colors have the same similarity value.
matrix A:
| T | … | |||||
|---|---|---|---|---|---|---|
| G | ||||||
| 1 | 0 | 0 | 1 | … | ||
| 1 | 1 | 0 | 0 | … | ||
| 1 | 1 | 0 | 0 | … | ||
| 0 | 0 | 1 | 1 | … | ||
| … | … | … | … | … | … |