| Literature DB >> 32960944 |
Wei Huang1, Ping Zheng2, Zhenhai Cui3, Zhuo Li4, Yifeng Gao4, Helong Yu5, You Tang4,5, Xiaohui Yuan6, Zhiwu Zhang7.
Abstract
Accurately predicting phenotypes from genotypes holds great promise to improve health management in humans and animals, and breeding efficiency in animals and plants. Although many prediction methods have been developed, the optimal method differs across datasets due to multiple factors, including species, environments, populations and traits of interest. Studies have demonstrated that the number of genes underlying a trait and its heritability are the two key factors that determine which method fits the trait the best. In many cases, however, these two factors are unknown for the traits of interest. We developed a cloud computing platform for Mining the Maximum Accuracy of Predicting phenotypes from genotypes (MMAP) using unsupervised learning on publicly available real data and simulated data. MMAP provides a user interface to upload input data, manage projects and analyses and download the output results. The platform is free for the public to conduct computations for predicting phenotypes and genetic merit using the best prediction method optimized from many available ones, including Ridge Regression, gBLUP, compressed BLUP, Bayesian LASSO, Bayes A, B, Cpi and many more. Users can also use the platform to conduct data analyses with any methods of their choice. It is expected that extensive usage of MMAP would enrich the training data, which in turn results in continual improvement of the identification of the best method for use with particular traits.Entities:
Year: 2021 PMID: 32960944 PMCID: PMC8189680 DOI: 10.1093/bioinformatics/btaa824
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.The workflow and performance of MMAP. As a cloud computing platform, MMAP integrate existing knowledge and interactively search for the best GS method for a particular dataset (a). The search is based on the characteristics of the input data and IDE initiated with the gBLUP method (b). MMAP has the highest average prediction accuracy (c) with minimal effort required for uploading phenotypic data, genotypic data and covariable data (d)