Literature DB >> 33664767

Identification of Predictor Genes for Feed Efficiency in Beef Cattle by Applying Machine Learning Methods to Multi-Tissue Transcriptome Data.

Weihao Chen1,2, Pâmela A Alexandre2, Gabriela Ribeiro3, Heidge Fukumasu3, Wei Sun1,4,5, Antonio Reverter2, Yutao Li2.   

Abstract

Machine learning (ML) methods have shown promising results in identifying genes when applied to large transcriptome datasets. However, no attempt has been made to compare the performance of combining different ML methods together in the prediction of high feed efficiency (HFE) and low feed efficiency (LFE) animals. In this study, using RNA sequencing data of five tissues (adrenal gland, hypothalamus, liver, skeletal muscle, and pituitary) from nine HFE and nine LFE Nellore bulls, we evaluated the prediction accuracies of five analytical methods in classifying FE animals. These included two conventional methods for differential gene expression (DGE) analysis (t-test and edgeR) as benchmarks, and three ML methods: Random Forests (RFs), Extreme Gradient Boosting (XGBoost), and combination of both RF and XGBoost (RX). Utility of a subset of candidate genes selected from each method for classification of FE animals was assessed by support vector machine (SVM). Among all methods, the smallest subsets of genes (117) identified by RX outperformed those chosen by t-test, edgeR, RF, or XGBoost in classification accuracy of animals. Gene co-expression network analysis confirmed the interactivity existing among these genes and their relevance within the network related to their prediction ranking based on ML. The results demonstrate a great potential for applying a combination of ML methods to large transcriptome datasets to identify biologically important genes for accurately classifying FE animals.
Copyright © 2021 Chen, Alexandre, Ribeiro, Fukumasu, Sun, Reverter and Li.

Entities:  

Keywords:  Bos indicus; Extreme Gradient Boosting; RNA-seq; Random Forest; co-expression network; residual feed intake; supporting vector machine

Year:  2021        PMID: 33664767      PMCID: PMC7921797          DOI: 10.3389/fgene.2021.619857

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  39 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Computing topological parameters of biological networks.

Authors:  Yassen Assenov; Fidel Ramírez; Sven-Eric Schelhorn; Thomas Lengauer; Mario Albrecht
Journal:  Bioinformatics       Date:  2007-11-15       Impact factor: 6.937

3.  Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks.

Authors:  Antonio Reverter; Eva K F Chan
Journal:  Bioinformatics       Date:  2008-09-10       Impact factor: 6.937

Review 4.  Review: Biological determinants of between-animal variation in feed efficiency of growing beef cattle.

Authors:  G Cantalapiedra-Hijar; M Abo-Ismail; G E Carstens; L L Guan; R Hegarty; D A Kenny; M McGee; G Plastow; A Relling; I Ortigues-Marty
Journal:  Animal       Date:  2018-08-24       Impact factor: 3.240

5.  Cell Biology Symposium: molecular basis for feed efficiency.

Authors:  J L Sartin
Journal:  J Anim Sci       Date:  2013-03-11       Impact factor: 3.159

6.  The effect of breed and diet type on the global transcriptome of hepatic tissue in beef cattle divergent for feed efficiency.

Authors:  Marc G Higgins; David A Kenny; Claire Fitzsimons; Gordon Blackshields; Séan Coyle; Clare McKenna; Mark McGee; Derek W Morris; Sinéad M Waters
Journal:  BMC Genomics       Date:  2019-06-26       Impact factor: 3.969

7.  Systems Biology Reveals NR2F6 and TGFB1 as Key Regulators of Feed Efficiency in Beef Cattle.

Authors:  Pâmela A Alexandre; Marina Naval-Sanchez; Laercio R Porto-Neto; José Bento S Ferraz; Antonio Reverter; Heidge Fukumasu
Journal:  Front Genet       Date:  2019-03-22       Impact factor: 4.599

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  Identification of usual interstitial pneumonia pattern using RNA-Seq and machine learning: challenges and solutions.

Authors:  Yoonha Choi; Tiffany Ting Liu; Daniel G Pankratz; Thomas V Colby; Neil M Barth; David A Lynch; P Sean Walsh; Ganesh Raghu; Giulia C Kennedy; Jing Huang
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

10.  Genome-Wide Epistatic Interaction Networks Affecting Feed Efficiency in Duroc and Landrace Pigs.

Authors:  Priyanka Banerjee; Victor Adriano Okstoft Carmelo; Haja N Kadarmideen
Journal:  Front Genet       Date:  2020-02-28       Impact factor: 4.599

View more
  3 in total

1.  Insights Into Long Non-Coding RNA and mRNA Expression in the Jejunum of Lambs Challenged With Escherichia coli F17.

Authors:  Weihao Chen; Xiaoyang Lv; Weibo Zhang; Tingyan Hu; Xiukai Cao; Ziming Ren; Tesfaye Getachew; Joram M Mwacharo; Aynalem Haile; Wei Sun
Journal:  Front Vet Sci       Date:  2022-04-12

2.  Analysis of merged whole blood transcriptomic datasets to identify circulating molecular biomarkers of feed efficiency in growing pigs.

Authors:  Farouk Messad; Isabelle Louveau; David Renaudeau; Hélène Gilbert; Florence Gondret
Journal:  BMC Genomics       Date:  2021-07-03       Impact factor: 3.969

3.  Non-Coding Transcriptome Provides Novel Insights into the Escherichia coli F17 Susceptibility of Sheep Lamb.

Authors:  Weihao Chen; Xiaoyang Lv; Weibo Zhang; Tingyan Hu; Xiukai Cao; Ziming Ren; Tesfaye Getachew; Joram M Mwacharo; Aynalem Haile; Wei Sun
Journal:  Biology (Basel)       Date:  2022-02-22
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.