| Literature DB >> 32973876 |
Tiago Bresolin1, João R R Dórea1.
Abstract
High-throughput phenotyping technologies are growing in importance in livestock systems due to their ability to generate real-time, non-invasive, and accurate animal-level information. Collecting such individual-level information can generate novel traits and potentially improve animal selection and management decisions in livestock operations. One of the most relevant tools used in the dairy and beef industry to predict complex traits is infrared spectrometry, which is based on the analysis of the interaction between electromagnetic radiation and matter. The infrared electromagnetic radiation spans an enormous range of wavelengths and frequencies known as the electromagnetic spectrum. The spectrum is divided into different regions, with near- and mid-infrared regions being the main spectral regions used in livestock applications. The advantage of using infrared spectrometry includes speed, non-destructive measurement, and great potential for on-line analysis. This paper aims to review the use of mid- and near-infrared spectrometry techniques as tools to predict complex dairy and beef phenotypes, such as milk composition, feed efficiency, methane emission, fertility, energy balance, health status, and meat quality traits. Although several research studies have used these technologies to predict a wide range of phenotypes, most of them are based on Partial Least Squares (PLS) and did not considered other machine learning (ML) techniques to improve prediction quality. Therefore, we will discuss the role of analytical methods employed on spectral data to improve the predictive ability for complex traits in livestock operations. Furthermore, we will discuss different approaches to reduce data dimensionality and the impact of validation strategies on predictive quality.Entities:
Keywords: beef cattle; dairy cattle; mid-infrared; near-infrared; novel phenotypes; spectral information
Year: 2020 PMID: 32973876 PMCID: PMC7468402 DOI: 10.3389/fgene.2020.00923
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Keywords used to retrieve published papers from Web of Science*.
| Cattle | Mid or Near-infrared |
| Dairy | Milk compounds, milk fatty acids, protein, minerals, metabolic status, energy balance, feed efficiency, feed intake, energy intake, methane emission, reproduction, fertility, lameness, blood metabolites |
| Beef | Meat quality, feed efficiency, feed intake, energy intake, methane emission, metabolic status, energy balance, reproduction, fertility |
FIGURE 1Published papers retrieved from Web of Science based on the combination of keywords presented in Table 1. Scientific papers published up to May 2020.
Number of samples (N) and coefficient of determination in the validation set for the milk fatty acids predicted from mid-infrared spectrometry using partial least square methodology in dairy cattle.
| References | N | Breed | Validation* | C4:0 | C6:0 | C8:0 | C10:0 | C12:0 | C14:0 | C16:0 | C16:1 |
| 49 | Mul | CV | 0.51 | 0.52 | 0.59 | 0.64 | 0.74 | 0.82 | 0.82 | – | |
| 78 | Mul | LOOCV | – | – | – | – | – | 0.90 | 0.84 | – | |
| 3,622 | – | R-Tr/Te | 0.91 | 0.96 | 0.94 | 0.92 | 0.85 | 0.94 | 0.94 | – | |
| 224 | Nor | 20-F CV | 0.72 | 0.83 | 0.88 | 0.89 | 0.90 | 0.82 | 0.65 | – | |
| 468 | – | Tr/Teb | 0.66 | 0.88 | 0.90 | 0.91 | 0.89 | 0.88 | 0.91 | – | |
| 267 | Bro | LOOCV | – | – | 0.48 | 0.52 | 0.52 | 0.56 | 0.49 | – | |
| 517 | Mul | Tr/Teb | 0.89 | 0.95 | 0.93 | 0.92 | 0.92 | 0.95 | 0.93 | – | |
| 250 | Mul | Tr/Te | 0.85 | 0.96 | 0.96 | 0.91 | 0.91 | 0.93 | 0.88 | – | |
| 1,236 | Mul | Tr/Teb | 0.92 | 0.93 | 0.92 | 0.93 | 0.85 | 0.95 | 0.93 | – | |
| 345 | Mul | R-Tr/Te | 0.93 | 0.96 | 0.97 | 0.95 | 0.96 | 0.95 | 0.94 | – | |
| 850 | Cro | R-Tr/Te | 0.73 | 0.78 | 0.81 | 0.81 | 0.86 | 0.77 | 0.74 | 0.33 | |
| 890 | Mul | 10-F CV | – | 0.88 | 0.89 | 0.91 | 0.91 | 0.90 | 0.91 | 0.63 | |
| 422 | – | 20-F CV | 0.82 | – | – | – | – | 0.82 | 0.66 | – | |
| 1,264 | Bro | R-Tr/Teb | – | – | – | 0.67 | – | – | 0.60 | – | |
| 112 | Mul | LOOCV | 0.92 | 0.94 | 0.94 | – | 0.93 | 0.93 | 0.92 | – | |
| 1,040 | Sim | 10-F CV | – | – | – | 0.88 | 0.90 | 0.90 | 0.92 | – | |
| 1,911 | Mul | 10-F CV | 0.66 | 0.38 | 0.37 | 0.66 | 0.71 | 0.80 | 0.86 | 0.62 | |
| 240 | Hol | 10-F CV | 0.94 | 0.94 | 0.90 | 0.89 | 0.90 | 0.93 | 0.95 | – |
Coefficient of determination in the validation set for the milk fatty acids predicted from mid-infrared in dairy cattle*.
| References | C17:0 | C18:0 | C18:1a | C18:2b | C18:2c | C18:3d | SFA | MUFA | PUFA |
| – | 0.69 | – | 0.07 | 0.62 | 0.14 | 0.94 | 0.85 | 0.39 | |
| – | 0.85 | – | – | – | – | – | 0.93 | ||
| – | 0.82 | 0.92 | 0.58 | 0.36 | 0.45 | – | – | – | |
| – | 0.48 | 0.92 | 0.53 | 0.49 | 0.29 | 0.92 | 0.94 | 0.52 | |
| 0.65 | 0.80 | 0.93 | 0.73 | 0.34 | – | 0.95 | 0.91 | 0.75 | |
| 0.56 | 0.42 | 0.50 | 0.21 | – | – | – | – | – | |
| 0.61 | 0.88 | 0.95 | 0.63 | 0.71 | 0.60 | 0.99 | 0.97 | 0.81 | |
| – | 0.77 | 0.91 | 0.70 | 0.65 | – | 0.98 | 0.92 | 0.38 | |
| – | 0.72 | – | – | – | – | 0.99 | |||
| – | 0.85 | 0.97 | 0.83 | 0.78 | – | 1.00 | 0.98 | 0.78 | |
| 0.43 | 0.60 | 0.87 | 0.64 | 0.66 | 0.51 | 0.93 | – | 0.73 | |
| 0.54 | 0.82 | 0.82 | 0.37 | 0.65 | – | – | – | – | |
| – | 0.62 | 0.84 | – | – | – | 0.77 | 0.86 | – | |
| – | 0.49 | – | – | – | – | – | – | – | |
| – | 0.80 | – | – | – | – | 0.99 | 0.95 | 0.71 | |
| – | 0.78 | 0.90 | 0.65 | – | – | 0.97 | 0.93 | 0.75 | |
| 0.53 | 0.73 | 0.79 | – | 0.65 | – | 0.94 | 0.84 | 0.66 | |
| 0.82 | 0.81 | 0.72 | – | – | – | – | – | – |
Number of samples (N) and coefficient of determination in the validation set for the major protein content predicted from milk spectra using partial least square methodology in dairy cattle.
| References | N | Breed | Validation | Prot | Cas | Whey | α | α | β-CN | κ-CN | α-LA | β-LG |
| 74 | – | Tr/Te* | – | 0.90 | – | – | – | – | – | – | – | |
| 86 | Multibreed | – | – | 0.53 | – | – | – | – | – | – | – | |
| 1,336 | Simmental | 20-F CV | 0.58 | 0.58 | 0.53 | 0.50 | 0.35 | 0.32 | 0.43 | 0.29 | 0.55 | |
| 1,517 | Simmental | 4-F CV | 0.78 | 0.77 | 0.61 | 0.66 | 0.49 | 0.53 | 0.49 | 0.31 | 0.64 | |
| 1,800 | Holstein | R-Tr/Te | - | 0.25 | 0.53 | 0.18 | 0.26 | 0.19 | 0.28 | 0.20 | 0.56 | |
| 193 | Multibreed | Tr/Te | 0.99 | 0.88 | 0.58 | 0.65 | 0.71 | 0.78 | 0.54 | 0.48 | 0.45 | |
| 1,137 | Simmental | 10-F CV | 0.81 | 0.80 | 0.53 | 0.74 | 0.49 | 0.58 | 0.39 | 0.24 | 0.48 | |
| 832 | Multibreed | Tr/Te | – | – | – | 0.66 | 0.36 | 0.25 | 0.71 | 0.06 | 0.34 | |
| 730 | Multibreed | 4-F CV* | – | 0.55 | 0.42 | 0.43 | 0.43 | 0.45 | 0.31 | 0.29 | 0.48 | |
| 114 | Multibreed | LOOCV | 0.88 | 0.88 | 0.69 | – | – | 0.60 | 0.74 | 0.37 | 0.47 |
Number of samples (N) and coefficient of determination in the validation set for mineral contents using partial least square methodology.
| References | N | Breed | Validation* | Ca | K | Mg | Na | P |
| 92 | Multibreed | LOOCV | 0.87 | 0.36 | 0.65 | 0.65 | 0.85 | |
| 208 | – | 10-F CVb | 0.55 | – | – | – | – | |
| 208 | Holstein | LOOCV | 0.53c | – | – | – | 0.70c | |
| 689 | Simental | 10-F CV | 0.48 | 0.41 | 0.46 | – | 0.43 | |
| 923 | Multibreed | R-Tr/Teb | 0.67 | 0.69 | 0.65 | 0.40 | 0.68 | |
| 153 | Holstein | Tr/Teb | 0.25 | 0.34 | 0.26 | 0.25 | 0.53 | |
| 93 | Holstein | CV | 0.79 | 0.55 | 0.68 | 0.75 | 0.87 | |
| 986 | Multibreed | 10- FCV | 0.25 | – | – | – | – |
Number of samples (N), and coefficient of determination in the validation set (R2) for the prediction of dry matter intake (DMI) and organic matter intake (OMI) traits using grass near-infrared (G-NIR) and fecal near-infrared (F-NIR) spectrometry .
| References | N | Breed | Spectra | Trait | Validation | |
| 203 | dairy | G-NIR | DMI | 7-F CV | 0.71 | |
| 88 | beef | F-NIR | OMI | 3-F CV | 0.52 | |
| 139 | dairy | F-NIR | DMI | CV | 0.98 | |
| 91 | dairy | F-NIR | DMI | R-Tr/Tea | 0.97 | |
| 1,322 | dairy | F-NIR | DMI | Tr/Tea | 0.58 | |
| 406 | beef | F-NIR | DMI | CV | 0.44 | |
| 125 | beef | F-NIR | DMI | 6-F CV | 0.75 | |
| 408 | beef | F-NIR | DMI | CV | 0.73 |
Number of samples (N) and coefficient of determination (R2) for dry matter intake (DMI), residual feed efficiency (RFI), effective energy intake (EEI), net energy intake (NEI), and energy intake (EI) traits using milk mid-infrared spectrometry in dairy cattle.
| References | N | Breed | Trait | Method | Validation | |
| 5,469 | Holstein | EEI | PLS | 4-F CV* | 0.74a | |
| 4,109 | Holstein | EEI | PLS | 4-F CV* | 0.64a | |
| 1,335 | Holstein | RFI | PLS | Tr/Te* | 0.36a | |
| 1,335 | Holstein | EEI | PLS | Tr/Te* | 0.49a | |
| 1,270 | Holstein | EI | PLS | 20-F CV | 0.56 | |
| 1,044 | Multibreed | DMI | PLS | R-Tr/Te* | 0.77 | |
| 1,044 | Multibreed | RFI | PLS | R-Tr/Te* | 0.46 | |
| 1,279 | Holstein | DMI | ANN | LOOCV* | 0.70 | |
| 857 | Norwegian red | DMI | PLS | 5-F CV* | 0.29a | |
| 857 | Norwegian red | NEI | PLS | 5-F CV* | 0.42a | |
| 1,074 | Multibreed | DMI | PLS | LOOCV* | 0.64 | |
| 11,941 | Holstein | EEI | PLS | 4-F CV* | 0.52 | |
| 1,034 | Holstein | DMI | SVM | R-Tr/Te | 0.66 |
Number of samples (N) and coefficient of determination (R2) in the validation for energy balance trait using milk mid-infrared spectrometry in dairy cattle.
| References | N | Breed | Validation | |
| 5,469 | Holstein | 4-F CV* | 0.56a | |
| 4,109 | Holstein | 4-F CV* | 0.29a | |
| 1,335 | Holstein | Tr/Te* | 0.46a | |
| 1,270 | Holstein | 20-F CV | 0.53a | |
| 240 | Holstein | 10-F CV | 0.48 | |
| 11,941 | Holstein | 4-F CV* | 0.60 |
Number of samples (N) and coefficient of determination in validation set (R2) for methane emission trait predicted from milk mid-infrared spectra data.
| References | N | Breed | Method | Validation | |
| 60 | Holstein | PLS | LOOCV | 0.79 | |
| 446 | Multibreed | PLS | Tr/Te* | 0.23a | |
| 532 | Multibreed | PLS | 5-F CV | 0.70 | |
| 2,202 | Holstein | PLS | R-Tr/Te* | 0.39 | |
| 1,150 | Brown Swiss | Bayes B | R-Tr/Te | 0.57 | |
| 584 | Multibreed | PLS | 5-F CV | 0.57 | |
| 218 | Holstein | PLS | 10-F CV | 0.49 | |
| 801 | Holstein | PLS | LOOCV* | 0.01 |
Number of samples (N) and coefficient of determination (R2) in the validation set for β-hydroxybutyrate (BHB), acetone (Ac), non-esterified fatty acids (NEFA), blood urea nitrogen (BUN), glucose (Glu), glutamic oxaloacetic transaminase (GOT), and insuline-like growth factor 1 (IGF-1) using milk mid-infrared spectrometry in dairy cattle.
| References | N | Breed | Sample | Trait | Method | Validation* | |
| 310 | – | Milk | Ac | PLS | Tr/Te | 0.81 | |
| 180 | – | Milk | Ac | PLS | LOOCV | 0.24b | |
| 1,080 | Holstein | Milk | Ac | PLS | CV | 0.72c | |
| 1,080 | Holstein | Milk | BHB | PLS | CV | 0.62c | |
| 224 | Holstein | Milk | Ac | PLS | R-Tr/Tea | 0.67 | |
| 434 | Holstein | Milk | BHB | PLS | R-Tr/Tea | 0.63 | |
| 1,914 | Holstein | Blood | BHB | PLS | R-Tr/Tea | 0.40 | |
| 3,629 | Holstein | Blood | BHB | ANN | R-Tr/Tea | 0.56 | |
| 1,910 | Multibreed | Blood | BHB | PLS | R-Tr/Tea | 0.52 | |
| 295 | Multibreed | Blood | BHB | PLS | 3-F CV | 0.63 | |
| 294 | Multibreed | Blood | NEFA | PLS | 3-F CV | 0.52 | |
| 294 | Multibreed | Blood | BUN | PLS | 3-F CV | 0.58 | |
| 294 | Multibreed | Blood | Glu | PLS | 3-F CV | 0.20 | |
| 294 | Multibreed | Blood | GOT | PLS | 3-F CV | 0.24 | |
| 205 | Holstein | Blood | BHB | PLS | 4-F CV | 0.70 | |
| 234 | Holstein | Blood | NEFA | PLS | 4-F CV | 0.39 | |
| 387 | Holstein | Blood | IGF-1 | PLS | 4-F CV | 0.61 | |
| 380 | Holstein | Blood | Glu | PLS | 4-F CV | 0.44 | |
| 878 | Holstein | Blood | BHB | PLS | R-Tr/Tea | 0.60 | |
| 878 | Holstein | Blood | NEFA | PLS | R-Tr/Tea | 0.45 | |
| 878 | Holstein | Blood | BUN | PLS | R-Tr/Tea | 0.35 | |
| 585 | Holstein | Blood | BHB | PLS | CV | 0.42 |
Number of samples (N), accuracy (Acc), sensitive (Sen), and specificity (Spe) in the validation set for mastitis (Mas) and lameness (Lam) traits using milk mid-infrared spectrometry using partial least square in dairy cattle.
| Reference | N | Breed | Trait | Validation | Acc (%) | Sen (%) | Spe (%) |
| 9,811 | Multibreed | Lam | R-Tr/Te | – | 60 | 62 | |
| 2,340 | Multibreed | Mas | R-Tr/Te* | 68 | 57 | 79 | |
| 3,771 | Multibreed | Lam | 10-F CV | 62 | 57 | 62 |
Number of samples (N) and coefficient of determination (R2) in the validation set for meat tenderness trait predicted from near-infrared spectrometry in cattle.
| References | N | Breed | Method | Validation∗ | |
| 11 | Japanese Black | MLR | – | 0.67b | |
| 10 | Norwegian | PCR | CV | 0.29b | |
| 70 | – | PLS | CV | 0.37b | |
| 119 | – | PLS | Tr/Te | 0.63 | |
| 79 | Norwegian Red | PLS | LOOCV | 0.36b | |
| 48 | Norwegian | PLS | CV | 0.72b | |
| 67 | – | PLS | LOOCV | 0.31b | |
| 189 | Belgian White Blue | PLS | CV | 0.25 | |
| 22 | Multibreed | PLS | LOOCV | 0.48 | |
| 146 | Multibreed | MLR | Tr/Te | 0.22 | |
| 148 | Piamontese | PLS | 4-FCV | 0.12 | |
| 112 | Maronesa | PLS | LOOCV | 0.53 | |
| 190 | Multibreed | PLS | R-Tr/Te | 0.74 | |
| 67 | - | PLS | LOOCV | 0.17 | |
| 194 | Crossbred | PLS | LOOCV | 0.31 | |
| 381 | Hereford | PLS | R-Tr/Te | 0.58 | |
| 1,298 | Piamontese | PLS | Tr/Te | 0.50 | |
| 40 | Multibreed | PLS | LOOCV | 0.28b | |
| 1,208 | Piamontese | PLS | R-Tr/Te | 0.21 | |
| 336 | Multibreed | PLS | 8-FCV | 0.34 | |
| 81 | Crossbred | PLS | LOOCV | 0.13 | |
| 63 | Crossbred | PLS | LOOCV | 0.81 | |
| 162 | Yak | PLS | R-Tr/Te | 0.43 | |
| 644 | Nelore | PLS | LOOCV | 0.40 | |
| 442 | Multibreed | PLS | R-Tr/Tea | 0.60 | |
| 234 | - | SVM | Tr/Te | 0.20 | |
| 89 | Holstein | PLS | TR/Te | 0.62 | |
| 1,166 | Piamontese | Bayes B | LOOCVa | 0.16 | |
| 595 | Multibreed | PLS | LOOCV | 0.22 |
Number of samples (N) and coefficient of determination (R2) in the validation set for L∗ (R2L*), a∗ (R2a*), and b∗ (R2b*) meat color, and cooking losses (R2CL) traits predicted from near-infrared spectrometry in cattle.
| References | N | Breed | Method | Validation§ | ||||
| 11 | Japanese black | MLR | CV | – | – | – | 0.59 | |
| 189 | Belgian White Blue | PLS | CV | 0.83 | 0.39 | 0.75 | 0.25 | |
| 113 | Multibreed | PLS | LOOCV | 0.55 | 0.90 | 0.78 | – | |
| 148 | Piamontese | PLS | 4-FCV | – | – | – | 0.15 | |
| 109 | Maronesa | PLS | LOOCV | 0.80 | 0.23 | 0.27 | 0.02 | |
| 67 | – | PLS | LOOCV | 0.87 | 0.71 | 0.90 | 0.001 | |
| 194 | Crossbred | PLS | LOOCV | 0.83 | 0.76 | 0.84 | 0.23 | |
| 1,298 | Piamontese | PLS | CV | 0.65 | 0.69 | 0.81 | 0.50 | |
| 1,208 | Piamontese | PLS | R-Tr/Te | 0.64 | 0.68 | 0.44 | 0.04 | |
| 336 | Multibreed | PLS | 8-FCV | 0.70 | 0.73 | 0.60 | 0.38 | |
| 81 | Crossbred | PLS | LOOCV | 0.41 | 0.58 | 0.57 | 0.31 | |
| 63 | Crossbred | PLS | LOOCV | 0.80 | 0.71 | 0.77 | – | |
| 162 | Yak | PLS | R-Tr/Te | 0.74 | 0.81 | 0.93 | 0.61 | |
| 234 | – | SVM | Tr/Te | 0.80 | 0.64 | 0.54 | – | |
| 644 | Nelore | PLS | LOOCV | 0.16 | 0.17 | 0.45 | – | |
| 442 | Multibreed | PLS | R-Tr/Te | 0.61 | 0.64 | 0.38 | 0.56 | |
| 89 | Holstein | PLS | Tr/Te | 0.33 | 0.57 | 0.61 | 0.47 | |
| 1,166 | Piamontese | Bayes B | LOOCV | 0.84 | 0.55 | 0.63 | 0.16 |
Number of samples (N) and coefficient of determination (R2) in the validation set for intramuscular fat content predicted from near-infrared spectrometry in cattle.
| References | N | Breed | Method | Validation* | |
| 11 | Japanese Black | MLR | – | 0.922 | |
| 72 | British Friesian | PLS | 4-FCV | 0.95 | |
| 79 | Norwegian Red | PLS | LOOCV | 0.58b | |
| 100 | – | PLS | 4-FCV | 0.86 | |
| 78 | Hereford | PLS | 4-FCV | 0.92 | |
| 34 | Multibreed | PLS | CV | 0.93 | |
| 190 | Multibreed | PLS | R-Tr/Tea | 0.76 | |
| 194 | Multibreed | PLS | LOOCV | 0.43 | |
| 148 | Piamontese | PLS | 4-FCV | 0.82 | |
| 63 | Crossbred | PLS | LOOCV | 0.86 | |
| 182 | Multibreed | PLS | R-Tr/Tea | 0.99 | |
| 108 | – | PLS | Tr/Te | 0.82 | |
| 644 | Nelore | PLS | LOOCV | 0.02 |
FIGURE 2Example of validation strategies employed in the publish papers retrieved from Web of Science. (A) split-data or k-fold cross-validation, (B) leave-one-out cross-validations, and (C) leave-one-group-out cross-validation. Colors represent animals from different herds.