A comparison of 4 different machine learning algorithms to predict lactoferrin content in bovine milk from mid-infrared spectra.
lactoferrin
machine learning
mid infrared
milk
Journal
Journal of dairy science
ISSN: 1525-3198
Titre abrégé: J Dairy Sci
Pays: United States
ID NLM: 2985126R
Informations de publication
Date de publication:
Dec 2020
Dec 2020
Historique:
received:
09
05
2020
accepted:
10
08
2020
entrez:
23
11
2020
pubmed:
24
11
2020
medline:
21
1
2021
Statut:
ppublish
Résumé
Lactoferrin (LF) is a glycoprotein naturally present in milk. Its content varies throughout lactation, but also with mastitis; therefore it is a potential additional indicator of udder health beyond somatic cell count. Condequently, there is an interest in quantifying this biomolecule routinely. First prediction equations proposed in the literature to predict the content in milk using milk mid-infrared spectrometry were built using partial least square regression (PLSR) due to the limited size of the data set. Thanks to a large data set, the current study aimed to test 4 different machine learning algorithms using a large data set comprising 6,619 records collected across different herds, breeds, and countries. The first algorithm was a PLSR, as used in past investigations. The second and third algorithms used partial least square (PLS) factors combined with a linear and polynomial support vector regression (PLS + SVR). The fourth algorithm also used PLS factors, but included in an artificial neural network with 1 hidden layer (PLS + ANN). The training and validation sets comprised 5,541 and 836 records, respectively. Even if the calibration prediction performances were the best for PLS + polynomial SVR, their validation prediction performances were the worst. The 3 other algorithms had similar validation performances. Indeed, the validation root mean squared error (RMSE) ranged between 162.17 and 166.75 mg/L of milk. However, the lower standard deviation of cross-validation RMSE and the better normality of the residual distribution observed for PLS + ANN suggest that this modeling was more suitable to predict the LF content in milk from milk mid-infrared spectra (R
Identifiants
pubmed: 33222859
pii: S0022-0302(20)30851-1
doi: 10.3168/jds.2020-18870
pii:
doi:
Substances chimiques
Lactoferrin
EC 3.4.21.-
Types de publication
Comparative Study
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
11585-11596Informations de copyright
Copyright © 2020 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.