Appropriate Data Quality Checks Improve the Reliability of Values Predicted from Milk Mid-Infrared Spectra.
Holstein cow
Mahalanobis distance
mid-infrared spectrum
milk-component prediction
quality-assurance system
Journal
Animals : an open access journal from MDPI
ISSN: 2076-2615
Titre abrégé: Animals (Basel)
Pays: Switzerland
ID NLM: 101635614
Informations de publication
Date de publication:
18 Feb 2021
18 Feb 2021
Historique:
received:
02
02
2021
accepted:
14
02
2021
entrez:
6
3
2021
pubmed:
7
3
2021
medline:
7
3
2021
Statut:
epublish
Résumé
The use of abnormal milk mid-infrared (MIR) spectrum strongly affects prediction quality, even if the prediction equations used are accurate. So, this record must be detected after or before the prediction process to avoid erroneous spectral extrapolation or the use of poor-quality spectral data by dairy herd improvement (DHI) organizations. For financial or practical reasons, adapting the quality protocol currently used to improve the accuracy of fat and protein contents is unfeasible. This study proposed three different statistical methods that would be easy to implement by DHI organizations to solve this issue: the deletion of 1% of the extreme high and low predictive values (M1), the deletion of records based on the Global-H (GH) distance (M2), and the deletion of records based on the absolute fat residual value (M3). Additionally, the combinations of these three methods were investigated. A total of 346,818 milk samples were analyzed by MIR spectrometry to predict the contents of fat, protein, and fatty acids. Then, the same traits were also predicted externally using their corresponded standardized MIR spectra. The interest in cleaning procedures was assessed by estimating the root mean square differences (RMSDs) between those internal and external predicted phenotypes. All methods allowed for a decrease in the RMSD, with a gain ranging from 0.32% to 41.39%. Based on the obtained results, the "M1 and M2" combination should be preferred to be more parsimonious in the data loss, as it had the higher ratio of RMSD gain to data loss. This method deleted the records based on the 2% extreme predictions and a GH threshold set at 5. However, to ensure the lowest RMSD, the "M2 or M3" combination, considering a GH threshold of 5 and an absolute fat residual difference set at 0.30 g/dL of milk, was the most relevant. Both combinations involved M2 confirming the high interest of calculating the GH distance for all samples to predict. However, if it is impossible to estimate the GH distance due to a lack of relevant information to compute this statistical parameter, the obtained results recommended the use of M1 combined with M3. The limitation used in M3 must be adapted by the DHI, as this will depend on the spectral data and the equation used. The methodology proposed in this study can be generalized for other MIR-based phenotypes.
Identifiants
pubmed: 33670810
pii: ani11020533
doi: 10.3390/ani11020533
pmc: PMC7922538
pii:
doi:
Types de publication
Journal Article
Langues
eng
Références
Animal. 2011 Aug;5(10):1653-8
pubmed: 22440358
J Dairy Sci. 2017 Oct;100(10):7910-7921
pubmed: 28755945
J Dairy Sci. 2017 Jun;100(6):5073-5081
pubmed: 28434722
J Dairy Sci. 2018 Jun;101(6):5466-5473
pubmed: 29525319
J Dairy Sci. 2013;96(11):6863-6869
pubmed: 24054296
J Dairy Sci. 2009 Jan;92(1):423-32
pubmed: 19109300
J Dairy Sci. 2015 Jul;98(7):4956-68
pubmed: 25958288
J Dairy Sci. 2012 Oct;95(10):6113-21
pubmed: 22863102
J Dairy Sci. 2009 Jun;92(6):2444-54
pubmed: 19447976
J Dairy Sci. 2011 Jul;94(7):3651-61
pubmed: 21700055
J Dairy Sci. 2008 Dec;91(12):4871-80
pubmed: 19038963
J Dairy Sci. 2011 Apr;94(4):1657-67
pubmed: 21426953
J Dairy Sci. 2020 May;103(5):4435-4445
pubmed: 32147266
Animal. 2012 Apr;6(4):544-50
pubmed: 22436268
J Dairy Sci. 2020 Apr;103(4):3264-3274
pubmed: 32037165
J Dairy Sci. 2006 Sep;89(9):3690-5
pubmed: 16899705
J Dairy Sci. 2019 Nov;102(11):10460-10470
pubmed: 31495611
J Dairy Sci. 2016 May;99(5):4071-4079
pubmed: 26778306
J Dairy Sci. 2018 Aug;101(8):7618-7624
pubmed: 29753478
AAPS PharmSciTech. 2003;4(2):E24
pubmed: 12916906
Anim Nutr. 2016 Dec;2(4):329-333
pubmed: 29767071
Methods. 2021 Feb;186:97-111
pubmed: 32763376
J Dairy Res. 2013 Feb;80(1):103-12
pubmed: 23182024