Building machine-learning-based models for retention time and resolution predictions in ion pair chromatography of oligonucleotides.

Ion-pair chromatography Machine-learning Oligonucleotides Resolution Support vector regression (SVR) model

Journal

Journal of chromatography. A
ISSN: 1873-3778
Titre abrégé: J Chromatogr A
Pays: Netherlands
ID NLM: 9318488

Informations de publication

Date de publication:
24 May 2022
Historique:
received: 08 12 2021
revised: 22 03 2022
accepted: 25 03 2022
pubmed: 6 4 2022
medline: 11 5 2022
entrez: 5 4 2022
Statut: ppublish

Résumé

Support vector regression models are created and used to predict the retention times of oligonucleotides separated using gradient ion-pair chromatography with high accuracy. The experimental dataset consisted of fully phosphorothioated oligonucleotides. Two models were trained and validated using two pseudo-orthogonal gradient modes and three gradient slopes. The results show that the spread in retention time differs between the two gradient modes, which indicated varying degree of sequence dependent separation. Peak widths from the experimental dataset were calculated and correlated with the guanine-cytosine content and retention time of the sequence for each gradient slope. This data was used to predict the resolution of the n - 1 impurity among 250 000 random 12- and 16-mer sequences; showing one of the investigated gradient modes has a much higher probability of exceeding a resolution of 1.5, particularly for the 16-mer sequences. Sequences having a high guanine-cytosine content and a terminal C are more likely to not reach critical resolution. The trained SVR models can both be used to identify characteristics of different separation methods and to assist in the choice of method conditions, i.e. to optimize resolution for arbitrary sequences. The methodology presented in this study can be expected to be applicable to predict retention times of other oligonucleotide synthesis and degradation impurities if provided enough training data.

Identifiants

pubmed: 35381559
pii: S0021-9673(22)00197-2
doi: 10.1016/j.chroma.2022.462999
pii:
doi:

Substances chimiques

Oligonucleotides 0
Guanine 5Z93L87A1R
Cytosine 8J337D1HZY

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

462999

Informations de copyright

Copyright © 2022 The Authors. Published by Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Martin Enmark (M)

Department of Engineering and Chemical Sciences, Karlstad University, SE-651 88 Karlstad, Sweden.

Jakob Häggström (J)

Department of Engineering and Chemical Sciences, Karlstad University, SE-651 88 Karlstad, Sweden.

Jörgen Samuelsson (J)

Department of Engineering and Chemical Sciences, Karlstad University, SE-651 88 Karlstad, Sweden. Electronic address: Jorgen.Samuelsson@kau.se.

Torgny Fornstedt (T)

Department of Engineering and Chemical Sciences, Karlstad University, SE-651 88 Karlstad, Sweden. Electronic address: Torgny.Fornstedt@kau.se.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH