Building machine-learning-based models for retention time and resolution predictions in ion pair chromatography of oligonucleotides.
Ion-pair chromatography
Machine-learning
Oligonucleotides
Resolution
Support vector regression (SVR) model
Journal
Journal of chromatography. A
ISSN: 1873-3778
Titre abrégé: J Chromatogr A
Pays: Netherlands
ID NLM: 9318488
Informations de publication
Date de publication:
24 May 2022
24 May 2022
Historique:
received:
08
12
2021
revised:
22
03
2022
accepted:
25
03
2022
pubmed:
6
4
2022
medline:
11
5
2022
entrez:
5
4
2022
Statut:
ppublish
Résumé
Support vector regression models are created and used to predict the retention times of oligonucleotides separated using gradient ion-pair chromatography with high accuracy. The experimental dataset consisted of fully phosphorothioated oligonucleotides. Two models were trained and validated using two pseudo-orthogonal gradient modes and three gradient slopes. The results show that the spread in retention time differs between the two gradient modes, which indicated varying degree of sequence dependent separation. Peak widths from the experimental dataset were calculated and correlated with the guanine-cytosine content and retention time of the sequence for each gradient slope. This data was used to predict the resolution of the n - 1 impurity among 250 000 random 12- and 16-mer sequences; showing one of the investigated gradient modes has a much higher probability of exceeding a resolution of 1.5, particularly for the 16-mer sequences. Sequences having a high guanine-cytosine content and a terminal C are more likely to not reach critical resolution. The trained SVR models can both be used to identify characteristics of different separation methods and to assist in the choice of method conditions, i.e. to optimize resolution for arbitrary sequences. The methodology presented in this study can be expected to be applicable to predict retention times of other oligonucleotide synthesis and degradation impurities if provided enough training data.
Identifiants
pubmed: 35381559
pii: S0021-9673(22)00197-2
doi: 10.1016/j.chroma.2022.462999
pii:
doi:
Substances chimiques
Oligonucleotides
0
Guanine
5Z93L87A1R
Cytosine
8J337D1HZY
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
462999Informations de copyright
Copyright © 2022 The Authors. Published by Elsevier B.V. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.