Enhanced virtual sample generation based on manifold features: Applications to developing soft sensor using small data.

Industrial processes Small data Soft sensor T-distribution stochastic neighbor​ embedding Virtual sample generation

Journal

ISA transactions
ISSN: 1879-2022
Titre abrégé: ISA Trans
Pays: United States
ID NLM: 0374750

Informations de publication

Date de publication:
Jul 2022
Historique:
received: 11 04 2021
revised: 11 07 2021
accepted: 21 07 2021
pubmed: 3 8 2021
medline: 3 8 2021
entrez: 2 8 2021
Statut: ppublish

Résumé

In the process industry, it is essential to establish a data-driven soft sensor to predict the key variable that is difficult to online measure directly. The accuracy performance of data-driven soft sensors relies heavily on data. Unfortunately, it is hard to acquire sufficient and informative data from the samples with limited number, which is called as the small sample problem. For handling the small sample problem, it is a good solution to generating virtual samples according to the distribution of original data. This paper proposes an enhanced method of virtual sample generation utilizing manifold features to develop soft sensors using small data. First, T-Distribution Stochastic Neighbor Embedding (t-SNE) is utilized to extract the features of input data. The main idea of generating virtual samples is to use the interpolation algorithm to obtain virtual t-SNE input features and then the random forest algorithm is utilized to get the virtual outputs using virtual t-SNE input features. Finally, virtual samples using the proposed t-SNE based virtual sample generation (t-SNE-VSG) can be achieved. For the sake of confirming the effectiveness and feasibility of the presented t-SNE-VSG, a standard data set is first used. What is more, a small data set from an actual industrial process of Purified Terephthalic Acid is used to establish a soft sensor model. The results from simulations show that the accuracy performance of the soft sensor established with small data can be effectively improved by adding the virtual samples generated by t-SNE-VSG. In addition, t-SNE-VSG achieves superior accuracy to state-of-the-art virtual sample generation methods.

Identifiants

pubmed: 34334185
pii: S0019-0578(21)00400-6
doi: 10.1016/j.isatra.2021.07.033
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

398-406

Informations de copyright

Copyright © 2021 ISA. Published by Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Yan-Lin He (YL)

College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, 100029, China.

Qiang Hua (Q)

College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, 100029, China.

Qun-Xiong Zhu (QX)

College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, 100029, China.

Shan Lu (S)

Institute of Intelligence Science and Engineering, Shenzhen Polytechnic, Shenzhen, 518055, China. Electronic address: lushan@szpt.edu.cn.

Classifications MeSH