Time-series clustering approach for training data selection of a data-driven predictive model: Application to an industrial bio 2,3-butanediol distillation process

Citation Data: Computers & Chemical Engineering, ISSN: 0098-1354, Vol: 161, Page: 107758

Publication Year: 2022

Metrics Details

Citations: 14
- Citation Indexes: 14
Usage: 0
Captures: 24
- Readers: 24
Mentions: 0
Social Media: 0

Article Description

In this study, we propose a time-series clustering approach that selects optimal training data for the development of predictive models. The optimal number of clusters was set based on the variation of the within-cluster sum of squares. A predictive model was developed using a selection ratio of training data from each of those clusters. Based on the results, a regression model was developed to predict the performance of the predictive model. The search space was applied to the regression model, and the optimal training data ratios satisfying the objective function and constraints were selected. The effectiveness of the method is demonstrated on a commercial bio 2,3-butanediol distillation process. As a result, the amount of data used for model training was reduced by 49.20% compared to the base case without clustering. The coefficient of determination (R²) remained at the same level, and the root-mean-square error improved by up to 14.07%.
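The first two steps of the abstract (choosing the number of clusters from the variation of the within-cluster sum of squares, then drawing training data from each cluster at a given ratio) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes each sample is a fixed-length feature vector compared by Euclidean distance, uses a plain k-means with a deterministic farthest-point initialisation, and reads "variation" as a simple elbow rule with a hypothetical threshold `tol`. The function names, the toy data, and the per-cluster ratios are all illustrative assumptions.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def init_centroids(points, k):
    """Deterministic farthest-point initialisation (an assumption, not from the paper)."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, iters=100):
    """Plain Lloyd k-means; returns (labels, within-cluster sum of squares)."""
    centroids = init_centroids(points, k)
    labels = None
    for _ in range(iters):
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = mean(members)
    wcss = sum(sq_dist(p, centroids[lab]) for p, lab in zip(points, labels))
    return labels, wcss


def pick_k(wcss, tol=0.05):
    """Smallest k whose next split reduces WCSS by less than tol * WCSS(k_min)."""
    ks = sorted(wcss)
    total = wcss[ks[0]]
    for k, k_next in zip(ks, ks[1:]):
        if total == 0 or (wcss[k] - wcss[k_next]) / total < tol:
            return k
    return ks[-1]


def select_by_ratio(labels, ratios, seed=0):
    """Sample indices from each cluster according to its selection ratio."""
    rng = random.Random(seed)
    chosen = []
    for cluster, ratio in ratios.items():
        members = [i for i, lab in enumerate(labels) if lab == cluster]
        n_take = min(len(members), int(ratio * len(members) + 0.5))
        chosen.extend(rng.sample(members, n_take))
    return sorted(chosen)


# Toy data: three well-separated groups of four 2-D samples (illustrative only).
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (20, 0), (20, 1), (21, 0), (21, 1)]

wcss = {k: kmeans(points, k)[1] for k in range(1, 6)}
best_k = pick_k(wcss)
labels, _ = kmeans(points, best_k)

# Hypothetical per-cluster selection ratios; in the paper these are optimised.
train_idx = select_by_ratio(labels, {0: 0.5, 1: 0.25, 2: 1.0})
```

In the approach described above, the per-cluster ratios are not fixed by hand: they are the decision variables that the regression model and the subsequent search over the constrained space are used to choose.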

Bibliographic Details

DOI: 10.1016/j.compchemeng.2022.107758

URL ID: http://www.sciencedirect.com/science/article/pii/S0098135422000990; http://dx.doi.org/10.1016/j.compchemeng.2022.107758; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85126548415&origin=inward; https://linkinghub.elsevier.com/retrieve/pii/S0098135422000990; https://dx.doi.org/10.1016/j.compchemeng.2022.107758

AUTHOR(S)

Yeongryeol Choi; Nahyeon An; Seokyoung Hong; Hyungtae Cho; Jongkoo Lim; In-Su Han; Il Moon; Junghwan Kim

PUBLISHER(S)

Elsevier BV

TAG(S)

Chemical Engineering; Computer Science
