Embedding active learning in batch-to-batch optimization using reinforcement learning

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 81
  • Download : 0
Batch-to-batch (B2B) or run-to-run (R2R) optimization refers to the strategy of updating the operating parameters of a batch run based on the results of previous runs and exploits the repetitive nature of batch process operation. Although B2B optimization uses feedback from previous batch runs to learn about model uncertainty and improve the operation of future runs, the standard techniques have the limitations of passive learning and being myopic in making adjustments. This work proposes a novel way to use the reinforcement learning approach to embed the active learning feature into B2B optimization. For this, the B2B optimization problem is formulated as a maximization of a long-term performance of repeated batch runs, which are modeled as a stochastic process with uncertain parameters. To solve the resulting Bayes-Adaptive Markov decision process (BAMDP) problem in a near-optimal manner, a policy gradient reinforcement learning algorithm is employed. Through case studies, the behavior and effectiveness of the proposed B2B optimization method are examined by comparing it with the traditional certainty equivalence based B2B optimization method with passive learning.
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
Issue Date
2023-11
Language
English
Article Type
Article
Citation

AUTOMATICA, v.157

ISSN
0005-1098
DOI
10.1016/j.automatica.2023.111260
URI
http://hdl.handle.net/10203/313424
Appears in Collection
CBE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0