Optimal Control of Batch Process Based on Double Actors and Double Critics Algorithm

TANG Zekai; LUO Na

doi:10.14135/j.cnki.1006-3080.20251110004

TANG Zekai, LUO Na. Optimal Control of Batch Process Based on Double Actors and Double Critics AlgorithmJ. Journal of East China University of Science and Technology, 2026, 52(3): 399-408. DOI: 10.14135/j.cnki.1006-3080.20251110004

Citation:

Optimal Control of Batch Process Based on Double Actors and Double Critics Algorithm

TANG Zekai,
LUO Na

Graphical Abstract

Abstract

Abstract

When using deep reinforcement learning methods to optimize the control of batch processes, there are problems such as low sample learning efficiency, long training time and unstable training. To solve these problems, a Double Actors and Regularized Double Critics (DARDC) algorithm is proposed. The algorithm enhances the agent's exploration ability in the complex state space through the parallel double actor network, effectively suppresses the value function estimation deviation by virtue of the regularization term constraint double critic network, and introduces the prioritized experience replay mechanism to accelerate the training convergence. Further, in order to ensure the rationality of control quantities output by the agent, expert experience rules are embedded into the action output. In the experiment of penicillin fermentation process, the performance of the proposed algorithm is better than the traditional deep reinforcement learning algorithm, and compared with PID control, the average penicillin yield of the key control index is increased by 19.1%. The results show that the DARDC algorithm has certain effectiveness and robustness, and provides a new method for the optimal control of batch process.

FullText(HTML)

References (29)

Cited By

Optimal Control of Batch Process Based on Double Actors and Double Critics Algorithm

Abstract

Catalog

Export File

Citation

Format

Content