(A) study of reinforcement learning implementation and building reinforcement learning environment using supervised learning for generating nuclear power plant accidents scenario원전 사고 시나리오 생성을 위한 강화학습 구현 및 지도학습을 통한 강화학습 환경 구축 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 74
  • Download : 0
Accident scenarios for the study of existing nuclear power plant accident phenomena are deterministically selected through probabilistic safety analysis (PSA). Although it is possible to determine the sequence of failures of a particular component in this process, it is difficult to assess the exact timing of the component failure and the change and severity of the accident process resulting from it. MAAP, which is the existing nuclear accident analysis code, can also confirm the progress of the accident by specifying the malfunctioning component and the time for failure in the input value. In order to confirm the high-risk device failure time, it is necessary to set all the failure time of various components before the analysis and then interpret the analysis results. In this study, it is confirmed whether this process can be replaced by using reinforcement learning. Through the process of comparing whether the most serious accident scenario obtained via MAAP simulation is the same as the accident determined by the reinforcement learning agent, the possibility of generating a new nuclear accident scenario through reinforcement learning can be evaluated. However, because it is difficult to apply the reinforcement learning methodology to the existing MAAP code directly, a surrogate model was created through supervised learning and reinforcement learning was performed by interacting with the MAAP surrogate model. After evaluating how accurately the MAAP code can be represented with the surrogate model, it was evaluated whether the reinforcement learning agent can determine the most severe scenario identified from the MAAP data-driven model while various reward settings were tested. From this process, a methodology to apply reinforcement learning to the existing nuclear accident analysis code, which is difficult to use reinforcement learning, is newly presented, and factors to be considered in order to use this methodology are identified and validated.
Advisors
Lee, Jeong Ikresearcher이정익researcher
Description
한국과학기술원 :원자력및양자공학과,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 원자력및양자공학과, 2022.8,[v, 58 p. :]

Keywords

MAAP▼aSurrogate model▼aSupervised learning▼aReinforcement learning; MAAP▼aSurrogate Model▼a지도학습▼a강화학습

URI
http://hdl.handle.net/10203/309771
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008315&flag=dissertation
Appears in Collection
NE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0