Modeling retrosynthesis via multi-decoder transformer with set invariant loss멀티 디코더 트랜스포머와 집합 불변 손실함수를 이용한 역합성 모델링

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 109
  • Download : 0
Retrosynthesis, predicting reactants from a given product, is a demanding problem in the field of organic chemistry. Discovering the synthesis path of a new chemical compound with desired properties is key in materials development. As a molecule can be represented as a sequence of characters describing atoms or bonds using Simplified-Molecular-Input Line-Entry-System (SMILES), recent studies started to cast the problem into a seq2seq translation problem from a product to its reactants and leverage the success of neural machine translation models such as Transformer. However, our target in this “translation” task is not a simple sequence as in the regular language translation task but rather a “set” of reactant sequences. Hence, the quality of predictions should not be assessed by the order of reactants generated and naively applying the standard models without this consideration would bring performance degradation. In this paper, we propose a novel $\textbf{Set Invariant Loss}$ function to train the retrosynthesis model, which promotes the model to learn to predict reactants in an order invariant fashion. We also devise new Transformer architecture called $\textbf{Multi-Decoder Transformer}$ and its ensemble techniques suitable for such set invariant loss. We validate our set invariant loss against standard cross-entropy loss on top of recent Transformer based models and we achieve state-of-the-art performance among the template-free based baselines on the standard benchmark dataset (USPTO-50K).
Advisors
Yang, Eunhoresearcher양은호researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2020.8,[iv, 33 p. :]

Keywords

Restrosynthesis▼aSequence to Sequence▼aTransformer▼aMulti Decoder▼aSet Invariant Loss▼aEnsemble▼aLatent Variables▼aEM algorithm▼aMachine Learning▼aArtificial Intelligence; 역합성▼a시퀀스투시퀀스▼a트랜스포머▼a멀티디코더▼a집합불변손실함수▼a앙상블▼a잠재변수▼a기댓값 최대화 알고리즘▼a머신러닝▼a인공지능

URI
http://hdl.handle.net/10203/296131
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=964783&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0