DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Dae-Shik | - |
dc.contributor.advisor | 김대식 | - |
dc.contributor.author | Yoon, Wonjun | - |
dc.date.accessioned | 2019-09-04T02:40:15Z | - |
dc.date.available | 2019-09-04T02:40:15Z | - |
dc.date.issued | 2019 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=843408&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/266712 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2019.2,[v, 26 p. :] | - |
dc.description.abstract | Understanding the relationships between objects in an image is an important problem in computer vision. Recently, methods for concerning the relationships have been proposed in many vision tasks, but there are few studies in the semantic-visual embedding problem. In this paper, we first propose a new dataset called R-CLEVR to concentrate on the relations between objects in semantic-visual problems, and we introduce an Object Phase Module (OPM) that focuses on relative locations of objects in an image. Experiments demonstrate that our proposed network with object phase module has the highest performance in cross-modal retrieval and phrase grounding problems on R-CLEVR datasets. Furthermore, our model demonstrates meaningful performance on MS-COCO dataset which has a relatively small number of object relations. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Deep learning▼acomputer vision▼amulti modal▼aimage and text understanding▼asemantic visual embeddings | - |
dc.subject | 딥러닝▼a컴퓨터 비전▼a멀티모달▼a이미지-텍스트 이해▼a의미론적 시각 임베딩 | - |
dc.title | Deep semantic visual embeddings with spatial relationships | - |
dc.title.alternative | 공간적 위치 관계성을 고려한 의미론적 시각 임베딩 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | 윤원준 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.