Scalable cache coherent schemes for direct-connected shared memory multiprocessors (직접 상호연결 공유메모리 다중처리기를 위한 확장성있는 캐쉬 일관성 기법)

DC Field | Value | Language
dc.contributor.advisor | Lee, Joon-Won | -
dc.contributor.advisor | 이준원 | -
dc.contributor.author | Rhee, Yun-Seok | -
dc.contributor.author | 이윤석 | -
dc.date.accessioned | 2011-12-13T05:25:01Z | -
dc.date.available | 2011-12-13T05:25:01Z | -
dc.date.issued | 1999 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=151040&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/33142 | -
dc.description | Doctoral thesis (Ph.D.) - KAIST (한국과학기술원) : Department of Computer Science, 1999.2, [ vi, 103 p. ] | -
dc.description.abstract | Large-scale shared-memory multiprocessors favor directory-based cache coherence schemes for their scalability. The directory space needed to record sharer information has a complexity of $\Theta(N^2)$ when a full-mapped vector is used in an N-node system. Although this overhead can be reduced by limiting the directory size on the assumption that the degree of sharing is small, such a scheme becomes significantly inefficient when data are widely shared. As one cause of the coherence overhead, we observe that most coherence actions in directory-based protocols are initiated by home nodes, which in turn limits scalability. To lessen the burden on home nodes, we propose a new directory scheme and an efficient broadcast capability for direct interconnection networks. The broadcast capability relies on the fact that all switching technologies allow each router to observe the messages passing through it; a message can therefore carry out a broadcast without extra traffic, which can be exploited for cache coherence. Only a slight change to a typical router is needed to implement our scheme. The SPLASH parallel program suite is used in a simulation study that compares our scheme with other directory-based schemes. Our scheme is shown to generate much less cache coherence traffic, while its space complexity, $\Theta(N^{3/2}\log{N})$, is slightly more scalable. The scheme is also applicable to any k-ary n-cube network, including a mesh. Based on the proposed broadcast mechanism, we present two enhanced schemes that aggressively exploit wide sharing patterns in conjunction with software. One scheme is motivated by the fact that most widely shared data have relatively short write runs, so updates are more effective for those data while invalidation remains effective for other data. For this scheme, accurately identifying the widely shared data becomes more important. Simulation results show that... | eng
dc.language | eng | -
dc.publisher | 한국과학기술원 (KAIST) | -
dc.subject | Direct interconnect | -
dc.subject | Multiprocessor | -
dc.subject | Shared memory | -
dc.subject | Cache coherence | -
dc.subject | Directory-based protocol | -
dc.subject | 디렉토리 기반 프로토콜 (directory-based protocol) | -
dc.subject | 직접 상호 연결망 (direct interconnection network) | -
dc.subject | 다중처리기 (multiprocessor) | -
dc.subject | 공유 메모리 (shared memory) | -
dc.subject | 캐쉬 일관성 (cache coherence) | -
dc.title | Scalable cache coherent schemes for direct-connected shared memory multiprocessors | -
dc.title.alternative | 직접 상호연결 공유메모리 다중처리기를 위한 확장성있는 캐쉬 일관성 기법 | -
dc.type | Thesis(Ph.D) | -
dc.identifier.CNRN | 151040/325007 | -
dc.description.department | KAIST (한국과학기술원) : Department of Computer Science | -
dc.identifier.uid | 000959532 | -
dc.contributor.localauthor | Lee, Joon-Won | -
dc.contributor.localauthor | 이준원 | -
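
As a rough illustration of the storage figures quoted in the abstract, the sketch below counts directory bits for a full-mapped vector, where every memory block keeps one presence bit per node, and for a generic limited-pointer directory. It is not the scheme proposed in the thesis: the function names, the per-node block count, and the pointer count are hypothetical, and the last function only compares the two asymptotic growth rates with all constant factors assumed to be 1.

```python
import math


def full_map_bits(n_nodes: int, blocks_per_node: int) -> int:
    """Directory bits for a full-mapped vector.

    Each of the n_nodes * blocks_per_node memory blocks stores one presence
    bit per node, so the total grows as Theta(N^2) for a fixed
    blocks_per_node.
    """
    return n_nodes * blocks_per_node * n_nodes


def limited_pointer_bits(n_nodes: int, blocks_per_node: int, pointers: int) -> int:
    """Directory bits when each block keeps a fixed number of sharer pointers.

    Each pointer needs ceil(log2 N) bits. This is a generic limited
    directory, used here only as a point of comparison.
    """
    pointer_width = math.ceil(math.log2(n_nodes))
    return n_nodes * blocks_per_node * pointers * pointer_width


def growth_ratio(n_nodes: int) -> float:
    """Ratio N^2 / (N^(3/2) * log2 N), with constant factors assumed to be 1."""
    return n_nodes ** 2 / (n_nodes ** 1.5 * math.log2(n_nodes))


if __name__ == "__main__":
    blocks = 1024  # hypothetical number of memory blocks per node
    for n in (16, 64, 256, 1024):
        print(
            n,
            full_map_bits(n, blocks),
            limited_pointer_bits(n, blocks, pointers=4),
            round(growth_ratio(n), 2),
        )
```

The limited-pointer count stays small only while the number of sharers fits in the fixed pointers, which matches the abstract's remark that such schemes become inefficient for widely shared data.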
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.