Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorSubasi, Omer
dc.contributor.authorUnsal, Osman
dc.contributor.authorLabarta, Jesus
dc.contributor.authorYalcin, Gulay
dc.contributor.authorCristal, Adrian
dc.date.accessioned2021-11-10T07:00:44Z
dc.date.available2021-11-10T07:00:44Z
dc.date.issued2016en_US
dc.identifier.isbn978-1-5090-2140-6
dc.identifier.issn1530-2075
dc.identifier.urihttps //doi.org/10.1109/IPDPS.2016.70
dc.identifier.urihttps://hdl.handle.net/20.500.12573/1020
dc.description.abstractMemory reliability will be one of the major concerns for future HPC and Exascale systems. This concern is mostly attributed to the expected massive increase in memory capacity and the number of memory devices in Exascale systems. For memory systems Error Correcting Codes (ECC) are the most commonly used mechanism. However state-of-the art hardware ECCs will not be sufficient in terms of error coverage for future computing systems and stronger hardware ECCs providing more coverage have prohibitive costs in terms of area, power and latency. Software-based solutions are needed to cooperate with hardware. In this work, we propose a Cyclic Redundancy Checks (CRCs) based software mechanism for task-parallel HPC applications. Our mechanism incurs only 1.7% performance overhead with hardware acceleration while being highly scalable at large scale. Our mathematical analysis demonstrates the effectiveness of our scheme and its error coverage. Results show that our CRCbased mechanism reduces the memory vulnerability by 87% on average with up to 32-bit burst (consecutive) and 5-bit arbitrary error correction capability.en_US
dc.description.sponsorshipIEEE; IEEE Comp Soc, Tech Comm Parallel Proc; ACM SIGARCH; IEEE Comp Soc Tech Comm Comp Architecture; IEEE Comp Soc Tech Comm Distributed Procen_US
dc.language.isoengen_US
dc.publisherIEEE345 E 47TH ST, NEW YORK, NY 10017 USAen_US
dc.relation.isversionof10.1109/IPDPS.2016.70en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.titleCRC-based Memory Reliability for Task-parallel HPC Applicationsen_US
dc.title.alternativeBook SeriesInternational Parallel and Distributed Processing Symposium IPDPSen_US
dc.typebookParten_US
dc.contributor.departmentAGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.contributor.institutionauthorYalcin, Gulay
dc.identifier.startpage1101en_US
dc.identifier.endpage1112en_US
dc.relation.journal2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016)en_US
dc.relation.publicationcategoryMakale - Uluslararası - Editör Denetimli Dergien_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster