dc.contributor.author: Lagraviere, Jeremie Alexandre Emilien
dc.contributor.author: Langguth, Johannes
dc.contributor.author: Prugger, Martina
dc.contributor.author: Einkemmer, Lukas
dc.contributor.author: Ha, Hoai Phuong
dc.contributor.author: Cai, Xing
dc.date.accessioned: 2019-08-27T12:54:13Z
dc.date.available: 2019-08-27T12:54:13Z
dc.date.issued: 2019-03-03
dc.description.abstract: The Unified Parallel C (UPC) programming language offers parallelism via logically partitioned shared memory, which typically spans physically disjoint memory subsystems. One convenient feature of UPC is its ability to automatically execute between-thread data movement, such that the entire content of a shared data array appears to be freely accessible by all the threads. The programmer friendliness, however, can come at the cost of substantial performance penalties. This is especially true when indirectly indexing the elements of a shared array, for which the induced between-thread data communication can be irregular and have a fine-grained pattern. In this paper, we study performance enhancement strategies specifically targeting such fine-grained irregular communication in UPC. Starting from explicit thread privatization, continuing with block-wise communication, and arriving at message condensing and consolidation, we obtained considerable performance improvement of UPC programs that originally require fine-grained irregular communication. Besides the performance enhancement strategies, the main contribution of the present paper is to propose performance models for the different scenarios, in the form of quantifiable formulas that hinge on the actual volumes of various data movements plus a small number of easily obtainable hardware characteristic parameters. These performance models help to verify the enhancements obtained, while also providing insightful predictions of similar parallel implementations, not limited to UPC, that also involve between-thread or between-process irregular communication. As a further validation, we also apply our performance modeling methodology and hardware characteristic parameters to an existing UPC code for solving a 2D heat equation on a uniform mesh. [en_US]
dc.description: Source at https://doi.org/10.1155/2019/6825728. [en_US]
dc.identifier.citation: Lagraviere, J., Langguth, J., Prugger, M., Einkemmer, L., Ha, P.H. & Cai, X. (2019). Performance optimization and modeling of fine-grained irregular communication in UPC. Scientific Programming, 6825728. https://doi.org/10.1155/2019/6825728 [en_US]
dc.identifier.cristinID: FRIDAID 1710351
dc.identifier.doi: 10.1155/2019/6825728
dc.identifier.issn: 1058-9244
dc.identifier.issn: 1875-919X
dc.identifier.uri: https://hdl.handle.net/10037/16014
dc.language.iso: eng [en_US]
dc.publisher: Hindawi [en_US]
dc.relation.journal: Scientific Programming
dc.relation.projectID: info:eu-repo/grantAgreement/RCN/FRINATEK/231746/Norway/PRoductivity and Energy-efficiency through Abstraction-based Parallel Programming/PREAPP/ [en_US]
dc.relation.projectID: info:eu-repo/grantAgreement/RCN/FRINATEK/214113/Norway/User-friendly programming of GPU-enhanced clusters via automated code translation and optimization// [en_US]
dc.relation.projectID: info:eu-repo/grantAgreement/RCN/IKTPLUSS/251186/Norway/Meeting Exascale Computing with Source-to-Source Compilers// [en_US]
dc.relation.projectID: info:eu-repo/grantAgreement/EC/H2020/671647/Norway/Runtime Exploitation of Application Dynamism for Energy-efficient eXascale computing/READEX/ [en_US]
dc.relation.projectID: info:eu-repo/grantAgreement/EC/FP7-ICT/611183/Norway/Execution Models for Energy-Efficient Computing Systems/EXCESS/ [en_US]
dc.rights.accessRights: openAccess [en_US]
dc.subject: VDP::Mathematics and natural science: 400::Information and communication science: 420 [en_US]
dc.subject: VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420 [en_US]
dc.title: Performance optimization and modeling of fine-grained irregular communication in UPC [en_US]
dc.type: Journal article [en_US]
dc.type: Tidsskriftartikkel [en_US]
dc.type: Peer reviewed [en_US]

