dc.contributor.author | Lagraviere, Jeremie Alexandre Emilien | |
dc.contributor.author | Prugger, Martina | |
dc.contributor.author | Einkemmer, Lukas | |
dc.contributor.author | Langguth, Johannes | |
dc.contributor.author | Ha, Hoai Phuong | |
dc.contributor.author | Cai, Xing | |
dc.date.accessioned | 2017-01-24T10:27:33Z | |
dc.date.available | 2017-01-24T10:27:33Z | |
dc.date.issued | 2016 | |
dc.description.abstract | Programmability and performance-per-watt are the major challenges
of the race to Exascale. In this study we focus on Partitioned Global
Address Space (PGAS) languages, using UPC as a particular example. This
category of parallel languages provides ease of programming as a strong advantage
over the classic Message Passing Interface(MPI). PGAS has also
advantages compared to classic shared memory programming (OpenMP),
as by nature a PGAS program is meant to work on a single-node and multinode
machine without changing the code. Our goal in this technical report,
is to use UPC in order to implement a memory bound problem, which involves
irregular inter-thread communication. To represent this problem we
perform a SParse Matrix-Vector multiplication (SpMV) over unstructured
data. We implemented different versions of the UPC-SpMV for different
levels in the code complexity. In this technical report, we give a description
of this various versions of the UPC-SpMV and a set of results using
single-node and multi-node machine hardware scenarios. | en_US |
dc.identifier.cristinID | FRIDAID 1386752 | |
dc.identifier.uri | https://hdl.handle.net/10037/10205 | |
dc.language.iso | eng | en_US |
dc.publisher | UiT The Arctic University of Norway, Faculty of Science and Technology | en |
dc.relation.projectID | Notur/NorStore: NN2849K | en_US |
dc.relation.projectID | Norges forskningsråd: 231746 | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.subject | VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550 | en_US |
dc.subject | VDP::Technology: 500::Information and communication technology: 550 | en_US |
dc.title | Implementing and optimizing a Sparse Matrix-Vector Multiplication with UPC | en_US |
dc.type | Research report | en_US |
dc.type | Forskningsrapport | en_US |