Showing results 1-9 of 9
Implementing and optimizing a Sparse Matrix-Vector Multiplication with UPC
(Research report, 2016)
Programmability and performance-per-watt are the major challenges of the race to Exascale. In this study we focus on Partitioned Global Address Space (PGAS) languages, using UPC as a particular example. This category of parallel languages provides ease of programming as a strong advantage over the classic Message Passing Interface (MPI). PGAS also has advantages compared to classic shared memory ...
Experimental Fault-Tolerant Synchronization for Reliable Computation on Graphics Processors
(Research report, 2012)
Graphics processors (GPUs) are emerging as a promising platform for highly parallel, compute-intensive, general-purpose computations, which usually need support for inter-process synchronization. Using the traditional lock-based synchronization (e.g. mutual exclusion) makes the computation vulnerable to faults caused by both scientists’ inexperience and hardware transient errors. It is notoriously ...
NB-FEB: an easy-to-use and scalable universal synchronization primitive for parallel programming
(Research report, 2008-10)
This paper addresses the problem of universal synchronization primitives that can support scalable thread synchronization for large-scale many-core architectures. The universal synchronization primitives that have been deployed widely in conventional architectures are the compare-and-swap (CAS) and load-linked/store-conditional (LL/SC) primitives. However, such synchronization primitives are ...
Evaluation of the power efficiency of UPC, OpenMP and MPI
(Research report, 2015)
In this study we compare the performance and power efficiency of Unified Parallel C (UPC), MPI and OpenMP by running a set of kernels from the NAS Benchmark. One of the goals of this study is to focus on the Partitioned Global Address Space (PGAS) model, in order to describe it and compare it to MPI and OpenMP. In particular we consider the power efficiency expressed in millions of operations ...
DeltaTree: A Practical Locality-aware Concurrent Search Tree
(Research report, 2013)
Like other fundamental programming abstractions in energy-efficient computing, search trees are expected to support both high parallelism and data locality. However, existing highly-concurrent search trees such as red-black trees and AVL trees do not consider data locality, while existing locality-aware search trees such as those based on the van Emde Boas layout (vEB-based trees) poorly support ...
Power models, energy models and libraries for energy-efficient concurrent data structures and algorithms
(Research report, 2016)
This deliverable reports the results of the power models, energy models and libraries for energy-efficient concurrent data structures and algorithms as available by project month 30 of Work Package 2 (WP2). It reports i) the latest results of Task 2.2-2.4 on providing programming abstractions and libraries for developing energy-efficient data structures and algorithms and ii) the improved results of ...
Models for energy consumption of data structures and algorithms
(Research report, 2014)
This deliverable reports our early energy models for data structures and algorithms based on both micro-benchmarks and concurrent algorithms. It reports the early results of Task 2.1 on investigating and modeling the trade-off between energy and performance in concurrent data structures and algorithms, which forms the basis for the whole work package 2 (WP2). The work has been ...
Report on the final prototype of programming abstractions for energy-efficient inter-process communication
(Research report, 2016)
Work package 2 (WP2) aims to develop libraries for energy-efficient inter-process communication and data sharing on the EXCESS platforms. Deliverable D2.4 reports on the final prototype of programming abstractions for energy-efficient inter-process communication. Section 1 is the updated overview of the prototype of programming abstraction and devised power/energy models. Sections 2-6 contain ...
White-box methodologies, programming abstractions and libraries
(Research report, 2015)
This deliverable reports the results of white-box methodologies and early results of the first prototype of libraries and programming abstractions as available by project month 18 by Work Package 2 (WP2). It reports i) the latest results of Task 2.2 on white-box methodologies, programming abstractions and libraries for developing energy-efficient data structures and algorithms ...