A Survey Of Techniques for Architecting and Managing Asymmetric Multicore Processors... Journal December, 2016
Towards Achieving Performance Portability Using Directives for Accelerators... Conference Paper November, 2016
Performance analysis and acceleration of explicit integration for large kinetic networks using batched GPU computations Conference Paper September, 2016
Parallel-DFTL: A Flash Translation Layer that Exploits Internal Parallelism in Solid State Drives Conference Paper August, 2016
A Distributed OpenCL Framework using Redundant Computation and Data Replication Conference Paper June, 2016
NVL-C: Static Analysis Techniques for Efficient, Correct Programming of Non-Volatile Main Memory Systems Conference Paper June, 2016
IMPACC: A Tightly Integrated MPI+OpenACC Framework Exploiting Shared Memory Parallelism Conference Paper May, 2016