A Runtime Environment for Supporting Research in Resilient HPC System Software & Tools Conference Paper December, 2013
Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing Conference Paper November, 2013
Toward a Performance/Resilience Tool for Hardware/Software Co-Design of High-Performance Computing Systems... Conference Paper October, 2013
Optimizing Blocking and Nonblocking Reduction Operations for Multicore Systems: Hierarchical Design and Implementation Conference Paper September, 2013
Design and Implementation of a Scalable Membership Service for Supercomputer Resiliency-Aware Runtime Book Chapter August, 2013
Design and Implementation of a Scalable Membership Service for Supercomputer Resiliency-Aware Runtime... Conference Paper August, 2013
SLOAVx: Scalable LOgarithmic AlltoallV Algorithm for Hierarchical Multicore Systems Conference Paper May, 2013