Providing Runtime Clock Synchronization With Minimal Node-to-Node Time Deviation on XT4s and XT5s... (Conference Paper, May 2011)
ConnectX-2 CORE-Direct Enabled Asynchronous Broadcast Collective Communications... (Conference Paper, May 2011)
Collective Framework and Performance Optimizations to Open MPI for Cray XT Platforms (Conference Paper, May 2011)
Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures... (Conference Paper, November 2010)
Verification of Scientific Simulations via Hypothesis-Driven Comparative and Quantitative Visualization... (Journal, November 2010)
A Clock Synchronization Strategy for Minimizing Clock Variance at Runtime in High-end Computing Environments... (Conference Paper, October 2010)