Improved MPI collectives for MPI processes in shared address spaces.
Shigang Li, Torsten Hoefler, Chungjin Hu, Marc Snir: Improved MPI collectives for MPI processes in shared address spaces. Clust. Comput. 17(4): 1139-1155 (2014)
Towards a more fault resilient multigrid solver.
Jon Calhoun, Luke N. Olson, Marc Snir, William D. Gropp: Towards a more fault resilient multigrid solver. SpringSim (HPS) 2015: 1-8
Pattern-driven parallel I/O tuning.
Babak Behzad, Surendra Byna, Prabhat, Marc Snir: Pattern-driven parallel I/O tuning. PDSW@SC 2015: 43-48
PPL: an abstract runtime system for hybrid parallel programming.
Alex Brooks, Hoang-Vu Dang, Nikoli Dryden, Marc Snir: PPL: an abstract runtime system for hybrid parallel programming. ESPM@SC 2015: 2-9
Scheduling the I/O of HPC Applications Under Congestion.
Ana Gainaru, Guillaume Aupy, Anne Benoit, Franck Cappello, Yves Robert, Marc Snir: Scheduling the I/O of HPC Applications Under Congestion. IPDPS 2015: 1013-1022
A General Space-filling Curve Algorithm for Partitioning 2D Meshes.
Aparna Sasidharan, John M. Dennis, Marc Snir: A General Space-filling Curve Algorithm for Partitioning 2D Meshes. HPCC/CSS/ICESS 2015: 875-879
Distributed Monitoring and Management of Exascale Systems in the Argo Project.
Swann Perarnau, Rajeev Thakur, Kamil Iskra, Ken Raffenetti, Franck Cappello, Rinku Gupta, Peter H. Beckman, Marc Snir, Henry Hoffmann, Martin Schulz, Barry Rountree: Distributed Monitoring and...
Understanding the Propagation of Error Due to a Silent Data Corruption in a...
Jon Calhoun, Marc Snir, Luke N. Olson, María Jesús Garzarán: Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply. CLUSTER 2015: 541-542
Dynamic Model-Driven Parallel I/O Performance Tuning.
Babak Behzad, Surendra Byna, Stefan M. Wild, Prabhat, Marc Snir: Dynamic Model-Driven Parallel I/O Performance Tuning. CLUSTER 2015: 184-193
Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters.
Junchao Zhang, Babak Behzad, Marc Snir: Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters. IEEE Trans. Parallel Distributed Syst. 26(7): 1861-1873 (2015)
Doing Moore with Less - Leapfrogging Moore's Law with Inexactness for...
Sven Leyffer, Stefan M. Wild, Mike Fagan, Marc Snir, Krishna V. Palem, Kazutomo Yoshii, Hal Finkel: Doing Moore with Less - Leapfrogging Moore's Law with Inexactness for Supercomputing. CoRR...
Overcoming the power wall by exploiting inexactness and emerging COTS...
Mike Fagan, Jeremy Schlachter, Kazutomo Yoshii, Sven Leyffer, Krishna V. Palem, Marc Snir, Stefan M. Wild, Christian C. Enz: Overcoming the power wall by exploiting inexactness and emerging COTS...
Towards millions of communicating threads.
Hoang-Vu Dang, Marc Snir, William Gropp: Towards millions of communicating threads. EuroMPI 2016: 1-14
Reducing Waste in Extreme Scale Systems through Introspective Analysis.
Leonardo Arturo Bautista-Gomez, Ana Gainaru, Swann Perarnau, Devesh Tiwari, Saurabh Gupta, Christian Engelmann, Franck Cappello, Marc Snir: Reducing Waste in Extreme Scale Systems through Introspective...
Damaris: Addressing Performance Variability in Data Management for...
Matthieu Dorier, Gabriel Antoniu, Franck Cappello, Marc Snir, Robert Sisneros, Orcun Yildiz, Shadi Ibrahim, Tom Peterka, Leigh Orf: Damaris: Addressing Performance Variability in Data Management for...
Towards a More Complete Understanding of SDC Propagation.
Jon Calhoun, Marc Snir, Luke N. Olson, William D. Gropp: Towards a More Complete Understanding of SDC Propagation. HPDC 2017: 131-142
LogAider: A tool for mining potential correlations of HPC log events.
Sheng Di, Rinku Gupta, Marc Snir, Eric Pershey, Franck Cappello: LogAider: A tool for mining potential correlations of HPC log events. CCGrid 2017: 442-451
Eliminating contention bottlenecks in multithreaded MPI.
Hoang-Vu Dang, Marc Snir, William Gropp: Eliminating contention bottlenecks in multithreaded MPI. Parallel Comput. 69: 1-23 (2017)
Predicting HPC parallel program performance based on LLVM compiler.
Weizhe Zhang, Meng Hao, Marc Snir: Predicting HPC parallel program performance based on LLVM compiler. Clust. Comput. 20(2): 1179-1192 (2017)
The informal guide to ACM fellow nominations.
Marc Snir: The informal guide to ACM fellow nominations. Commun. ACM 60(7): 32-34 (2017)
Network and Parallel Computing - 15th IFIP WG 10.3 International Conference,...
Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero: Network and Parallel Computing - 15th IFIP WG 10.3 International Conference, NPC 2018, Muroran, Japan, November 29 -...
Gluon: a communication-optimizing substrate for distributed heterogeneous...
Roshan Dathathri, Gurbinder Gill, Loc Hoang, Hoang-Vu Dang, Alex Brooks, Nikoli Dryden, Marc Snir, Keshav Pingali: Gluon: a communication-optimizing substrate for distributed heterogeneous graph...
A Lightweight Communication Runtime for Distributed Graph Analytics.
Hoang-Vu Dang, Roshan Dathathri, Gurbinder Gill, Alex Brooks, Nikoli Dryden, Andrew Lenharth, Loc Hoang, Keshav Pingali, Marc Snir: A Lightweight Communication Runtime for Distributed Graph Analytics....
FULT: Fast User-Level Thread Scheduling Using Bit-Vectors.
Hoang-Vu Dang, Marc Snir: FULT: Fast User-Level Thread Scheduling Using Bit-Vectors. ICPP 2018: 71:1-71:10
Neural Network Based Silent Error Detector.
Chen Wang, Nikoli Dryden, Franck Cappello, Marc Snir: Neural Network Based Silent Error Detector. CLUSTER 2018: 168-178
Argobots: A Lightweight Low-Level Threading and Tasking Framework.
Sangmin Seo, Abdelhalim Amer, Pavan Balaji, Cyril Bordage, George Bosilca, Alex Brooks, Philip H. Carns, Adrián Castelló, Damien Genet, Thomas Hérault, Shintaro Iwasaki, Prateek Jindal, Laxmikant V....
Technical perspective: The future of MPI.
Marc Snir: Technical perspective: The future of MPI. Commun. ACM 61(10): 105 (2018)
Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained...
Nikoli Dryden, Naoya Maruyama, Tom Benson, Tim Moon, Marc Snir, Brian Van Essen: Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism. CoRR abs/1903.06681 (2019)
Channel and filter parallelism for large-scale CNN training.
Nikoli Dryden, Naoya Maruyama, Tim Moon, Tom Benson, Marc Snir, Brian Van Essen: Channel and filter parallelism for large-scale CNN training. SC 2019: 10:1-10:20
Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained...
Nikoli Dryden, Naoya Maruyama, Tom Benson, Tim Moon, Marc Snir, Brian Van Essen: Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism. IPDPS 2019: 210-220
Characterizing and Understanding HPC Job Failures Over The 2K-Day Life of IBM...
Sheng Di, Hanqi Guo, Eric Pershey, Marc Snir, Franck Cappello: Characterizing and Understanding HPC Job Failures Over The 2K-Day Life of IBM BlueGene/Q System. DSN 2019: 473-484
Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous...
Roshan Dathathri, Gurbinder Gill, Loc Hoang, Vishwesh Jatala, Keshav Pingali, V. Krishna Nandivada, Hoang-Vu Dang, Marc Snir: Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous...
Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC...
Sheng Di, Hanqi Guo, Rinku Gupta, Eric Pershey, Marc Snir, Franck Cappello: Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC System. IEEE Trans. Parallel Distributed Syst....
Optimizing I/O Performance of HPC Applications with Autotuning.
Babak Behzad, Surendra Byna, Prabhat, Marc Snir: Optimizing I/O Performance of HPC Applications with Autotuning. ACM Trans. Parallel Comput. 5(4): 15:1-15:27 (2019)
Automatic generation of benchmarks for I/O-intensive parallel applications.
Meng Hao, Weizhe Zhang, You Zhang, Marc Snir, Laurence T. Yang: Automatic generation of benchmarks for I/O-intensive parallel applications. J. Parallel Distributed Comput. 124: 1-13 (2019)
Guest Editorial: Special Issue on Network and Parallel Computing for Emerging...
Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero: Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications. Int. J....
Exploring the feasibility of lossy compression for PDE simulations.
Jon C. Calhoun, Franck Cappello, Luke N. Olson, Marc Snir, William D. Gropp: Exploring the feasibility of lossy compression for PDE simulations. Int. J. High Perform. Comput. Appl. 33(2) (2019)
Recorder 2.0: Efficient Parallel I/O Tracing and Analysis.
Chen Wang, Jinghan Sun, Marc Snir, Kathryn M. Mohror, Elsa Gonsiorowski: Recorder 2.0: Efficient Parallel I/O Tracing and Analysis. IPDPS Workshops 2020: 1052-1059
First IEEE International Workshop on High-Performance Storage (HPS).
Kathryn M. Mohror, Marc Snir: First IEEE International Workshop on High-Performance Storage (HPS). IPDPS Workshops 2020: 1024-1026
Understanding and Finding Crash-Consistency Bugs in Parallel File Systems.
Jinghan Sun, Chen Wang, Jian Huang, Marc Snir: Understanding and Finding Crash-Consistency Bugs in Parallel File Systems. HotStorage 2020
Pinpointing crash-consistency bugs in the HPC I/O stack: a cross-layer approach.
Jinghan Sun, Jian Huang, Marc Snir: Pinpointing crash-consistency bugs in the HPC I/O stack: a cross-layer approach. SC 2021: 103
Pilgrim: scalable and (near) lossless MPI tracing.
Chen Wang, Pavan Balaji, Marc Snir: Pilgrim: scalable and (near) lossless MPI tracing. SC 2021: 52
Verifying IO Synchronization from MPI Traces.
Sushma Yellapragada, Chen Wang, Marc Snir: Verifying IO Synchronization from MPI Traces. PDSW@SC 2021: 41-46
File System Semantics Requirements of HPC Applications.
Chen Wang, Kathryn M. Mohror, Marc Snir: File System Semantics Requirements of HPC Applications. HPDC 2021: 19-30
Design and Analysis of the Network Software Stack of an Asynchronous...
Jiakun Yan, Hartmut Kaiser, Marc Snir: Design and Analysis of the Network Software Stack of an Asynchronous Many-task System - The LCI parcelport of HPX. SC Workshops 2023: 1151-1161
Improving the Scaling of an Asynchronous Many-Task Runtime with a Lightweight...
Omri Mor, George Bosilca, Marc Snir: Improving the Scaling of an Asynchronous Many-Task Runtime with a Lightweight Communication Engine. ICPP 2023: 153-162
Near-Lossless MPI Tracing and Proxy Application Autogeneration.
Chen Wang, Yanfei Guo, Pavan Balaji, Marc Snir: Near-Lossless MPI Tracing and Proxy Application Autogeneration. IEEE Trans. Parallel Distributed Syst. 34(1): 123-140 (2023)
Formal Definitions and Performance Comparison of Consistency Models for...
Chen Wang, Kathryn M. Mohror, Marc Snir: Formal Definitions and Performance Comparison of Consistency Models for Parallel File Systems. CoRR abs/2402.14105 (2024)