The MVAPICH project: Transforming research into high-performance MPI library for HPC community DK Panda, H Subramoni, CH Chu, M Bayatpour Journal of Computational Science 52, 101208, 2021 | 62 | 2021 |
Scalable reduction collectives with data partitioning-based multi-leader design M Bayatpour, S Chakraborty, H Subramoni, X Lu, DK Panda Proceedings of the International Conference for High Performance Computing …, 2017 | 45 | 2017 |
BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs M Bayatpour, N Sarkauskas, H Subramoni, JM Hashmi, DK Panda ISC High Performance 2021, 2021 | 41 | 2021 |
Adaptive and dynamic design for MPI tag matching M Bayatpour, H Subramoni, S Chakraborty, DK Panda 2016 IEEE International Conference on Cluster Computing (CLUSTER), 1-10, 2016 | 30 | 2016 |
Salar: Scalable and adaptive designs for large message reduction collectives M Bayatpour, JM Hashmi, S Chakraborty, H Subramoni, P Kousha, ... 2018 IEEE International Conference on Cluster Computing (CLUSTER), 12-23, 2018 | 29 | 2018 |
Designing efficient shared address space reduction collectives for multi-/many-cores JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 29 | 2018 |
Efficient asynchronous communication progress for MPI without dedicated resources A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ... Proceedings of the 25th European MPI Users' Group Meeting, 1-11, 2018 | 24 | 2018 |
Falcon: Efficient designs for zero-copy MPI datatype processing on emerging architectures JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 15 | 2019 |
FALCON-X: Zero-copy MPI derived datatype processing on modern CPU and GPU architectures JM Hashmi, CH Chu, S Chakraborty, M Bayatpour, H Subramoni, ... Journal of Parallel and Distributed Computing 144, 1-13, 2020 | 12 | 2020 |
Efficient design for MPI asynchronous progress without dedicated resources A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ... Parallel Computing 85, 13-26, 2019 | 12 | 2019 |
Large-message nonblocking MPI_Iallgather and MPI_Ibcast offload via BlueField-2 DPU N Sarkauskas, M Bayatpour, T Tran, B Ramesh, H Subramoni, DK Panda 2021 IEEE 28th International Conference on High Performance Computing, Data …, 2021 | 9 | 2021 |
Cooperative rendezvous protocols for improved performance and overlap S Chakraborty, M Bayatpour, J Hashmi, H Subramoni, DK Panda SC18: International Conference for High Performance Computing, Networking …, 2018 | 9 | 2018 |
Design and Characterization of InfiniBand Hardware Tag Matching in MPI M Bayatpour, SM Ghazimirsaeed, S Xu, H Subramoni, DK Panda The 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020 | 8 | 2020 |
Scalable MPI collectives using SHARP: large scale performance evaluation on the TACC Frontera system B Ramesh, KK Suresh, N Sarkauskas, M Bayatpour, JM Hashmi, ... 2020 Workshop on Exascale MPI (ExaMPI), 11-20, 2020 | 8 | 2020 |
Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures JM Hashmi, S Xu, B Ramesh, M Bayatpour, H Subramoni, DK Panda 34th IEEE International Parallel & Distributed Processing Symposium (IPDPS '20), 2020 | 7 | 2020 |
Design and characterization of shared address space MPI collectives on modern architectures JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019 | 7 | 2019 |
Communication-Aware Hardware-Assisted MPI Overlap Engine M Bayatpour, JM Hashmi, S Chakraborty, K Kandadi Suresh, ... ISC High Performance 2020, 2020 | 6 | 2020 |
A Hierarchical and Load-Aware Design for Large Message Neighborhood Collectives SM Ghazimirsaeed, Q Zhou, A Ruhela, M Bayatpour, H Subramoni, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 5 | 2020 |
Performance characterization of network mechanisms for non-contiguous data transfers in MPI KK Suresh, B Ramesh, SM Ghazimirsaeed, M Bayatpour, J Hashmi, ... 2020 IEEE International Parallel and Distributed Processing Symposium …, 2020 | 5 | 2020 |
Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design M Bayatpour, S Chakraborty, H Subramoni, X Lu, DK Panda Proceedings of the International Conference for High Performance Computing …, 2017 | 4 | 2017 |