Follow
Mohammadreza Bayatpour (Mamzi)
Mohammadreza Bayatpour (Mamzi)
NVIDIA, The Ohio State University
Verified email at nvidia.com
Title
Cited by
Cited by
Year
The MVAPICH project: Transforming research into high-performance MPI library for HPC community
DK Panda, H Subramoni, CH Chu, M Bayatpour
Journal of Computational Science 52, 101208, 2021
602021
Scalable reduction collectives with data partitioning-based multi-leader design
M Bayatpour, S Chakraborty, H Subramoni, X Lu, DK Panda
Proceedings of the International Conference for High Performance Computing …, 2017
452017
BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs
M Bayatpour, N Sarkauskas, H Subramoni, JM Hashmi, DK Panda
ISC High Performance 2021, 2021
402021
Adaptive and dynamic design for MPI tag matching
M Bayatpour, H Subramoni, S Chakraborty, DK Panda
2016 IEEE International Conference on Cluster Computing (CLUSTER), 1-10, 2016
302016
Salar: Scalable and adaptive designs for large message reduction collectives
M Bayatpour, JM Hashmi, S Chakraborty, H Subramoni, P Kousha, ...
2018 IEEE International Conference on Cluster Computing (CLUSTER), 12-23, 2018
292018
Designing efficient shared address space reduction collectives for multi-/many-cores
JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018
292018
Efficient asynchronous communication progress for MPI without dedicated resources
A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ...
Proceedings of the 25th European MPI Users' Group Meeting, 1-11, 2018
242018
Falcon: Efficient designs for zero-copy mpi datatype processing on emerging architectures
JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019
152019
FALCON-X: Zero-copy MPI derived datatype processing on modern CPU and GPU architectures
JM Hashmi, CH Chu, S Chakraborty, M Bayatpour, H Subramoni, ...
Journal of Parallel and Distributed Computing 144, 1-13, 2020
122020
Efficient design for MPI asynchronous progress without dedicated resources
A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ...
Parallel Computing 85, 13-26, 2019
122019
Cooperative rendezvous protocols for improved performance and overlap
S Chakraborty, M Bayatpour, J Hashmi, H Subramoni, DK Panda
SC18: International Conference for High Performance Computing, Networking …, 2018
92018
Large-message nonblocking mpi_iallgather and mpi ibcast offload via bluefield-2 dpu
N Sarkauskas, M Bayatpour, T Tran, B Ramesh, H Subramoni, DK Panda
2021 IEEE 28th International Conference on High Performance Computing, Data …, 2021
82021
Design and Characterization of Infiniband Hardware Tag Matching in MPI
M Bayatpour, SM Ghazimirsaeed, S Xu, H Subramoni, DK Panda
The 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020
82020
Scalable MPI collectives using SHARP: large scale performance evaluation on the TACC frontera system
B Ramesh, KK Suresh, N Sarkauskas, M Bayatpour, JM Hashmi, ...
2020 Workshop on Exascale MPI (ExaMPI), 11-20, 2020
82020
Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures
JM Hashmi, S Xu, B Ramesh, M Bayatpour, H Subramoni, DK Panda
34th IEEE International Parallel & Distributed Processing Symposium (IPDPS '20), 2020
72020
Design and characterization of shared address space mpi collectives on modern architectures
JM Hashmi, S Chakraborty, M Bayatpour, H Subramoni, DK Panda
2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019
72019
Communication-Aware Hardware-Assisted MPI Overlap Engine
M Bayatpour, JM Hashmi, S Chakraborty, K Kandadi Suresh, ...
ISC High Performance 2020, 2020
62020
A Hierarchical and Load-Aware Design for Large Message Neighborhood Collectives
SM Ghazimirsaeed, Q Zhou, A Ruhela, M Bayatpour, H Subramoni, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
52020
Performance characterization of network mechanisms for non-contiguous data transfers in MPI
KK Suresh, B Ramesh, SM Ghazimirsaeed, M Bayatpour, J Hashmi, ...
2020 IEEE international parallel and distributed processing symposium …, 2020
52020
Dhabaleswar K.(DK) Panda. 2017. Scalable Reduction Collectives with Data Partitioning-Based Multi-Leader Design
M Bayatpour, S Chakraborty, H Subramoni, X Lu
Proceedings of the International Conference for High Performance Computing …, 0
4
The system can't perform the operation now. Try again later.
Articles 1–20