Follow
Zhaorui Zhang
Zhaorui Zhang
Department of Computing, The Hong Kong Polytechnic University
Verified email at connect.hku.hk - Homepage
Title
Cited by
Cited by
Year
MIPD: An adaptive gradient sparsification framework for distributed DNNs training
Z Zhang, C Wang
IEEE Transactions on Parallel and Distributed Systems 33 (11), 3053-3066, 2022
92022
FPGA-based High-Performance Collision Detection: An Enabling Technique for Image-Guided Robotic Surgery
Z Zhang, Xin Y, Liu B, Li WXY, Lee KH, Ng CF, Stoyanov D, Cheung RCC, Kwok KW
Frontiers in Robotics and AI, 2016
9*2016
C-coll: Introducing error-bounded lossy compression into mpi collectives
J Huang, S Di, X Yu, Y Zhai, J Liu, K Raffenetti, H Zhou, K Zhao, Z Chen, ...
arXiv preprint arXiv:2304.03890, 2023
72023
SaPus: Self-adaptive parameter update strategy for DNN training on Multi-GPU clusters
Z Zhang, C Wang
IEEE Transactions on Parallel and Distributed Systems 33 (7), 1569-1580, 2021
52021
国家高性能计算环境发展报告: 2002-2017 年
迟学斌
科学出版社, 2018
52018
An application specific instruction set processor (asip) for adaptive filters in neural prosthetics
Y Xin, WXY Li, Z Zhang, RCC Cheung, D Song, TW Berger
IEEE/ACM Transactions on Computational Biology and Bioinformatics 12 (5 …, 2015
52015
Momentum-driven adaptive synchronization model for distributed DNN training on HPC clusters
Z Zhang, Z Ji, C Wang
Journal of Parallel and Distributed Computing 159, 65-84, 2022
32022
Development Report on National High Performance Computing Environment (2002-2017)[M]
XB Chi
Science Press, 51-113, 2018
22018
FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
H Xu, Z Zhang, S Di, B Liu, A Khalid, J Cao
arXiv preprint arXiv:2404.11015, 2024
2024
A Survey on Error-Bounded Lossy Compression for Scientific Datasets
S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ...
arXiv preprint arXiv:2404.02840, 2024
2024
POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs
Z Ji, Z Zhang, J Xu, L Ju
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
2024
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression
Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu ...
IPDPS: 2024 38th IEEE International Parallel & Distributed Processing Symposium, 2023
2023
Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs
Zhuoran Ji, Zhaorui Zhang, Jiming Xu, Lei Ju
PPoPP: ACM SIGPLAN Symposium on Principles and Practice of Parallel …, 2023
2023
Efficient parameter update strategy for distributed deep learning system
Z Zhang
HKU Theses Online (HKUTO), 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–14