GARLSched: Generative adversarial deep reinforcement learning task scheduling optimization for large-scale high performance computing systems J Li, X Zhang, J Wei, Z Ji, Z Wei Future Generation Computer Systems 135, 259-269, 2022 | 16 | 2022 |
Energy-aware task scheduling optimization with deep reinforcement learning for large-scale heterogeneous systems J Li, X Zhang, Z Wei, J Wei, Z Ji CCF transactions on high performance computing 3, 383-392, 2021 | 12 | 2021 |
Leader population learning rate schedule J Wei, X Zhang, Z Zhuo, Z Ji, Z Wei, J Li, Q Li Information Sciences 623, 455-468, 2023 | 8 | 2023 |
Status, challenges and trends of data-intensive supercomputing J Wei, M Chen, L Wang, P Ren, Y Lei, Y Qu, Q Jiang, X Dong, W Wu, ... CCF Transactions on High Performance Computing 4 (2), 211-230, 2022 | 7 | 2022 |
Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype system J Wei, X Zhang, Z Ji, J Li, Z Wei Scientific Reports 11 (1), 20244, 2021 | 5 | 2021 |
A tile-fusion method for accelerating Winograd convolutions Z Ji, X Zhang, Z Wei, J Li, J Wei Neurocomputing 460, 9-19, 2021 | 4 | 2021 |
EP4DDL: addressing straggler problem in heterogeneous distributed deep learning Z Ji, X Zhang, J Li, J Wei, Z Wei The Journal of Supercomputing 78 (13), 15663-15680, 2022 | 3 | 2022 |
Fastensor: Optimise the Tensor I/O Path from SSD to GPU for Deep Learning Training J Wei, X Zhang, L Wang, Z Wei ACM Transactions on Architecture and Code Optimization 20 (4), 1-25, 2023 | 2 | 2023 |
数据密集型超算现状, 挑战以及未来发展趋势 魏嘉, 陈默, 王龙翔, 任沛, 雷雨佳, 屈俞岐, 蒋骐羽, 董小社, 伍卫国, ... 数据与计算发展前沿 5 (3), 66-91, 2023 | 1 | 2023 |
How much storage do we need for high performance server J Wei, X Zhang 2022 IEEE 38th International Conference on Data Engineering (ICDE), 3221-3225, 2022 | 1 | 2022 |
BenQ: Benchmarking automated quantization on deep neural network accelerators Z Wei, X Zhang, J Li, Z Ji, J Wei 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2022 | 1 | 2022 |
天河三号原型机分布式并行深度神经网络性能评测及调优 魏嘉, 张兴军, 纪泽宇, 李靖波, 岳莹莹 计算机工程与科学 43 (05), 782, 2021 | 1 | 2021 |
Dual-pronged deep learning preprocessing on heterogeneous platforms with CPU, GPU and CSD J Wei, X Zhang, W Pedrycz, L Wang, J Zhao arXiv preprint arXiv:2407.00005, 2024 | | 2024 |
BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion J Wei, X Zhang, W Pedrycz arXiv preprint arXiv:2403.15766, 2024 | | 2024 |
Revisit and Benchmarking of Automated Quantization Towards Fair Comparison Z Wei, X Zhang, Z Ji, J Li, J Wei IEEE Transactions on Computers, 2023 | | 2023 |