Optimizing sparse tensor times matrix on multi-core and many-core architectures | J Li, Y Ma, C Yan, R Vuduc | 2016 6th Workshop on Irregular Applications: Architecture and Algorithms …, 2016 | 45 | 2016 |
Optimizing sparse tensor times matrix on GPUs | Y Ma, J Li, X Wu, C Yan, J Sun, R Vuduc | Journal of Parallel and Distributed Computing 129, 99-109, 2019 | 37 | 2019 |
ParTI!: A parallel tensor infrastructure for multicore CPU and GPUs | J Li, Y Ma, R Vuduc | | 37* | 2016 |
PASTA: A parallel sparse tensor algorithm benchmark suite | J Li, Y Ma, X Wu, A Li, K Barker | CCF Transactions on High Performance Computing 1 (2), 111-130, 2019 | 25 | 2019 |
ParTI!: A Parallel Tensor Infrastructure for multicore CPUs and GPUs (Oct 2018) | J Li, Y Ma, R Vuduc | URL: https://github.com/hpcgarage/ParTI, 2020 | 5 | 2020 |
ParTI!: A parallel tensor infrastructure for data analysis | J Li, Y Ma, C Yan, J Sun, R Vuduc | Tensor-Learn, NIPS workshop on Learning with Tensors, co-located with NIPS …, 2016 | 4 | 2016 |
LB-HM: load balance-aware data placement on heterogeneous memory for task-parallel HPC applications | Z Xie, J Liu, S Ma, J Li, D Li | Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022 | 1 | 2022 |