AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 240 | 2020 |
FPDeep: Acceleration and load balancing of CNN training on FPGA clusters T Geng, T Wang, A Sanaullah, C Yang, R Xu, R Patel, M Herbordt 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018 | 101 | 2018 |
A Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Work and Weight Load Balancing T Geng, T Wang, A Sanaullah, C Yang, R Patel, M Herbordt 2018 28th International Conference on Field Programmable Logic and …, 2018 | 74 | 2018 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 53 | 2019 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 53 | 2019 |
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters T Wang, T Geng, A Li, X Jin, M Herbordt IEEE Transactions on Computers 69 (8), 1143-1158, 2020 | 46 | 2020 |
LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 44 | 2019 |
BSTC: a novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets A Li, T Geng, T Wang, M Herbordt, SL Song, K Barker Proceedings of the International Conference for High Performance Computing …, 2019 | 38 | 2019 |
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference T Geng, A Li, T Wang, C Wu, Y Li, R Shi, W Wu, M Herbordt IEEE Transactions on Parallel and Distributed Systems 32 (1), 199-213, 2020 | 34 | 2020 |
High performance dynamic communication on reconfigurable clusters J Sheng, C Yang, T Wang, M Herbordt 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018 | 29 | 2018 |
O3BNN: an out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning T Geng, T Wang, C Wu, C Yang, W Wu, A Li, MC Herbordt Proceedings of the ACM International Conference on Supercomputing, 461-472, 2019 | 28 | 2019 |
Molecular Dynamics Range-Limited Force Evaluation Optimized for FPGAs C Yang, T Geng, T Wang, C Lin, J Sheng, V Sachdeva, W Sherman, ... 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 23 | 2019 |
FP-AMG: FPGA-Based Acceleration Framework for Algebraic Multigrid Solvers P Haghi, T Geng, A Guo, T Wang, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 21 | 2020 |
A Scalable Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Weight and Workload Balancing T Geng, T Wang, A Li, X Jin, M Herbordt arXiv preprint arXiv:1901.01007, 2019 | 15 | 2019 |
An accelerating solution for-body mond simulation with fpga-soc B Peng, T Wang, X Jin, C Wang International Journal of Reconfigurable Computing 2016, 2016 | 15 | 2016 |
A 56-ps multi-phase clock time-to-digital convertor based on Artix-7 FPGA T Xiang, L Zhao, X Jin, T Wang, S Chu, C Ma, S Liu, Q An 2014 19th IEEE-NPSS Real Time Conference, 1-4, 2014 | 15 | 2014 |
Accelerating AP3M-Based Computational Astrophysics Simulations with Reconfigurable Clusters T Wang, T Geng, X Jin, M Herbordt 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 11 | 2019 |
Uwb-gcn: Hardware acceleration of graph-convolution-network through runtime workload rebalancing T Geng, A Li, T Wang, C Wu, Y Li, A Tumeo, M Herbordt arXiv preprint arXiv:1908.10834, 2019 | 11 | 2019 |
FP-AMR: A Reconfigurable Fabric Framework for Adaptive Mesh Refinement Applications T Wang, T Geng, X Jin, M Herbordt 2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019 | 10 | 2019 |
BaPipe: Exploration of Balanced Pipeline Parallelism for DNN Training L Zhao, R Xu, T Wang, T Tian, X Wang, W Wu, C Ieong, X Jin arXiv preprint arXiv:2012.12544, 2020 | 7 | 2020 |