Follow
Niansong Zhang
Title
Cited by
Cited by
Year
Heteroflow: An accelerator programming model with decoupled data placement for software-defined fpgas
S Xiang, YH Lai, Y Zhou, H Chen, N Zhang, D Pal, Z Zhang
Proceedings of the 2022 ACM/SIGDA International Symposium on Field …, 2022
222022
RapidLayout: Fast Hard Block Placement of FPGA-optimized Systolic Arrays Using Evolutionary Algorithm
N Zhang, X Chen, N Kapre
ACM Transactions on Reconfigurable Technology and Systems (TRETS) 15 (4), 1-23, 2022
122022
Codedvtr: Codebook-based sparse voxel transformer with geometric guidance
T Zhao, N Zhang, X Ning, H Wang, L Yi, Y Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
92022
Understanding the potential of fpga-based spatial acceleration for large language model inference
H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang
arXiv preprint arXiv:2312.15159, 2023
42023
Serving Multi-DNN Workloads on FPGAs: a Coordinated Architecture, Scheduling, and Mapping Perspective
S Zeng, G Dai, N Zhang, X Yang, H Zhang, Z Zhu, H Yang, Y Wang
IEEE Transactions on Computers, 2022
42022
aw_nas: A modularized and extensible nas framework
X Ning, C Tang, W Li, S Yang, T Zhao, N Zhang, T Lu, S Liang, H Yang, ...
arXiv preprint arXiv:2012.10388, 2020
42020
Accelerator design with decoupled hardware customizations: benefits and challenges
D Pal, YH Lai, S Xiang, N Zhang, H Chen, J Casas, P Cocchini, Z Yang, ...
Proceedings of the 59th ACM/IEEE Design Automation Conference, 1351-1354, 2022
22022
Allo: A Programming Model for Composable Accelerator Design
H Chen, N Zhang, S Xiang, Z Zeng, M Dai, Z Zhang
arXiv preprint arXiv:2404.04815, 2024
12024
Formal Verification of Source-to-Source Transformations for HLS
LN Pouchet, E Tucker, N Zhang, H Chen, D Pal, G Rodríguez, Z Zhang
Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024
12024
A Comprehensive Evaluation of FPGA-Based Spatial Acceleration of LLMs
H Chen, J Zhang, Y Du, S Xiang, Z Yue, N Zhang, Y Cai, Z Zhang
Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024
2024
Supporting a Virtual Vector Instruction Set on a Commercial Compute-in-SRAM Accelerator
C Golden, D Ilan, C Huang, N Zhang, Z Zhang, C Batten
IEEE Computer Architecture Letters, 2023
2023
RapidLayout: Fast Hard Block Placement of FPGA-optimized Systolic Arrays Using Evolutionary Algorithm
N Zhang, X Chen, N Kapre
30th International Conference on Field Programmable Logic and Applications (FPL), 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–12