Taming irregular EDA applications on GPUs Y Deng, BD Wang, S Mu Proceedings of the 2009 International Conference on Computer-Aided Design …, 2009 | 120 | 2009 |
IP routing processing with graphic processors S Mu, X Zhang, N Zhang, J Lu, YS Deng, S Zhang 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010 …, 2010 | 92 | 2010 |
WADE: Writeback-aware dynamic cache management for NVM-based main memory system Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013 | 58 | 2013 |
Toward real-time ray tracing: A survey on hardware acceleration and microarchitecture techniques Y Deng, Y Ni, Z Li, S Mu, W Zhang ACM Computing Surveys (CSUR) 50 (4), 1-41, 2017 | 57 | 2017 |
Evaluating the potential of graphics processors for high performance embedded computing S Mu, C Wang, M Liu, D Li, M Zhu, X Chen, X Xie, Y Deng 2011 Design, Automation & Test in Europe, 1-6, 2011 | 38 | 2011 |
Orchestrating cache management and memory scheduling for GPGPU applications S Mu, Y Deng, Y Chen, H Li, J Pan, W Zhang, Z Wang IEEE Transactions on Very Large Scale Integration (VLSI) Systems 22 (8 …, 2013 | 29 | 2013 |
GPU accelerated sparse matrix‐vector multiplication and sparse matrix‐transpose vector multiplication Y Tao, Y Deng, S Mu, Z Zhang, M Zhu, L Xiao, L Ruan Concurrency and Computation: Practice and Experience 27 (14), 3771-3789, 2015 | 18 | 2015 |
Towards accelerating irregular EDA applications with GPUs H Qian, Y Deng, B Wang, S Mu Integration 45 (1), 46-60, 2012 | 10 | 2012 |
NPGPU: network processing on graphics processing units Y Deng, X Jiao, S Mu, K Kang, Y Zhu Theoretical and Mathematical Foundations of Computer Science: Second …, 2011 | 10 | 2011 |
The potential of GPUs for VLSI physical design automation Y Deng, S Mu 2008 9th International Conference on Solid-State and Integrated-Circuit …, 2008 | 10 | 2008 |
Electronic design automation with graphic processors: A survey Y Deng, S Mu Foundations and Trends® in Electronic Design Automation 7 (1–2), 1-176, 2013 | 7 | 2013 |
Atomic reduction based sparse matrix-transpose vector multiplication on GPUs Y Tao, Y Deng, S Mu, M Zhu, L Xiao, L Ruan, Z Huang 2014 20th IEEE International Conference on Parallel and Distributed Systems …, 2014 | 5 | 2014 |
FastLanes: An FPGA accelerated GPU microarchitecture simulator K Fang, Y Ni, J He, Z Li, S Mu, Y Deng 2013 IEEE 31st International Conference on Computer Design (ICCD), 241-248, 2013 | 4 | 2013 |
Toward concurrent lock-free queues on gpus X Zhang, Y Deng, S Mu IEICE TRANSACTIONS on Information and Systems 97 (7), 1901-1904, 2014 | 1 | 2014 |
Exploiting the Task-Pipelined Parallelism of Stream Programs on Many-Core GPUs S Mu, D Li, Y Chen, Y Deng, Z Wang IEICE TRANSACTIONS on Information and Systems 96 (10), 2194-2207, 2013 | 1 | 2013 |
Toward eda computing on gpus Y Deng, S Mu, Z Wang 2009 International Conference on Communications, Circuits and Systems, 1119-1123, 2009 | 1 | 2009 |
Performance Optimization for Sparse AtAx in Parallel on Multicore CPU Y Tao, Y Deng, S Mu, Z Zhang, M Zhu, L Xiao, L Ruan IEICE TRANSACTIONS on Information and Systems 97 (2), 315-318, 2014 | | 2014 |