Autotuning stencil-based computations on GPUs A Mametjanov, D Lowell, CC Ma, B Norris 2012 IEEE international conference on cluster computing, 266-274, 2012 | 56 | 2012 |
MIOpen: An open source library for deep learning primitives J Khan, P Fultz, A Tamazov, D Lowell, C Liu, M Melesse, ... arXiv preprint arXiv:1910.00078, 2019 | 39 | 2019 |
Compiler techniques to reduce the synchronization overhead of gpu redundant multithreading M Gupta, D Lowell, J Kalamatianos, S Raasch, V Sridharan, D Tullsen, ... Proceedings of the 54th Annual Design Automation Conference 2017, 1-6, 2017 | 35 | 2017 |
Stencil-aware GPU optimization of iterative solvers D Lowell, J Godwin, J Holewinski, D Karthik, C Choudary, A Mametjanov, ... SIAM Journal on Scientific Computing 35 (5), S209-S228, 2013 | 27 | 2013 |
Bufferless communication for redundant multithreading using register permutation DI Lowell, M Gupta US Patent 10,303,472, 2019 | 18* | 2019 |
Fingerprinting of redundant threads using compiler-inserted transformation code DI Lowell US Patent 10,013,240, 2018 | 12 | 2018 |
Performance evaluation of compiler-based software rmt in an hsa environment C Kalra, D Lowell, J Kalamatianos, V Sridharan, D Kaeli The 12th Workshop on Silicon Errors in Logic-System Effects, SELSE, 1-7, 2016 | 8 | 2016 |
Spartan: A Sparsity-Adaptive Framework to Accelerate Deep Neural Network Training on GPUs S Dong, Y Sun, NB Agostini, E Karimi, D Lowell, J Zhou, J Cano, ... IEEE Transactions on Parallel and Distributed Systems 32 (10), 2448-2463, 2021 | 7 | 2021 |
Performance-aware and reliability-aware data placement for n-level heterogeneous memory systems M Gupta, DA Roberts, MR Meswani, V Sridharan, S Raasch, DI Lowell US Patent 10,365,996, 2019 | 7 | 2019 |
Paired value comparison for redundant multi-threading operations DI Lowell, M Gupta US Patent 10,042,687, 2018 | 4* | 2018 |
GPGPU support in Chapel with the Radeon Open Compute platform ML Chu, AM Aji, D Lowell, K Hamidouche CHIUW, 2017 | 4 | 2017 |
Asar: Application-specific approximate recovery to mitigate hardware variability M Gupta, A Rahimi, D Lowell, J Kalamatianos, D Tullsen, R Gupta SELSE (Silicon Errors in Logic, System Effects), 2017 | 3 | 2017 |
Data sparsity monitoring during neural network training S Dong, DI Lowell US Patent 11,562,248, 2023 | 2 | 2023 |
Time space tradeoffs in GA based feature selection for workload characterization DE Tamir, C Novoa, D Lowell Trends in Applied Intelligent Systems: 23rd International Conference on …, 2010 | 2 | 2010 |
Method and apparatus for predicting kernel tuning parameters J Khan, DI Lowell US Patent App. 16/560,954, 2021 | 1 | 2021 |
Composable neural network kernels C Liu, DI Lowell, WH Chung, J Zhang US Patent App. 17/138,709, 2021 | | 2021 |
Composable neural network kernels C Liu, DI Lowell, WH Chung, J Zhang US Patent App. 16/779,557, 2020 | | 2020 |
Adaptive quantization for neural networks DI Lowell, S Voronov, M Daga US Patent App. 15/849,617, 2019 | | 2019 |
Automatic Differentiation A Radenski, B Norris, P Balaprakash, D Buntinas, A Chan, A Mametjanov, ... Proceedings of ParCo2013 180, 2115-2123, 2012 | | 2012 |
New Families and New Members of Integer Sequence Based Coding Methods D Lowell, DE Tamir 2009 Data Compression Conference, 456-456, 2009 | | 2009 |