Sana Damani
Verified email at nvidia.com
Title
Cited by
Year
Speculative reconvergence for improved SIMT efficiency
S Damani, DR Johnson, M Stephenson, SW Keckler, E Yan, M McKeown, ...
Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020
Cited by 12, 2020
VEGETA: Vertically-integrated extensions for sparse/dense GEMM tile acceleration on CPUs
G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ...
2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023
Cited by 9, 2023
GPU subwarp interleaving
S Damani, M Stephenson, R Rangan, D Johnson, R Kulkarni, SW Keckler
2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022
Cited by 6, 2022
Common subexpression convergence: A new code optimization for SIMT processors
S Damani, V Sarkar
Languages and Compilers for Parallel Computing: 32nd International Workshop …, 2021
Cited by 4, 2021
Techniques for divergent thread group execution scheduling
S Damani, M Stephenson, R Rangan, DR Johnson, R Kulkarni
US Patent 11,934,867, 2024
Cited by 2, 2024
cuCatch: A Debugging Tool for Efficiently Catching Memory Safety Violations in CUDA Applications
M Tarek Ibn Ziad, S Damani, A Jaleel, SW Keckler, M Stephenson
Proceedings of the ACM on Programming Languages 7 (PLDI), 124-147, 2023
Cited by 1, 2023
WASP: Exploiting GPU Pipeline Parallelism with Hardware-Accelerated Automatic Warp Specialization
NC Crago, S Damani, K Sankaralingam, SW Keckler
2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024
2024
Convergence among concurrently executing threads
DR Johnson, J Choquette, O Giroux, MP McKeown, M Stephenson, ...
US Patent 11,847,508, 2023
2023
Software-directed register file sharing
S Damani, S Treichler, M Stephenson
US Patent App. 17/697,325, 2023
2023
Software-directed divergent branch target prioritization
S Damani, S Treichler, M Stephenson, DR Johnson
US Patent App. 17/568,514, 2023
2023
Convergence among concurrently executing threads
DR Johnson, J Choquette, O Giroux, MP McKeown, M Stephenson, ...
US Patent 11,442,795, 2022
2022
Optimized scheduling and resource allocation for thread parallel architectures
S Damani
Georgia Institute of Technology, 2022
2022
Memory access scheduling to reduce thread migrations
S Damani, P Barua, V Sarkar
Proceedings of the 31st ACM SIGPLAN International Conference on Compiler …, 2022
2022