Follow
Michel Steuwer
Michel Steuwer
Professor | Chair of Compilers and Programming Languages | TU Berlin
Verified email at tu-berlin.de - Homepage
Title
Cited by
Cited by
Year
Lift: a functional data-parallel IR for high-performance GPU code generation
M Steuwer, T Remmelg, C Dubach
2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017
2182017
SkelCL - A Portable Skeleton Library for High-Level GPU Programming
M Steuwer, P Kegel, S Gorlatch
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 …, 2011
1982011
Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code
M Steuwer, C Fensch, S Lindley, C Dubach
ACM SIGPLAN Notices 50 (9), 205-217, 2015
1852015
High performance stencil code generation with lift
B Hagedorn, L Stoltzfus, M Steuwer, S Gorlatch, C Dubach
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
1262018
dOpenCL: Towards A Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems
P Kegel, M Steuwer, S Gorlatch
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW …, 2012
562012
Achieving high-performance the functional way: a functional pearl on expressing high-performance optimizations as rewrite strategies
B Hagedorn, J Lenfers, T Koehler, X Qin, S Gorlatch, M Steuwer
Proceedings of the ACM on Programming Languages 4 (ICFP), 1-29, 2020
452020
Automatic matching of legacy code to heterogeneous APIs: An idiomatic approach
P Ginsbach, T Remmelg, M Steuwer, B Bodin, C Dubach, MFP O'Boyle
Proceedings of the Twenty-Third International Conference on Architectural …, 2018
412018
Just-in-time GPU compilation for interpreted languages with partial evaluation
J Fumero, M Steuwer, L Stadler, C Dubach
Proceedings of the 13th ACM SIGPLAN/SIGOPS International Conference on …, 2017
402017
SkelCL: Enhancing OpenCL for high-level programming of multi-GPU systems
M Steuwer, S Gorlatch
Parallel Computing Technologies: 12th International Conference, PaCT 2013 …, 2013
392013
Performance Portable GPU Code Generation for Matrix Multiplication
T Remmelg, T Lutz, M Steuwer, C Dubach
Proceedings of the 9th Annual Workshop on General Purpose Processing using …, 2016
382016
Autotuning OpenCL Workgroup Size for Stencil Patterns
C Cummins, P Petoumenos, M Steuwer, H Leather
The 6th International Workshop on Adaptive Self-tuning Computing Systems …, 2015
372015
Matrix Multiplication Beyond Auto-Tuning: Rewrite-based GPU Code Generation
M Steuwer, T Remmelg, C Dubach
CASES'16: Proceedings of the 2016 International Conference on Compilers …, 2016
352016
SkelCL: a high-level extension of OpenCL for multi-GPU systems
M Steuwer, S Gorlatch
The Journal of Supercomputing 69, 25-33, 2014
302014
Runtime code generation and data management for heterogeneous computing in java
JJ Fumero, T Remmelg, M Steuwer, C Dubach
Proceedings of the principles and practices of programming on the java …, 2015
292015
A composable array function interface for heterogeneous computing in java
JJ Fumero, M Steuwer, C Dubach
Proceedings of ACM SIGPLAN International Workshop on Libraries, Languages …, 2014
252014
Towards High-Level Programming of Multi-GPU Systems Using the SkelCL Library
M Steuwer, P Kegel, S Gorlatch
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW …, 2012
252012
High-level synthesis of functional patterns with lift
M Kristien, B Bodin, M Steuwer, C Dubach
Proceedings of the 6th ACM SIGPLAN International Workshop on Libraries …, 2019
232019
High-level programming of stencil computations on multi-GPU systems using the SkelCL library
M Steuwer, M Haidl, S Breuer, S Gorlatch
Parallel Processing Letters 24 (03), 1441005, 2014
212014
Extending the SkelCL skeleton library for stencil computations on multi-GPU systems
S Breuer, M Steuwer, S Gorlatch
Proceedings of the 1st International Workshop on High-Performance Stencil …, 2014
212014
Efficient auto-tuning of parallel programs with interdependent tuning parameters via auto-tuning framework (ATF)
A Rasch, R Schulze, M Steuwer, S Gorlatch
ACM Transactions on Architecture and Code Optimization (TACO) 18 (1), 1-26, 2021
192021
The system can't perform the operation now. Try again later.
Articles 1–20