Arash Bakhtiari
Unknown affiliation
Verified email at bakhtiari.org
Title
Cited by
Year
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 386, Year: 2024
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks
T Bannink, A Bakhtiari, A Hillier, L Geiger, T de Bruin, L Overweel, ...
arXiv preprint arXiv:2011.09398, 2020
Cited by: 49, Year: 2020
A holistic scalable implementation approach of the lattice Boltzmann method for CPU/GPU heterogeneous clusters
C Riesinger, A Bakhtiari, M Schreiber, P Neumann, HJ Bungartz
Computation 5 (4), 48, 2017
Cited by: 35, Year: 2017
Deepspeed-fastgen: High-throughput text generation for llms via mii and deepspeed-inference
C Holmes, M Tanaka, M Wyatt, AA Awan, J Rasley, S Rajbhandari, ...
arXiv preprint arXiv:2401.08671, 2024
Cited by: 22, Year: 2024
Fp6-llm: Efficiently serving large language models through fp6-centric algorithm-system co-design
H Xia, Z Zheng, X Wu, S Chen, Z Yao, S Youn, A Bakhtiari, M Wyatt, ...
arXiv preprint arXiv:2401.14112, 2024
Cited by: 9, Year: 2024
Phi-3 technical report: A highly capable language model locally on your phone, 2024
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
URL https://arxiv.org/abs/2404.14219, 2024
Cited by: 7, Year: 2024
A parallel arbitrary-order accurate amr algorithm for the scalar advection-diffusion equation
A Bakhtiari, D Malhotra, A Raoofy, M Mehl, HJ Bungartz, G Biros
SC'16: Proceedings of the International Conference for High Performance …, 2016
Cited by: 7, Year: 2016
Zeroquant (4+ 2): Redefining llms quantization with a new fp6-centric strategy for diverse generative tasks
X Wu, H Xia, S Youn, Z Zheng, S Chen, A Bakhtiari, M Wyatt, ...
arXiv preprint arXiv:2312.08583, 2023
Cited by: 6, Year: 2023
Phi-3 technical report: A highly capable language model locally on your phone. arXiv 2024
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219
Cited by: 5
Phi-3 technical report: A highly capable language model locally on your phone. arXiv
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 3, Year: 2024
MPI Parallelization of GPU-based Lattice Boltzmann Simulations
A Bakhtiari, C Riesinger, M Schreiber, P Neumann
Cited by: 2, Year: 2013
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs
H Xia, Z Zheng, X Wu, S Chen, Z Yao, S Youn, A Bakhtiari, M Wyatt, ...
2024 USENIX Annual Technical Conference (USENIX ATC 24), 699-713, 2024
Year: 2024
High Order Adaptive Semi-Lagrangian/Volume-Integral Methods for Parallel Advection-Diffusion Simulations
A Bakhtiari
Technische Universität München, 2017
Year: 2017
A Distributed Memory Advection-Diffusion Solver with Dynamic Adaptive Trees
A Bakhtiari
Year: 2015
Implementation of a 3D Seismic Wave Model on Adaptive Meshes in PeanoClaw (BGCE Honours Project)
D Pinaev, A Bakhtiari
Year: 2013