Arash Bakhtiari

Cited by

	All	Since 2019
Citations	531	525
h-index	7	6
i10-index	4	4

460

230

115

345

201720182019202020212022202320244 2 14 3 10 12 34 450

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

He YuxiongMicrosoft ResearchVerified email at microsoft.com
Reza Yazdani AminabadiMicrosoft ResearchVerified email at microsoft.com
Olatunji RuwaseMicrosoft ResearchVerified email at microsoft.com
Haojun XiaUniversity of SydneyVerified email at uni.sydney.edu.au
Zhewei YaoSnowflakeVerified email at snowflake.com
Zhen ZHENGMicrosoftVerified email at microsoft.com
Martin SchreiberUniversité Grenoble Alpes / InriaVerified email at univ-grenoble-alpes.fr
Philipp NeumannDESY/ Universität HamburgVerified email at desy.de
Xiaoxia (Shirley) Wu 吴晓霞MicrosoftVerified email at microsoft.com
Jeff RasleyMicrosoftVerified email at microsoft.com
Ammar Ahmad AwanMicrosoftVerified email at osu.edu
Connor HolmesOpenAIVerified email at openai.com
Heyang QinMicrosoftVerified email at microsoft.com
George BirosProfessor, Oden Institute for Computational Engineering and Sciences, The University of Texas atVerified email at acm.org
Dhairya MalhotraResearch Scientist, Flatiron InstituteVerified email at flatironinstitute.org
Miriam SchulteUniversity of StuttgartVerified email at ipvs.uni-stuttgart.de

Arash Bakhtiari

Unknown affiliation

Verified email at bakhtiari.org - Homepage

High Performance Computing Parallel Algorithms Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 2024	386	2024
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks T Bannink, A Bakhtiari, A Hillier, L Geiger, T de Bruin, L Overweel, ... arXiv preprint arXiv:2011.09398, 2020	49	2020
A holistic scalable implementation approach of the lattice Boltzmann method for CPU/GPU heterogeneous clusters C Riesinger, A Bakhtiari, M Schreiber, P Neumann, HJ Bungartz Computation 5 (4), 48, 2017	35	2017
Deepspeed-fastgen: High-throughput text generation for llms via mii and deepspeed-inference C Holmes, M Tanaka, M Wyatt, AA Awan, J Rasley, S Rajbhandari, ... arXiv preprint arXiv:2401.08671, 2024	22	2024
Fp6-llm: Efficiently serving large language models through fp6-centric algorithm-system co-design H Xia, Z Zheng, X Wu, S Chen, Z Yao, S Youn, A Bakhtiari, M Wyatt, ... arXiv preprint arXiv:2401.14112, 2024	9	2024
Phi-3 technical report: A highly capable language model locally on your phone, 2024 M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... URL https://arxiv. org/abs/2404.14219, 2024	7	2024
A parallel arbitrary-order accurate amr algorithm for the scalar advection-diffusion equation A Bakhtiari, D Malhotra, A Raoofy, M Mehl, HJ Bungartz, G Biros SC'16: Proceedings of the International Conference for High Performance …, 2016	7	2016
Zeroquant (4+ 2): Redefining llms quantization with a new fp6-centric strategy for diverse generative tasks X Wu, H Xia, S Youn, Z Zheng, S Chen, A Bakhtiari, M Wyatt, ... arXiv preprint arXiv:2312.08583, 2023	6	2023
Phi-3 technical report: A highly capable language model locally on your phone. arXiv 2024 M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 0	5
Phi-3 technical report: A highly capable language model locally on your phone. arXiv M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 2024	3	2024
MPI Parallelization of GPU-based Lattice Boltzmann Simulations A Bakhtiari, C Riesinger, M Schreiber, P Neumann	2	2013
{Quant-LLM}: Accelerating the Serving of Large Language Models via {FP6-Centric}{Algorithm-System}{Co-Design} on Modern {GPUs} H Xia, Z Zheng, X Wu, S Chen, Z Yao, S Youn, A Bakhtiari, M Wyatt, ... 2024 USENIX Annual Technical Conference (USENIX ATC 24), 699-713, 2024		2024
High Order Adaptive Semi-Lagrangian/Volume-Integral Methods for Parallel Advection-Diffusion Simulations A Bakhtiari Technische Universität München, 2017		2017
A Distributed Memory Advection-Diffusion Solver with Dynamic Adaptive Trees A Bakhtiari		2015
Implementation of a 3D Seismic Wave Model on Adaptive Meshes in PeanoClaw (BGCE Honours Project) D Pinaev, A Bakhtiari		2013

The system can't perform the operation now. Try again later.

Articles 1–15

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors