Journal of Advances in Developmental Research

E-ISSN: 0976-4844     Impact Factor: 9.71

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal


Demystifying Deep Learning Compiler Optimizations for Training and Inference

Author(s) Vishakha Agrawal
Country United States
Abstract Deep learning has achieved tremendous success in recent years, powering many artificial intelligence applications. However, deep learning models are computationally intensive to train, requiring massive amounts of data and compute resources. Once trained, the models must be deployed for inference to make predictions on new data, and the hardware used for training often differs from the hardware used for inference. Deep learning compilers have revolutionized the field of artificial intelligence by optimizing the performance of deep learning models across these varied hardware platforms. In the current research landscape, however, there is a notable absence of comprehensive studies that differentiate between compiler optimizations and methodologies for training versus inference. This paper provides a detailed description of deep learning compiler optimizations, treating training and inference separately, and investigates the challenges, opportunities, and design considerations for compilers targeting each phase.
Keywords Training, Inference, Optimization, ASIC, Fusion, Quantization, Mixed Precision, Dynamic Batching, Pruning
Published In Volume 12, Issue 2, July-December 2021
Published On 2021-09-08
Cite This Demystifying Deep Learning Compiler Optimizations for Training and Inference - Vishakha Agrawal - IJAIDR, Volume 12, Issue 2, July-December 2021. DOI: 10.5281/zenodo.14551855
DOI https://doi.org/10.5281/zenodo.14551855
Short DOI https://doi.org/g8wtgq
