Techniques for Efficient Training of Large-Scale Deep Learning Models

Authors

  • Sumit Dahiya Apeejay College of Engineering, India

Abstract

The training of large-scale machine learning models has become a cornerstone of modern artificial intelligence (AI) research and applications. However, the computational demands and resource requirements associated with training such models are substantial, often leading to increased costs and longer training times. This paper reviews various strategies and techniques that have been developed to enhance the efficiency of training large-scale models. We focus on innovations in distributed computing, optimization algorithms, and hardware accelerators, and discuss their implications for scalability and performance.

Downloads

Published

2023-06-10

How to Cite

Dahiya, S. (2023). Techniques for Efficient Training of Large-Scale Deep Learning Models. MZ Computing Journal, 4(1). Retrieved from https://mzresearch.com/index.php/MZCJ/article/view/218