TY - JOUR
T1 - A Survey on Optimization Techniques for Edge Artificial Intelligence (AI)
AU - Surianarayanan, Chellammal
AU - Lawrence, John Jeyasekaran
AU - Chelliah, Pethuru Raj
AU - Prakash, Edmond
AU - Hewage, Chaminda
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/1/22
Y1 - 2023/1/22
N2 - Artificial Intelligence (Al) models are being produced and used to solve a variety of current and future business and technical problems. Therefore, AI model engineering processes, platforms, and products are acquiring special significance across industry verticals. For achieving deeper automation, the number of data features being used while generating highly promising and productive AI models is numerous, and hence the resulting AI models are bulky. Such heavyweight models consume a lot of computation, storage, networking, and energy resources. On the other side, increasingly, AI models are being deployed in IoT devices to ensure real-time knowledge discovery and dissemination. Real-time insights are of paramount importance in producing and releasing real-time and intelligent services and applications. Thus, edge intelligence through on-device data processing has laid down a stimulating foundation for real-time intelligent enterprises and environments. With these emerging requirements, the focus turned towards unearthing competent and cognitive techniques for maximally compressing huge AI models without sacrificing AI model performance. Therefore, AI researchers have come up with a number of powerful optimization techniques and tools to optimize AI models. This paper is to dig deep and describe all kinds of model optimization at different levels and layers. Having learned the optimization methods, this work has highlighted the importance of having an enabling AI model optimization framework.
AB - Artificial Intelligence (Al) models are being produced and used to solve a variety of current and future business and technical problems. Therefore, AI model engineering processes, platforms, and products are acquiring special significance across industry verticals. For achieving deeper automation, the number of data features being used while generating highly promising and productive AI models is numerous, and hence the resulting AI models are bulky. Such heavyweight models consume a lot of computation, storage, networking, and energy resources. On the other side, increasingly, AI models are being deployed in IoT devices to ensure real-time knowledge discovery and dissemination. Real-time insights are of paramount importance in producing and releasing real-time and intelligent services and applications. Thus, edge intelligence through on-device data processing has laid down a stimulating foundation for real-time intelligent enterprises and environments. With these emerging requirements, the focus turned towards unearthing competent and cognitive techniques for maximally compressing huge AI models without sacrificing AI model performance. Therefore, AI researchers have come up with a number of powerful optimization techniques and tools to optimize AI models. This paper is to dig deep and describe all kinds of model optimization at different levels and layers. Having learned the optimization methods, this work has highlighted the importance of having an enabling AI model optimization framework.
KW - AI model optimization
KW - artificial intelligence
KW - edge AI
KW - energy efficient methods for edge AI
KW - federated learning
KW - optimization methods for edge AI
UR - http://www.scopus.com/inward/record.url?scp=85147856754&partnerID=8YFLogxK
U2 - 10.3390/s23031279
DO - 10.3390/s23031279
M3 - Review article
C2 - 36772319
AN - SCOPUS:85147856754
SN - 1424-8220
VL - 23
JO - Sensors
JF - Sensors
IS - 3
M1 - 1279
ER -