Abstract
Model partitioning is a promising solution to reduce the high computation load and transmission of high-volume data. Within the scope of Edge AI, the fundamentals of model partitioning involve splitting the model for local computing at the edge and offloading heavy computation tasks to the cloud or server. This approach benefits scenarios with limited computing and battery capacity with low latency requirements, such as connected autonomous vehicles. However, while model partitioning offers advantages in reducing the onboard computation, memory requirements and inference time, it also introduces challenges such as increased energy consumption for partitioned computations and overhead for transferring partitioned data/model. In this work, we explore hybrid model partitioning to optimize computational and communication energy consumption. Our results provide an initial analysis of the tradeoff between energy and accuracy, focusing on the energy-aware model partitioning for future Edge AI applications.
Original language | English |
---|---|
Title of host publication | Proceedings - 2024 IEEE/ACM Symposium on Edge Computing, SEC 2024 |
Publisher | IEEE |
Pages | 526-527 |
Number of pages | 2 |
ISBN (Electronic) | 9798350378283 |
DOIs | |
Publication status | Published - 2024 |
Event | 9th Annual IEEE/ACM Symposium on Edge Computing, SEC 2024 - Radisson Blu GHR Hotel, Rome, Italy Duration: 4 Dec 2024 → 7 Dec 2024 https://acm-ieee-sec.org/2024/ |
Conference
Conference | 9th Annual IEEE/ACM Symposium on Edge Computing, SEC 2024 |
---|---|
Country/Territory | Italy |
City | Rome |
Period | 4/12/24 → 7/12/24 |
Internet address |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-careOtherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Keywords
- Edge AI
- Energy efficiency
- Model Partitioning