Hybrid Partitioning for Embedded and Distributed CNNs Inference on Edge Devices

Nihel Kaboubi; Loïc Letondeur; Thierry Coupaye; Frédéric Desprez; Denis Trystram

Communication Dans Un Congrès Année : 2022

Hybrid Partitioning for Embedded and Distributed CNNs Inference on Edge Devices

Partitionnement hybride pour une inférence distribuée et embarquée des CNNs sur les équipements en bordure de réseau.

(1) , (1) , (1) , (2) , (3)

1
2
3

Nihel Kaboubi

Fonction : Auteur
PersonId : 1222096

Orange Labs [Meylan]

Loïc Letondeur

Fonction : Auteur
PersonId : 1022843

Orange Labs [Meylan]

Thierry Coupaye

Fonction : Auteur
PersonId : 951328
ORCID : 0009-0005-5626-6304

Orange Labs [Meylan]

Frédéric Desprez

Fonction : Auteur
PersonId : 6600
IdHAL : frederic-desprez
IdRef : 034430563

Inria Grenoble - Rhône-Alpes

Denis Trystram

Fonction : Auteur
PersonId : 1129029

Laboratoire d'Informatique de Grenoble

Résumé

Convolutional Neural Networks (CNNs) and Deep Neural Networks (DNNs) are ubiquitously utilized in many Internet of Things applications, especially for real-time image-based analysis. In order to cope with concerns such as resiliency, privacy and near real time analysis, these models must be deployed on edge devices. Particularly for large models, the large number of parameters becomes a bottleneck for the inference process because edge devices are resource constrained, subjects to failures and/or hardware faults. New solutions to cope with these issues are required. This paper proposes a hybrid partitioning strategy, architecture and implementation (called HyPS), which identifies the best positions in the model structure to split the network structure into small partitions that fit resources constraints of edge devices noticeably by decreasing instantaneous memory needs. The generated partitions consume less memory than the original network and each partition can be processed almost separately, resulting in new ways to process CNN's execution at the edge. Thanks to this partitioning strategy, large CNNs inference can be run without modifying the main model architecture. The proposed approach is assessed on the well-known neural network structure of VGG16 for image classification. The results of the experimental campaign show that the partitioning method allows for the successful inference of large models on devices with limited overhead and high accuracy.

Mots clés

Distributed Inference Edge Computing Edge Intelligence Convolutional Neural Networks Internet of Things

Domaines

Informatique [cs]

Fichier principal

Conference_paper_2022HAL.pdf (1.75 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

nihel kaboubi : Connectez-vous pour contacter le contributeur

https://hal.univ-grenoble-alpes.fr/hal-03967388

Soumis le : mercredi 8 mars 2023-15:13:49

Dernière modification le : mercredi 18 décembre 2024-09:26:57

Archivage à long terme le : vendredi 9 juin 2023-18:02:45

Dates et versions

hal-03967388 , version 1 (08-03-2023)

Licence

Paternité

Identifiants

HAL Id : hal-03967388 , version 1

Citer

Nihel Kaboubi, Loïc Letondeur, Thierry Coupaye, Frédéric Desprez, Denis Trystram. Hybrid Partitioning for Embedded and Distributed CNNs Inference on Edge Devices. ANTIC 2022 - International conference on advanced network technologies and intelligent computing, Department of Computer Science Institute of Science Banaras Hindu University, Dec 2022, Varanasi, India. ⟨hal-03967388⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG INRIA2 MIAI ANR LIG_SIDCH

82 Consultations

171 Téléchargements

Hybrid Partitioning for Embedded and Distributed CNNs Inference on Edge Devices

Partitionnement hybride pour une inférence distribuée et embarquée des CNNs sur les équipements en bordure de réseau.

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager