Active Learning from Unreliable Data

Zilong Zhao; Sophie Cerf; Robert Birke; Bogdan Robu; Sara Bouchenak; Sonia Ben Mokhtar; Lydia y Chen

Communication Dans Un Congrès Année : 2019

Active Learning from Unreliable Data

(1) , (1) , (2) , (1) , (3) , (3) , (2)

1
2
3

Zilong Zhao

Fonction : Auteur
PersonId : 172877
IdHAL : zilong-zhao
IdRef : 255142862

GIPSA - Systèmes non linéaires et complexité

Sophie Cerf

Fonction : Auteur
PersonId : 169879
IdHAL : sophie-cerf
ORCID : 0000-0003-0122-0796

GIPSA - Systèmes non linéaires et complexité

Robert Birke

Fonction : Auteur

IBM Research Laboratory [Zurich]

Bogdan Robu

Fonction : Auteur
PersonId : 747277
IdHAL : bogdan-robu
ORCID : 0000-0001-7568-007X
IdRef : 156193779

GIPSA - Systèmes non linéaires et complexité

Sara Bouchenak

Fonction : Auteur
PersonId : 6304
IdHAL : sara-bouchenak
IdRef : 179480510

Distribution, Recherche d'Information et Mobilité

Sonia Ben Mokhtar

Fonction : Auteur
PersonId : 4352
IdHAL : sonia-ben-mokhtar
ORCID : 0000-0003-2821-7714
IdRef : 121974146

Distribution, Recherche d'Information et Mobilité

Lydia y Chen

Fonction : Auteur

IBM Research Laboratory [Zurich]

Résumé

Classification algorithms have been widely adopted in big recommendation systems, e.g., products, images and advertisements, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the field can be unreliable due to careless annotations or malicious data transformation. In our previous work, we proposed a two-layer learning framework for continuous learning in the presence of unreliable anomaly labels, it worked perfectly for two use cases, (i) detecting 10 classes of IoT attacks and (ii) predicting 4 classes of task failures of big data jobs. To continue this study, now we will challenge our framework with image dataset. The first layer of quality model filters the suspicious data, where the second layer of classification model predicts data instance's class. As we focus on the case of images, we will use widely studied datasets: MNIST, Cifar10, Cifar100 and Ima-geNet. Deep Neural Network (DNN) has demonstrated excellent performances in solving images classification problems, we will show that two collaborating DNN could construct a more robust and high accuracy model.

Mots clés

Deep Neural Network Machine Learning Attacks Images Unreliable Data

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV] Environnements Informatiques pour l'Apprentissage Humain

Fichier principal

eurosys2019.pdf (278.37 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Zilong ZHAO : Connectez-vous pour contacter le contributeur

https://hal.univ-grenoble-alpes.fr/hal-02045455

Soumis le : vendredi 22 février 2019-10:13:27

Dernière modification le : jeudi 4 avril 2024-21:13:28

Archivage à long terme le : jeudi 23 mai 2019-14:09:33

Dates et versions

hal-02045455 , version 1 (22-02-2019)

Identifiants

HAL Id : hal-02045455 , version 1

Citer

Zilong Zhao, Sophie Cerf, Robert Birke, Bogdan Robu, Sara Bouchenak, et al.. Active Learning from Unreliable Data. EuroDW 2019 - 13th EuroSys Doctoral Workshop, Mar 2019, Dresde, Germany. ⟨hal-02045455⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA TICE CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON GIPSA GIPSA-DA GIPSA-SYSCO LIRIS INSA-GROUPE UDL SILECS FIT

522 Consultations

527 Téléchargements

Active Learning from Unreliable Data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager