Active Learning from Unreliable Data

Zilong Zhao 1 Sophie Cerf 1 Robert Birke 2 Bogdan Robu 1 Sara Bouchenak 3 Sonia Ben Mokhtar 3 Lydia Chen 2
1 GIPSA-SYSCO - SYSCO
GIPSA-DA - Département Automatique
3 DRIM - Distribution, Recherche d'Information et Mobilité
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : Classification algorithms have been widely adopted in large recommendation systems, e.g., for products, images and advertisements, under the common assumption that the data source is clean, i.e., that features and labels are correctly set. However, data collected from the field can be unreliable due to careless annotation or malicious data transformation. In our previous work, we proposed a two-layer learning framework for continuous learning in the presence of unreliable anomaly labels. It worked well for two use cases: (i) detecting 10 classes of IoT attacks and (ii) predicting 4 classes of task failures of big data jobs. To continue this study, we now challenge our framework with image datasets. The first layer, a quality model, filters out suspicious data, while the second layer, a classification model, predicts the class of each data instance. As we focus on images, we will use widely studied datasets: MNIST, CIFAR-10, CIFAR-100 and ImageNet. Deep Neural Networks (DNNs) have demonstrated excellent performance in solving image classification problems; we will show that two collaborating DNNs can form a more robust and more accurate model.
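
The two-layer idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract proposes two collaborating DNNs, whereas the sketch below swaps in a tiny nearest-centroid classifier so that it stays self-contained. The filtering rule (keep a sample only when the quality model's prediction agrees with its given label) and all function names (`fit_centroids`, `predict`, `filter_suspicious`, `two_layer_update`) are assumptions made for illustration.

```python
from collections import defaultdict


def fit_centroids(samples):
    """Fit a nearest-centroid model (stand-in for a DNN; an assumption).

    samples: list of (features, label) pairs, features being a list of floats.
    Returns a dict mapping each label to the mean feature vector of its class.
    """
    sums = {}
    counts = defaultdict(int)
    for x, y in samples:
        if y not in sums:
            sums[y] = [0.0] * len(x)
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def predict(centroids, x):
    """Return the label whose centroid is closest to x (squared Euclidean)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: sq_dist(centroids[y]))


def filter_suspicious(quality_model, batch):
    """Layer 1 (assumed rule): keep samples whose given label matches
    the quality model's own prediction; drop the rest as suspicious."""
    return [(x, y) for x, y in batch if predict(quality_model, x) == y]


def two_layer_update(clean_seed, noisy_batch):
    """One round: filter the incoming batch, then train the classifier.

    Layer 1 is fitted on a trusted seed set; layer 2 is fitted on the
    seed set plus whatever part of the batch survived the filter.
    """
    quality_model = fit_centroids(clean_seed)
    kept = filter_suspicious(quality_model, noisy_batch)
    classifier = fit_centroids(clean_seed + kept)
    return classifier, kept
```

In a continuous-learning setting, `two_layer_update` would be called once per incoming batch, with the retained samples added to the trusted set for the next round; that loop is left out here because the abstract does not specify how the two models are refreshed.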

http://hal.univ-grenoble-alpes.fr/hal-02045455
Contributor : Zilong Zhao
Submitted on : Friday, February 22, 2019 - 10:13:27 AM
Last modification on : Thursday, November 21, 2019 - 1:44:09 AM
Long-term archiving on: Thursday, May 23, 2019 - 2:09:33 PM

File

eurosys2019.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02045455, version 1

Citation

Zilong Zhao, Sophie Cerf, Robert Birke, Bogdan Robu, Sara Bouchenak, et al. Active Learning from Unreliable Data. 13th EuroSys Doctoral Workshop (EuroDW 2019), Mar 2019, Dresden, Germany. ⟨hal-02045455⟩
