z-logo
open-access-imgOpen Access
Automated cleaning of identity label noise in a large face dataset with quality control
Author(s) -
Al Jazaery Mohamad,
Guo Guodong
Publication year - 2020
Publication title -
iet biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.434
H-Index - 28
eISSN - 2047-4946
pISSN - 2047-4938
DOI - 10.1049/iet-bmt.2019.0081
Subject(s) - computer science , artificial intelligence , face (sociological concept) , facial recognition system , quality (philosophy) , outlier , bootstrapping (finance) , pattern recognition (psychology) , noise (video) , precision and recall , identity (music) , machine learning , image (mathematics) , social science , philosophy , physics , epistemology , sociology , acoustics , financial economics , economics
For face recognition, some very large‐scale datasets are publicly available in recent years, which are usually collected from the Internet using search engines, and thus have many faces with wrong identity (ID) labels (outliers). Additionally, the face images in these datasets have different qualities because of uncontrolled situations. The authors propose a novel approach for cleaning the ID label error, handling face images in different qualities. The face ID labels cleaned by their method can train better models for low‐quality face recognition since more low‐quality images are correctly labelled for training a deep model. In their low‐to‐high‐quality face verification experiments, the deep model trained on their cleaning results of MS‐Celeb‐1M.v1 face dataset outperforms the same model trained on the same dataset cleaned by the semantic bootstrapping method. They also apply their ID label cleaning method on a subset of the cross‐age celebrity dataset (CACD) face dataset, in which their quality‐based cleaning can deliver higher precision and recall than a previous method on detecting the ID label errors.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here