Deep Learning for Multi-Label Facial Attribute Classification on Large-Scale Image Datasets (CelebA)
The exponential growth of data in recent years has led to an increasing demand for advanced techniques, especially those that work on large and complex data. This has given deep learning a significant advance in dealing with the tasks of analyzing, improving, and distinguishing big data. Our research focused on CNNs from this data and applying deep learning algorithms and their analysis to a large-scale image dataset. More specifically, our research focused on a dataset called CelebA, which contains more than 200,000 face images annotated with 40 binary facial features. It is a multi-label classification model based on the ResNet-50 architecture that has been fine-tuned to predict different facial features and hair color such as age, gender, and facial expressions. It was also trained using data augmentation, taking into account pose differences and background clutter to reduce imbalance between classes. These results reflect very strong predictive performance, with an average mean accuracy of 0.86 and an overall F1 score of 0.81 across all features. Attributes identified by clear visual cues—for example, “smiling,” “male ”and“ wearing lipstick”—were highly accurate, while less obvious attributes such as “big lips” and “narrow eyes” were more difficult to classify. We would like to point out that the results demonstrate the high efficiency of using deep learning models for multi-label classification on big data while solving problems associated with class imbalance and overfitting models. This research leads to the larger general field of big data analytics; in particular, it demonstrates how deep learning can be efficiently applied to large image datasets for automatic attribute recognition. It also opens up potential applications in areas such as biometric identification, surveillance, and human-computer interaction.
Volume & Issue
Vol. Volume 15 / Iss. Issue 1