This project involves developing a facial recognition application that identifies gender and age from images.
The primary dataset used in this project is the UTKFace dataset. UTKFace is a large-scale face dataset with a wide age range (from 0 to 116 years old). It consists of over 20,000 face images annotated with age, gender, and ethnicity.
- Links to UTKFace:
AgeDB contains 16,488 images of various famous people, such as actors/actresses, writers, scientists, politicians, etc. Every image is annotated with respect to the identity, age, and gender attribute. There exist a total of 568 distinct subjects. The average number of images per subject is 29. The minimum and maximum age is 1 and 101, respectively. The average age range for each subject is 50.3 years.
- Links to AgeDB:
The IMDB-WIKI dataset is one of the largest publicly available datasets of face images with age and gender labels. It contains more than 500,000 images. This dataset is primarily used for pre-training models rather than for final evaluation.
To use this dataset, download `imdb_crop.tar` and `wiki_crop.tar`, and place them in `./data/IMDB` and `./data/WIKI`, respectively.
The Adience dataset contains 26,580 photos across 2,284 subjects with a binary gender label and one label from eight different age groups, partitioned into five splits. It is used as a benchmark. You can find the state-of-the-art models here for age and here for gender, respectively.
- Links to Adience:
The data pre-processing steps involve loading the datasets, augmenting the images, and creating data generators for training and testing. The `load_data` function loads the UTKFace, AgeDB, WIKI, and IMDB datasets, extracting age and gender information from the filenames and storing them in a pandas DataFrame. Data generators are created using TensorFlow's `ImageDataGenerator` class, which applies various augmentations to the training images to improve the model's robustness. The `create_generators` function sets up these generators for both training and testing datasets. A custom data generator reshapes the gender labels to match the expected input shape for the model, ensuring that the data is correctly formatted for training. The `dataset_from_generator` function creates a TensorFlow dataset from the custom generator, which can be used for training and evaluation.
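A minimal sketch of what these helpers might look like for the UTKFace portion, assuming the standard `[age]_[gender]_[race]_[date].jpg` naming convention, 200x200 inputs, and illustrative augmentation settings; the project's actual implementations also cover AgeDB, WIKI, and IMDB and may differ in detail:

```python
import os
import pandas as pd
import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

def load_data(utkface_dir):
    """Parse UTKFace filenames ([age]_[gender]_[race]_[date].jpg) into a DataFrame."""
    records = []
    for name in os.listdir(utkface_dir):
        parts = name.split("_")
        if len(parts) < 4:
            continue  # skip files that do not follow the naming convention
        records.append({
            "filename": os.path.join(utkface_dir, name),
            "age": int(parts[0]),
            "gender": int(parts[1]),  # 0 = male, 1 = female in UTKFace
        })
    return pd.DataFrame(records)

def create_generators(df, img_size=(200, 200), batch_size=32):
    """Build an augmented training generator and a validation generator."""
    aug = ImageDataGenerator(
        rescale=1.0 / 255,
        rotation_range=15,
        horizontal_flip=True,
        zoom_range=0.1,
        validation_split=0.2,
    )
    common = dict(
        dataframe=df,
        x_col="filename",
        y_col=["age", "gender"],
        target_size=img_size,
        class_mode="raw",   # yields labels as a (batch, 2) numeric array
        batch_size=batch_size,
    )
    train_gen = aug.flow_from_dataframe(subset="training", **common)
    val_gen = aug.flow_from_dataframe(subset="validation", **common)
    return train_gen, val_gen

def dataset_from_generator(gen, img_size=(200, 200)):
    """Wrap a Keras generator in a tf.data.Dataset, splitting labels into the
    (age, gender) structure expected by a two-output model and reshaping gender."""
    def wrapped():
        for images, labels in gen:
            ages = labels[:, 0].astype("float32")
            genders = labels[:, 1].reshape(-1, 1).astype("float32")
            yield images, (ages, genders)
    return tf.data.Dataset.from_generator(
        wrapped,
        output_signature=(
            tf.TensorSpec(shape=(None, *img_size, 3), dtype=tf.float32),
            (
                tf.TensorSpec(shape=(None,), dtype=tf.float32),
                tf.TensorSpec(shape=(None, 1), dtype=tf.float32),
            ),
        ),
    )
```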
By following these pre-processing steps, the data is prepared for training the model, ensuring that the images are appropriately augmented and the labels are correctly formatted. The project uses models like ArcFace to extract features and embeddings from the images, which are then used for age and gender prediction.
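As one illustration of the ArcFace feature-extraction step, the `insightface` package provides pre-trained ArcFace embeddings; this is a hypothetical sketch of that route and not necessarily how the project wires ArcFace in:

```python
import cv2
from insightface.app import FaceAnalysis  # pip install insightface onnxruntime

app = FaceAnalysis(name="buffalo_l")        # bundles face detection + ArcFace recognition
app.prepare(ctx_id=0, det_size=(640, 640))  # ctx_id=0 uses the first GPU, -1 for CPU

img = cv2.imread("face.jpg")                # insightface expects a BGR numpy array
faces = app.get(img)
if faces:
    embedding = faces[0].normed_embedding   # 512-d ArcFace feature vector
```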
The project uses deep learning models to predict age and gender from facial images. The model architectures are based on the VGG16 and ResNet networks, which are pre-trained on the ImageNet dataset and used for feature extraction. The models are fine-tuned for the specific tasks of age and gender prediction.
The VGG16 model is a convolutional neural network that is 16 layers deep. It is widely used for image classification tasks and is known for its simplicity and effectiveness. In this project, VGG16 is used as a feature extractor (backbone), leveraging its pre-trained weights on the ImageNet dataset to provide a strong foundation for age and gender prediction.
- Feature Extraction: The VGG16 model is used as the base for feature extraction. The top layers of VGG16 are removed, and the remaining layers are frozen to prevent them from being updated during training.
- Flattening: The output of the VGG16 model is flattened to create a single long feature vector.
- Dense Layers: Two separate dense branches are added, one for age prediction and one for gender prediction. Each branch contains additional dense layers that further process the extracted features.
- Output Layers:
- The age prediction output layer uses a linear activation function.
- The gender prediction output layer uses a sigmoid activation function.
The model is compiled with the following configurations:
- Optimizer: AdamW
- Loss Functions:
- Age: Mean Squared Error (MSE)
- Gender: Binary Cross-Entropy
- Metrics:
- Age: Mean Absolute Error (MAE)
- Gender: Accuracy
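A minimal sketch of this two-head VGG16 setup, assuming 200x200 RGB inputs and illustrative dense-layer widths (the exact sizes and hyperparameters used in the project may differ):

```python
from tensorflow.keras import Input, Model, layers
from tensorflow.keras.applications import VGG16
from tensorflow.keras.optimizers import AdamW  # available in tf.keras from TF 2.11

# Frozen VGG16 backbone with ImageNet weights and the classifier head removed.
inputs = Input(shape=(200, 200, 3))
backbone = VGG16(weights="imagenet", include_top=False, input_tensor=inputs)
backbone.trainable = False

x = layers.Flatten()(backbone.output)

# Age branch: regression head with a linear output.
age = layers.Dense(256, activation="relu")(x)
age = layers.Dense(128, activation="relu")(age)
age_out = layers.Dense(1, activation="linear", name="age")(age)

# Gender branch: binary classification head with a sigmoid output.
gender = layers.Dense(256, activation="relu")(x)
gender = layers.Dense(128, activation="relu")(gender)
gender_out = layers.Dense(1, activation="sigmoid", name="gender")(gender)

model = Model(inputs=inputs, outputs=[age_out, gender_out])
model.compile(
    optimizer=AdamW(learning_rate=1e-4),
    loss={"age": "mse", "gender": "binary_crossentropy"},
    metrics={"age": "mae", "gender": "accuracy"},
)
```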
- Links to VGG16:
The ResNet50 model is a convolutional neural network that is 50 layers deep. It is known for its residual learning framework, which allows very deep networks to be trained without the vanishing gradient problem. In this project, ResNet50 is used as a feature extractor (backbone), leveraging its pre-trained weights on the ImageNet dataset to provide a strong foundation for age and gender prediction.
- Feature Extraction: The ResNet50 model is used as the base for feature extraction. The top layers of ResNet50 are removed, and the remaining layers are frozen to prevent them from being updated during training.
- Flattening: The output of the ResNet50 model is flattened to create a single long feature vector.
- Dense Layers: Two separate dense branches are added, one for age prediction and one for gender prediction. Each branch contains additional dense layers that further process the extracted features.
- Output Layers:
- The age prediction output layer uses a linear activation function.
- The gender prediction output layer uses a sigmoid activation function.
The model is compiled with the following configurations:
- Optimizer: AdamW
- Loss Functions:
- Age: Mean Squared Error (MSE)
- Gender: Binary Cross-Entropy
- Metrics:
- Age: Mean Absolute Error (MAE)
- Gender: Accuracy
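The ResNet50 variant follows the same pattern; a sketch of just the backbone swap, with the heads and the compile call identical to the VGG16 sketch above:

```python
from tensorflow.keras import Input
from tensorflow.keras.applications import ResNet50

inputs = Input(shape=(200, 200, 3))
backbone = ResNet50(weights="imagenet", include_top=False, input_tensor=inputs)
backbone.trainable = False  # keep the pre-trained ImageNet features frozen
# ...flatten, the two dense branches, and model.compile(...) as in the VGG16 sketch.
```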
- Links to ResNet50: