
Bias-Mitigation-using-Knowledge-Distillation

Reproduced the results on the CIFAR-10 dataset using EfficientNet-B0.
Test accuracy: 0.9142

[Figure: per-class precision, recall, and F1-score of EfficientNet-B0 on CIFAR-10]

From the results we can observe that some classes have higher recall (up to 0.97) while others have lower recall (down to 0.84). Similar trends are observed for the other performance metrics (precision and F1-score). This is most likely an algorithmic bias: the model favours features or patterns in the data that are more prevalent in some classes than in others. For example, if a model is trained on images of cats and dogs, and the dog images have more distinct features, the model may be biased towards predicting "dog" even when the image shows a cat.

It is also possible that the model is overfitting to some classes during training, leading to better performance on those classes at test time. This can happen if the model is too complex or the training data is too small relative to the number of model parameters.
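The per-class gap noted above can be verified directly from a confusion matrix. The sketch below uses illustrative numbers (not the actual CIFAR-10 results) to show how per-class recall is computed:

```python
import numpy as np

# Hypothetical 3-class confusion matrix: rows = true class, cols = predicted.
cm = np.array([
    [97,  2,  1],   # class 0: few confusions -> high recall
    [ 5, 90,  5],   # class 1
    [10,  6, 84],   # class 2: more confusions -> lower recall
])

# Recall per class = correct predictions / total true samples of that class.
recall = np.diag(cm) / cm.sum(axis=1)
print(recall)  # per-class recall: 0.97, 0.90, 0.84
```

Large spreads in this vector (here 0.97 vs 0.84) are the symptom of the class-level bias discussed above.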

Co-advise: Cross Inductive Bias Distillation

First we train a teacher model (EfficientNet-B4). Once the teacher model is trained, its knowledge is transferred to a student model that is trained on the target domain. The teacher model's predictions on the source domains are used as soft targets for training the student model on the target domain. The idea is that the teacher model's knowledge of the common features across the source domains will help the student model learn features that are more transferable to the target domain, even if the target domain has different biases and distributions than the source domains.
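As a concrete illustration of the soft-target transfer described above, here is a minimal NumPy sketch of a standard distillation loss (temperature-scaled softmax plus KL divergence, following Hinton et al.'s formulation). The temperature and logits are illustrative values, not taken from this repository:

```python
import numpy as np

def softmax(z, T=1.0):
    """Temperature-scaled softmax; higher T yields softer distributions."""
    z = np.asarray(z, dtype=float) / T
    z = z - z.max()          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T=4.0):
    """KL(teacher || student) on temperature-softened outputs.

    Scaled by T^2 so gradient magnitudes stay comparable as T varies
    (the convention from the original distillation formulation).
    """
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    return T * T * np.sum(p_t * (np.log(p_t) - np.log(p_s)))

teacher = [6.0, 1.0, 0.5]     # teacher is confident in class 0
student = [2.0, 1.5, 1.0]     # student is less certain
loss = distillation_loss(student, teacher)
print(round(loss, 4))         # positive; shrinks as the student matches the teacher
```

In practice this term is combined with the usual cross-entropy on the hard labels, weighted by a mixing coefficient.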

Results of teacher model (EfficientNet-B4) trained on CIFAR-10:

[Figure: classification results of the teacher model (EfficientNet-B4) on CIFAR-10]

Performance of student model (EfficientNet-B0) after knowledge distillation:

[Figure: classification results of the student model (EfficientNet-B0) after knowledge distillation]



Disparate Impact (DI) of original model: 0.9376
Disparate Impact (DI) after bias mitigation: 0.9669
A DI closer to 1 corresponds to lower bias, so the increase from 0.9376 to 0.9669 shows that the distillation step reduced bias and the mitigation was successful.
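Disparate Impact is conventionally computed as the ratio of favourable-outcome rates between an unprivileged and a privileged group. The sketch below shows the computation on made-up predictions and group labels (the data and group split are hypothetical, not the repository's actual evaluation):

```python
import numpy as np

def disparate_impact(y_pred, group, favorable=1, unprivileged=0):
    """DI = P(favorable | unprivileged) / P(favorable | privileged)."""
    y_pred = np.asarray(y_pred)
    group = np.asarray(group)
    rate_unpriv = np.mean(y_pred[group == unprivileged] == favorable)
    rate_priv = np.mean(y_pred[group != unprivileged] == favorable)
    return rate_unpriv / rate_priv

# Illustrative predictions and group membership.
y_pred = [1, 0, 1, 0, 1, 1, 1, 1]
group  = [0, 0, 0, 0, 1, 1, 1, 1]
print(round(disparate_impact(y_pred, group), 4))  # 0.5 here; 1.0 would mean parity
```

A value of 1.0 means both groups receive the favourable outcome at the same rate, which is why the movement from 0.9376 towards 1 (0.9669) indicates reduced bias.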

The bias mitigation technique suggested in the paper “Co-advise: Cross Inductive Bias Distillation” works best for our case. EfficientNet-B0 is a strong baseline, reaching 0.9142 test accuracy on CIFAR-10, and the distillation procedure described above — training an EfficientNet-B4 teacher and using its predictions as soft targets for the EfficientNet-B0 student — improved the Disparate Impact from 0.9376 to 0.9669.

About

Use a teacher model with minimal bias, or calibrate its outputs to reduce bias transfer during distillation. Focus on distilling information from teacher features that are less prone to bias, such as semantic representations rather than raw activations.
