Seungjae Shin, Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation, Master's Thesis, Department of Industrial and Systems Engineering, KAIST, 2020
- File
- MS_Thesis_SJ_Shin_Final.pdf (6.6M) 35회 다운로드 DATE : 2023-11-07 14:22:29
Seungjae Shin, Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation, Master's Thesis, Department of Industrial and Systems Engineering, KAIST, 2020
Abstract
Recent researches demonstrate that word embeddings, trained on the human-generated corpus, have strong gender biases in embedding spaces, and these biases can result in the prejudiced results from the downstream tasks, i.e. sentiment analysis. Whereas the previous debiasing models project word embeddings into a linear subspace, we introduce a Latent Disentangling model with a siamese auto-encoder structure and a gradient reversal layer. Our siamese auto-encoder utilizes gender word pairs to disentangle semantics and gender information of given word, and the associated gradient reversal layer provides the negative gradient to distinguish the semantics from the gender. Afterwards, we introduce a Counterfactual Generation model to modify the gender information of words, so the original and the modified embeddings can produce a gender-neutralized word embedding after geometric alignment without loss of semantic information. Experimental results quantitatively and qualitatively indicate that the introduced method is better in debiasing word embeddings in WEAT hypothesis test and Sembias analogy test, and in minimizing the semantic information losses for NLP downstream tasks.
@masterthesis{Shin:2020,
author = {Seungjae Shin},
advisor ={Il-Chul Moon},
title = {Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation},
school = {KAIST},
year = {2020}
}
- PreviousWeonyoung Joo, Pathwise Gradient Estimators for Various Probability Distributions in Deep Generative Models, PhD Dissertation, Department of Industrial and Systems Engineering, KAIST, 2020
- NextMinjae Jung, Air Combat Basic System Model for Learning Engagement, Master's Thesis, Department of Industrial and Systems Engineering, KAIST, 2020