Identifying Duplicate Quora Question Pairs (Kaggle Competition Bronze Medal Winner)
- We explored current NLP methods, including word2vec embeddings (gensim package in Python), LSTMs (via the Keras neural networks API), TF-IDF, and the Python NLTK package.
- We built machine learning models that identify duplicate Quora question pairs, reaching a log loss of ~0.151 (lower is better).
- We ranked in the top 8% in this Kaggle …
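
To make the TF-IDF feature and the log loss metric mentioned above concrete, here is a minimal sketch in plain Python. It is not the competition code: the toy question pairs, the function names, and the choice to use raw cosine similarity as a crude duplicate probability are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of token -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each token appears.
    doc_freq = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed inverse document frequency (an illustrative choice).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in doc_freq.items()}
    return [{t: tf * idf[t] for t, tf in Counter(toks).items()} for toks in tokenized]


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def log_loss(labels, probs, eps=1e-15):
    """Mean binary cross-entropy; probabilities are clipped away from 0 and 1."""
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(labels)


# Hypothetical toy data: (question 1, question 2, is_duplicate).
pairs = [
    ("how do i learn python", "how can i learn python", 1),
    ("what is the capital of france", "how do i cook rice", 0),
]

questions = [q for q1, q2, _ in pairs for q in (q1, q2)]
vectors = tfidf_vectors(questions)
similarities = [cosine(vectors[2 * i], vectors[2 * i + 1]) for i in range(len(pairs))]
labels = [label for _, _, label in pairs]

# Treat the similarity as a rough duplicate probability and score it.
loss = log_loss(labels, similarities)
print(similarities, loss)
```

For reference, a model that predicts 0.5 for every pair scores a log loss of ln 2 ≈ 0.693, so lower values indicate better-calibrated predictions. In practice a similarity score like this would be one input feature to a trained classifier rather than the prediction itself.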