⚠️ NoteThis project was originally developed in 2019. Some libraries, APIs, and functions used in the code may now be deprecated or updated.
It is recommended to use this repository primarily to understand the overall workflow, feature engineering techniques, and modeling approach.
For practical use, please update the dependencies and modify deprecated functions to align with the latest versions of the libraries.
The following NLP techniques are used for feature extraction:
- Bag of Words (BoW)
- TF-IDF
- Word Embeddings
- Word2Vec
- Doc2Vec
📌 Each of the below classification algorithms has been trained and evaluated on all of the above NLP techniques.
- Logistic Regression
- Support Vector Machine (SVM)
- Random Forest
- Extreme Gradient Boosting (XGBoost)
- Light Gradient Boosting (LightGBM)