This project compares traditional machine learning, basic neural networks, and transformer-based deep learning models for binary fake-news detection.
Dataset: davanstrien/WELFake from Hugging Face.
Task: classify news articles as fake or real.
- TF-IDF + Logistic Regression
- Embedding MLP
- BiLSTM
- DistilBERT fine-tuning
Primary metric: macro F1-score.
Additional metrics: accuracy, precision, recall, ROC-AUC, confusion matrix.
The notebook compares body-only input against title+body input using the same TF-IDF Logistic Regression setup.
- Open the notebook in Google Colab.
- Select GPU runtime.
- Run all cells from top to bottom.
- Adjust
MAX_ROWS,EPOCHS_NN, andEPOCHS_TRANSFORMERfor final experiments.
Random seed: 42.
.
├── COMP_690_AH2_Final_Draft.ipynb
├── README.md
├── requirements.txt
This model should be used only as a decision-support or triage tool. It should not be treated as an automatic truth authority.