viralscope
Term project for CS513 - Knowledge Discovery and Data Mining (Fall 2025)
GitHub: engindenizdogu/viralscope (viral video predictor for YouTube videos)
viralscope is a machine learning project that predicts YouTube video success using metadata and channel characteristics. The project processes 85+ million video records from the YouNiverse dataset to identify patterns that distinguish successful videos from others.
Key Features:
- End-to-end ML pipeline from raw data to trained models
- Data preparation with random sampling and engagement filtering
- Feature engineering with train/test split and scaling
- Multiple classification models with hyperparameter tuning
- Designed to prevent data leakage: labels are created only after the train/test split
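The leakage-prevention point above can be illustrated with a minimal sketch. It is not the project's actual code: the column names, the synthetic data, and the "top 10% by view count" success rule are all assumptions made for illustration. The idea it shows is that the split happens first, and both the label threshold and the scaler are derived from the training rows only.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for video metadata; all column names are hypothetical.
rng = np.random.default_rng(0)
n_rows = 1000
videos = pd.DataFrame({
    "duration_sec": rng.integers(30, 3600, size=n_rows),
    "title_length": rng.integers(5, 100, size=n_rows),
    "channel_subscribers": rng.integers(100, 1_000_000, size=n_rows),
    "view_count": rng.lognormal(mean=8.0, sigma=2.0, size=n_rows),
})
feature_cols = ["duration_sec", "title_length", "channel_subscribers"]

# 1. Split first, before any label exists.
train_df, test_df = train_test_split(videos, test_size=0.2, random_state=42)

# 2. Derive the success threshold from the training rows only
#    (assumed rule: a video is "successful" if it is in the top 10% by views).
threshold = train_df["view_count"].quantile(0.90)
y_train = (train_df["view_count"] >= threshold).astype(int)
y_test = (test_df["view_count"] >= threshold).astype(int)

# 3. Fit the scaler on the training features, then reuse it on the test features.
scaler = StandardScaler()
X_train = scaler.fit_transform(train_df[feature_cols])
X_test = scaler.transform(test_df[feature_cols])
```

Because the threshold and the scaling statistics never see the test rows, the test set behaves like unseen data when the models are evaluated.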
Models tuned with GridSearchCV:
- Random Forest Classifier
- Decision Tree Classifier
- Linear SVC
- K-Nearest Neighbors
- Multi-Layer Perceptron
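As a rough sketch of how the five models listed above could be tuned with scikit-learn's GridSearchCV, the example below runs a small grid search per model on synthetic data. The parameter grids, the F1 scoring choice, the 3-fold setting, and the dataset are illustrative assumptions, not the project's actual configuration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import LinearSVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic binary classification data standing in for the prepared features.
X, y = make_classification(
    n_samples=400, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# One (estimator, parameter grid) pair per model; the grids are hypothetical.
search_space = {
    "random_forest": (
        RandomForestClassifier(random_state=0),
        {"n_estimators": [50, 100], "max_depth": [None, 10]},
    ),
    "decision_tree": (
        DecisionTreeClassifier(random_state=0),
        {"max_depth": [3, 5, None], "min_samples_leaf": [1, 5]},
    ),
    "linear_svc": (
        LinearSVC(max_iter=5000),
        {"C": [0.1, 1.0, 10.0]},
    ),
    "knn": (
        KNeighborsClassifier(),
        {"n_neighbors": [3, 5, 11], "weights": ["uniform", "distance"]},
    ),
    "mlp": (
        MLPClassifier(max_iter=500, random_state=0),
        {"hidden_layer_sizes": [(32,), (64,)], "alpha": [1e-4, 1e-2]},
    ),
}

results = {}
for name, (estimator, grid) in search_space.items():
    # Cross-validated search on the training split only.
    search = GridSearchCV(estimator, grid, cv=3, scoring="f1")
    search.fit(X_train, y_train)
    results[name] = {
        "best_params": search.best_params_,
        "cv_f1": search.best_score_,
        # score() applies the same F1 scorer to the held-out test split.
        "test_f1": search.score(X_test, y_test),
    }

for name, summary in results.items():
    print(name, summary["best_params"], round(summary["cv_f1"], 3))
```

Keeping the grid search inside the training split means the test split is used only once per model, for the final score.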