Automated MLOps pipeline for collecting graphics card sales data, preprocessing it, and training an XGBoost prediction model.
This pipeline automates:
- Data Collection: Queries the API every minute for sales data on five card models (rtx3060, rtx3070, rtx3080, rtx3090, rx6700)
- Preprocessing: Cleans the raw data and transforms it into features
- Model Training: Trains an XGBoost model for sales prediction
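The first two stages listed above can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions, not the project's actual code: the `fetch` callable, the `sales` field, and the cleaning rule are all hypothetical, and the real API response format is not described here. The training stage is left out so the sketch runs without XGBoost installed.

```python
from typing import Callable, Iterable

# Card models named in the list above.
MODELS = ["rtx3060", "rtx3070", "rtx3080", "rtx3090", "rx6700"]


def collect(fetch: Callable[[str], dict], models: Iterable[str] = MODELS) -> list:
    """Call `fetch` once per model and tag each record with its model name.

    `fetch` is a hypothetical stand-in for the real API client.
    """
    rows = []
    for model in models:
        record = dict(fetch(model))  # copy so the caller's dict is not mutated
        record["model"] = model
        rows.append(record)
    return rows


def preprocess(rows: list) -> list:
    """Keep only records whose (assumed) `sales` field is a non-negative integer."""
    cleaned = []
    for row in rows:
        try:
            sales = int(row["sales"])
        except (KeyError, TypeError, ValueError):
            continue  # missing or malformed value: drop the record
        if sales < 0:
            continue
        cleaned.append({"model": row["model"], "sales": sales})
    return cleaned


def fake_fetch(model: str) -> dict:
    """Hypothetical fetcher used only for this demonstration."""
    return {"sales": None} if model == "rx6700" else {"sales": "12"}


rows = preprocess(collect(fake_fetch))
# A training step would consume `rows` here; it is omitted from this sketch.
```

Passing the fetcher in as an argument keeps the collection logic testable without a running API.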
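Set up a virtual environment and install the dependencies:
python3 -m venv .venv
source .venv/bin/activate
pip install uv
uv sync
Extract the API binary and start it in the background:
tar -xvf resources/api.tar
chmod +x api
./api &
Install the cron jobs (this replaces the current user's existing crontab), then verify and monitor them:
crontab scripts/cron.txt
crontab -l  # list all cron jobs to verify installation
tail -f logs/cron.log  # monitor cron jobs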
make bash
make tests
Test logs are saved in logs/tests_logs/.
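To look at the latest test log without browsing the directory by hand, a small helper like the following could be used. It is a minimal sketch: the function name `newest_log` is hypothetical, and it assumes only that logs are regular files directly inside logs/tests_logs/.

```python
from pathlib import Path
from typing import Optional


def newest_log(log_dir: str = "logs/tests_logs") -> Optional[Path]:
    """Return the most recently modified file in `log_dir`, or None if there is none."""
    directory = Path(log_dir)
    if not directory.is_dir():
        return None
    files = [p for p in directory.iterdir() if p.is_file()]
    # `default=None` covers an empty directory.
    return max(files, key=lambda p: p.stat().st_mtime, default=None)
```

Calling `newest_log()` from the repository root returns a `Path` that can be opened or passed to `tail`.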
Install the development dependencies (the dev extra in pyproject.toml):
uv pip install -r pyproject.toml --extra dev
Note: "tests_logs" in the requirements vs. the "tests_logs" directory created automatically by the pytest files...