OptiSys — Predictive System Resource Optimization

AI/ML pipeline for cloud cluster resource intelligence: workload clustering, multi-model utilization forecasting, exhaustion detection, and optimization recommendations. Inspired by Google Cluster Trace patterns (CPU/memory utilization, machine events, task events).

Pipeline

Simulated cluster traces
        ↓
Workload clustering (K-Means + DBSCAN anomalies)
        ↓
Resource usage prediction (4 models)
        ↓
CPU / memory exhaustion forecast
        ↓
Optimization suggestions (rightsizing, migration, savings)
        ↓
Streamlit dashboard

Project structure

.
├── app.py                 # Streamlit dashboard (ClusterMind)
├── smoke_test.py          # End-to-end pipeline verification
├── requirements.txt
├── data/                  # Generated CSV traces (gitignored; created on first run)
└── src/
    ├── data_generator.py  # Google-style cluster trace simulator
    ├── clustering.py      # K-Means + DBSCAN workload clustering
    ├── prediction.py      # RF, XGBoost, LSTM, Cluster+XGBoost
    └── optimizer.py       # Alerts, rightsizing, migration, savings

Requirements

Python 3.10+
See requirements.txt for dependencies

Optional: install TensorFlow for a true LSTM backend (not bundled by default on Python 3.13).

Setup

git clone <repo-url>
cd <project-folder>

python -m venv .venv
# Windows
.venv\Scripts\activate
# macOS / Linux
source .venv/bin/activate

pip install -r requirements.txt

Run

Dashboard

python -m streamlit run app.py

Open the URL shown in the terminal (default: http://localhost:8501).

Verify the full pipeline

python smoke_test.py

Run individual modules

From the project root:

python -m src.data_generator   # regenerate trace data
python -m src.clustering
python -m src.prediction
python -m src.optimizer

On first run, trace CSVs are generated under data/ (~12 MB for task-level metrics). Use Regenerate Data in the dashboard sidebar or force_regenerate=True in code to rebuild them.

Dashboard tabs

Cluster Live Feed — CPU heatmap, per-machine CPU/memory charts, machine summary
Workload Analysis — cluster scatter, distribution, DBSCAN anomaly detection
Predictive Models — model metrics (MAE, RMSE, R²), actual vs predicted, future forecast
Optimizer — exhaustion alerts, rightsizing, migration suggestions, forecast gauges

Tech stack

Layer	Tools
Data processing	Python, Pandas, NumPy
ML	Scikit-learn, XGBoost
Clustering	K-Means, DBSCAN
Time series (optional)	LSTM (TensorFlow/Keras)
Visualization	Plotly, Matplotlib
Dashboard	Streamlit

Author

Made by KAVYA RAJ

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OptiSys — Predictive System Resource Optimization

Pipeline

Project structure

Requirements

Setup

Run

Dashboard

Verify the full pipeline

Run individual modules

Dashboard tabs

Tech stack

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
src		src
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
smoke_test.py		smoke_test.py

Folders and files

Latest commit

History

Repository files navigation

OptiSys — Predictive System Resource Optimization

Pipeline

Project structure

Requirements

Setup

Run

Dashboard

Verify the full pipeline

Run individual modules

Dashboard tabs

Tech stack

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages