Baseline Classifier Benchmark

TL;DR: Compares Logistic Regression and Random Forest on a small dataset (Iris), reporting accuracy, ROC-AUC, and a confusion matrix for each model.

What’s inside

Load the Iris dataset from scikit-learn
Train two classifiers: Logistic Regression and Random Forest
Evaluate using Accuracy, ROC-AUC, and Confusion Matrix
PM-style commentary on tradeoffs
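
The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the 70/30 stratified split, the `random_state=42` seed, `max_iter=1000`, and `n_estimators=100` are all assumptions chosen for the example. Because Iris has three classes, the sketch computes ROC-AUC with one-vs-rest averaging; the notebook may compute it differently.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix, roc_auc_score
from sklearn.model_selection import train_test_split

# Load the 150-sample, 3-class Iris dataset.
X, y = load_iris(return_X_y=True)

# Assumed split: 70/30, stratified, fixed seed (not taken from the notebook).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42
)

# Hyperparameters here are illustrative assumptions.
models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    y_proba = model.predict_proba(X_test)
    results[name] = {
        "accuracy": accuracy_score(y_test, y_pred),
        # Multiclass ROC-AUC: one-vs-rest, macro-averaged over the classes.
        "roc_auc": roc_auc_score(y_test, y_proba, multi_class="ovr"),
        "confusion_matrix": confusion_matrix(y_test, y_pred),
    }

for name, r in results.items():
    print(f"{name}: accuracy={r['accuracy']:.3f}, roc_auc={r['roc_auc']:.3f}")
    print(r["confusion_matrix"])
```

Scores from this sketch will vary with the split and seed, so they may not match the figures reported under Results.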

How to run

Open the notebook (GH_baseline_classifiers.ipynb) in Google Colab and run the cells from top to bottom.

Results

Logistic Regression accuracy: ~95%
Random Forest accuracy: ~97%
Random Forest scores about two points higher, but Logistic Regression is simpler and faster to train.
Both models separate the three classes well. Random Forest can also fit nonlinear decision boundaries, whereas Logistic Regression on the raw features is limited to linear ones.

Screenshot

Next ideas

Try more classifiers (SVM, KNN, Gradient Boosting)
Add cross-validation instead of a single train/test split
Plot feature importance from Random Forest
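
A possible starting point for these ideas is sketched below. It is an assumption-laden example rather than part of the notebook: the 5-fold stratified scheme, the seed, and every hyperparameter are illustrative choices. It prints the Random Forest feature importances as a ranked list instead of plotting them; passing the same values to a bar chart would be the natural next step.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

iris = load_iris()
X, y = iris.data, iris.target

# Candidate models; all hyperparameters are illustrative assumptions.
candidates = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "SVM": SVC(),
    "KNN": KNeighborsClassifier(n_neighbors=5),
    "Gradient Boosting": GradientBoostingClassifier(random_state=42),
}

# 5-fold stratified cross-validation instead of a single train/test split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
cv_scores = {}
for name, model in candidates.items():
    scores = cross_val_score(model, X, y, cv=cv, scoring="accuracy")
    cv_scores[name] = (scores.mean(), scores.std())
    print(f"{name}: {scores.mean():.3f} +/- {scores.std():.3f}")

# Impurity-based feature importances from a Random Forest fit on all data.
forest = RandomForestClassifier(n_estimators=100, random_state=42).fit(X, y)
importances = sorted(
    zip(iris.feature_names, forest.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for feature, importance in importances:
    print(f"{feature}: {importance:.3f}")
```

Reporting the mean and standard deviation across folds gives a sense of how much a difference of a couple of points between models could be due to the split alone.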
