Skip to content

BioSam-Data/capstone-project

Repository files navigation

Capstone Project – Exploratory Data Analysis and Machine Learning

Author: Abiodun Samuel Akozo-Emiri
Program: Full Stack Data Science Certification
Tools Used: Python, Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, Jupyter Notebook

πŸ“˜ Overview

This capstone project focuses on solving a real-world data challenge using end-to-end data science methodology β€” from data exploration to model building and evaluation. The goal was to extract insights, visualize trends, and build predictive models to guide decisions.

🧰 Project Features

  • βœ” Exploratory Data Analysis (EDA)
  • βœ” Data cleaning & preprocessing
  • βœ” Visualizations using Matplotlib and Seaborn
  • βœ” Correlation analysis
  • βœ” Model building (Linear Regression, Decision Tree, etc.)
  • βœ” Model evaluation (RMSE, RΒ² Score)

πŸ“ File Structure

capstone-project/ β”œβ”€β”€ capstone_notebook.ipynb # Main Jupyter notebook β”œβ”€β”€ data/ # Raw datasets (CSV, Excel, etc.) β”œβ”€β”€ images/ # Charts and visuals β”œβ”€β”€ README.md # Project description and guide

🧠 Key Learnings

  • Data storytelling through EDA
  • Real-world model interpretation
  • Python-based data science workflow
  • Git and version control for project delivery

πŸš€ How to Run

  1. Clone the repo
  2. Open capstone_notebook.ipynb in Jupyter
  3. Follow step-by-step from data loading to final evaluation

πŸ“ž Contact

πŸ“§ biosam04@gmail.com
πŸ”— LinkedIn


About

Exploratory Data Analysis on the Palmer's Penguins Dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published