data-analysis


Project maintained by TokarskiPatryk Hosted on GitHub Pages — Theme by mattgraham

Data Analysis Portfolio

A collection of data analysis and machine learning projects.


Metacritic Best Games Analysis

EDA of the highest-rated video games on Metacritic — all-time top 100 and the top 1,000 games from 2013–2022. Covers score distributions, critic vs user ratings, and platform trends over time. Data scraped with rvest.
R tidyverse rvest gtsummary


Heart Attack Prediction

Classification model predicting heart attack occurrence based on clinical measurements (age, heart rate, blood pressure, CK-MB, troponin) collected during ambulance transport. Includes EDA and comparison of model features against clinical reference values.
R tidyverse plotly corrplot


Titanic Survival Prediction

Logistic regression model predicting passenger survival on the Titanic. Explores survival rates by class, sex and age, including a t-test confirming statistically significant age difference between survivors and non-survivors.
R tidyverse caret


BigBasket Product Analysis

Exploratory analysis of ~28,000 products from BigBasket, an Indian online grocery platform. Covers category share, rating distribution, and price/discount patterns across brands and categories.
R tidyverse


Aircraft Detection with YOLOv8

Fine-tuned YOLOv8 to detect 20 types of aircraft in satellite imagery. Trained and compared 4 model variants (YOLOv8n/m × frozen/unfrozen layers) on 3,821 images over 100 epochs using a Kaggle P100 GPU.
Python ultralytics YOLOv8


Time Series Analysis

Course project covering time series modelling and forecasting methods in R.
R


Car Counter

Real-time vehicle detection and counting from an RTSP camera stream using object detection models (YOLOv3 via ImageAI, YOLOv8 via ultralytics).
Python ultralytics OpenCV