Projects

A deeper look at the data science, machine learning, and software engineering projects I've built.

Prop Trading Dashboard

Featured
View Code

Overview

A desktop analytics platform designed for proprietary trading firms. Ingests raw trade CSV exports, aggregates fills using interval overlap analysis, and computes per-day statistics including expectancy, profit factor, drawdown, and Sharpe ratio. Models firm-specific payout eligibility rules in real time so traders can track their path to funded accounts.

Methodology

  • Interval overlap algorithm for aggregating partial fills into complete trades
  • Rolling window statistics for drawdown detection and streak analysis
  • Rule engine modeling payout structures from multiple prop firms (FTMO, MFF, TFT)
  • SQLite for persistent local storage with migration support
  • React component architecture with Recharts for interactive P&L curves

Key Results

< 200ms CSV parse + aggregate (10K trades)
3 firms Payout rule engines modeled
12+ Computed trading statistics
TypeScript React Electron SQLite Recharts Vite

Falcon 9 Landing Prediction

View Code

Overview

End-to-end classification project predicting whether SpaceX Falcon 9 first-stage boosters will successfully land. The pipeline spans data collection via SpaceX REST API, exploratory data analysis, feature engineering, SQL-based analysis, and an interactive Plotly Dash dashboard for stakeholder exploration.

Methodology

  • Data collection from SpaceX API and web scraping Wikipedia launch records
  • Feature engineering: orbit type, launch site, payload mass, booster version, flight number
  • SQL analysis for aggregation and filtering on launch site success rates
  • Model comparison: Logistic Regression, SVM, Decision Tree, KNN with GridSearchCV
  • Interactive Plotly Dash dashboard with launch site maps and success rate visualizations

Key Results

83.3% Best model accuracy (Decision Tree)
4 models Compared with cross-validation
Interactive Plotly Dash dashboard
Python scikit-learn Plotly Dash SQL Pandas Folium

Histopathologic Cancer Detection

View Code

Overview

CNN-based medical image classifier for identifying metastatic cancer in 96x96 pixel histopathologic tissue scans. Built and compared multiple deep learning architectures including custom CNNs and transfer learning approaches, with extensive data augmentation to handle class imbalance in the 220K+ image dataset.

Methodology

  • Custom CNN architectures with batch normalization and dropout regularization
  • Transfer learning with pre-trained models (VGG16, ResNet50)
  • Data augmentation: rotation, flipping, zoom, brightness adjustment
  • Class-weighted loss function to address label imbalance
  • Learning rate scheduling with early stopping for optimal convergence

Key Results

84.8% Test accuracy
0.7961 Kaggle AUC score
220K+ Training images processed
Python TensorFlow CNN Computer Vision Transfer Learning

NLP Disaster Tweets

View Code

Overview

Binary text classification challenge distinguishing real disaster tweets from non-disaster tweets. Explored multiple NLP approaches from traditional TF-IDF baselines to deep learning architectures including bidirectional GRU and LSTM networks with pre-trained GloVe embeddings.

Methodology

  • Text preprocessing: lowercasing, URL/mention removal, lemmatization, stopword filtering
  • TF-IDF + Logistic Regression baseline for comparison
  • GloVe 100d embeddings with BiGRU and BiLSTM architectures
  • Spatial dropout and recurrent dropout for regularization
  • Threshold tuning on validation set for optimal F1 score

Key Results

Rank 377 Kaggle leaderboard
BiGRU Best performing architecture
GloVe Pre-trained embeddings
Python TensorFlow NLP GloVe BiGRU BiLSTM

Monet Style Transfer

View Code

Overview

Photo-to-painting transformation project comparing three generative approaches: DCGAN, CycleGAN, and Neural Style Transfer. Trained models to produce Monet-style artwork from photographs, exploring the strengths and limitations of each architecture for artistic style transfer.

Methodology

  • DCGAN baseline: generator/discriminator pair for unconditional Monet generation
  • CycleGAN: unpaired image-to-image translation with cycle consistency loss
  • Neural Style Transfer: VGG19 feature matching for content/style separation
  • Instance normalization and spectral normalization for training stability
  • MiFID (Memorization-informed Fréchet Inception Distance) for evaluation

Key Results

Rank 20 Kaggle leaderboard
53.14 MiFID score
3 models Architectures compared
Python TensorFlow GANs CycleGAN Computer Vision Neural Style Transfer

Rotten Tomatoes Predictor

View Code

Overview

Predicting Rotten Tomatoes critic scores from raw screenplay text using a fusion architecture that combines BERT transformer embeddings with handcrafted numeric features. The model learns semantic patterns in movie review language and maps them to continuous score predictions.

Methodology

  • BERT-base tokenization and fine-tuning for domain-specific text understanding
  • Fusion architecture: BERT [CLS] embeddings concatenated with numeric features
  • Multi-layer regression head with dropout and batch normalization
  • Cosine annealing learning rate schedule with warmup
  • Evaluation: MAE, RMSE, and R² on held-out test set

Key Results

BERT Transformer backbone
Fusion Text + numeric architecture
PyTorch Training framework
Python PyTorch BERT Transformers NLP HuggingFace

Maevie Project Manager

View Code

Overview

Full-stack project management application built for an interior design company. Features task tracking with drag-and-drop Kanban boards, client management, project timelines, and team coordination. Designed for small teams managing multiple concurrent design projects.

Methodology

  • React frontend with component-based architecture and React Router
  • Node.js/Express REST API with JWT authentication
  • PostgreSQL database with normalized schema for projects, tasks, and users
  • Role-based access control for admin, designer, and client views
  • Responsive design optimized for both desktop and tablet use

Key Results

Full-Stack End-to-end application
REST API JWT-authenticated endpoints
RBAC Role-based access control
React Node.js Express PostgreSQL JWT REST API