Sehej Singh

Data Science & Machine Learning Engineer

Explore My Work

About Me

Sehej Singh

Sehej Singh

Master's in Data Science
University of San Francisco

Background & Education

Born and raised in Sacramento, California, I discovered my passion for data science and machine learning during my undergraduate studies at Loyola Marymount University. I recently graduated with my Master's in Data Science from USF, where I dove deep into advanced ML techniques, statistical modeling, and AI applications.

Professional Journey

My experience spans from developing RAG-based AI chatbots at ArsLab.AI to building mortgage analytics models at Buyer Folio, and creating cryptocurrency market prediction systems at 7VFI.AI. I specialize in turning complex data into actionable insights and building scalable ML solutions.

Beyond Data Science

When I'm not coding or analyzing data, you'll find me on the basketball court, at the gym, playing chess, or dancing Bhangra. I'm also a food enthusiast always exploring new cuisines and flavors!

Python
Machine Learning
Deep Learning
NLP
Computer Vision
PyTorch
LangChain
Docker
Kubernetes
GCP
AWS
Apache Spark
FastAPI
SQL

Featured Projects

FitAI Workout Buddy

End-to-end fitness application leveraging computer vision for real-time workout analysis and form correction. Built with React frontend, FastAPI backend, and containerized deployment.

Computer Vision React FastAPI Docker Kubernetes

RAG Chatbot for MSDS

AI chatbot for USF's MSDS program using Retrieval Augmented Generation. Achieved 90%+ retrieval precision and reduced administrative response time by 15 hours weekly.

RAG LangChain LlamaIndex NLP GCP

Fake News ML Pipeline

End-to-end fake news classification system with Apache Airflow orchestration, GCP infrastructure, and distributed processing using Apache Spark.

Apache Airflow Apache Spark GCP MongoDB ML Pipeline

Lo-Fi Mood Generator 🎵

4th Place at Lo-Fi Hack Hackathon (SF) - A local-first AI-powered website that generates mood-based images and AI-generated music to create the perfect ambiance. Built in 24 hours using React, Stable Diffusion API, and cosine similarity for music matching.

React Vite Stable Diffusion AI Music Selection Cosine Similarity ChromaDB

NBA Win Predictions

Machine learning system for predicting NBA game outcomes using advanced statistical modeling, player performance metrics, and team analytics to forecast win probabilities.

Sports Analytics ML Prediction Statistical Modeling Data Analysis Python

Experience & Education

Data Scientist

ArsLab.AI

Jan 2025 - June 2025

Developing AI chatbot for USF's MSDS program using RAG. Improved retrieval precision from 60% to 90%+ and reduced response time by 15 hours weekly. Containerizing with Docker and deploying on GCP.

Data Scientist Intern

Buyer Folio

Oct 2024 - Jan 2025

Built ML models for credit score prediction and mortgage prequalification, improving accuracy by 15%. Engineered synthetic data pipelines and reduced inference time by 20%.

Master of Science, Data Science

University of San Francisco

July 2024 - June 2025

Advanced coursework in machine learning, statistical modeling, and AI applications. Focus on real-world data science projects and industry applications.

Data Scientist

7VFI.AI

May 2023 - Mar 2024

Developed cryptocurrency market analysis tools and ML forecasting models with 85% accuracy. Created web scraping systems for 50+ networks, resulting in $60K+ portfolio returns.

Bachelor of Science, ISBA

Loyola Marymount University

Aug 2019 - May 2023

Information Systems and Business Analytics degree with focus on data analysis, machine learning fundamentals, and business intelligence.

Let's Connect

Ready to collaborate on exciting data science projects or discuss opportunities? Let's build something amazing together!