Harshmanpreet Singh
Harshmanpreet Singh

Harshmanpreet Singh

Data Engineer & Software Developer

Edmonton, AB

Explore My Work

About Me

Bachelor of Science in Computing Science (Class of 2026) at the University of Alberta, with experience in data engineering at Disney Streaming and research in machine learning for industrial simulation and pipelines. I build end-to-end data systems, video delivery stacks, and intelligent document/RAG tools—focusing on reliability, clear interfaces, and measurable performance.

Education

University of Alberta

Bachelor of Science — Computing Science

Class of 2026 | GPA: 3.7/4.0

Edmonton, AB

Cumulative GPA: 3.7/4.0. Certifications: AWS Certified Solutions Architect – Associate; Google Cloud Associate Cloud Engineer.

Work Experience

Research Assistant

Soft Matter & Interfaces Research Group — University of Alberta

Edmonton, AB

May 2025Sept 2025

Applied machine learning to chemical-engineering and industrial pipeline problems, from exploratory analysis through validated surrogate models and technical reporting.

Key Achievements

  • Collaborated with post-doctoral researchers to design ML surrogate models that reduced computational time for complex CFD simulations.
  • Led work with proprietary Imperial Oil operational data to develop predictive ANN models for slurry waste pipeline pressure drop under varying flow and material conditions.
  • Built end-to-end data pipelines: cleaning, feature engineering from sensor and experimental data, model training/validation, and hyperparameter tuning for stable predictions.
  • Presented results in technical reports and visualizations, informing research decisions and discussions on industrial deployment of ML surrogates.
  • Performed exploratory data analysis on high-dimensional sensor data to support feature selection for downstream models.
  • Evaluated generalization on unseen conditions using cross-validation and held-out test sets to support industrial decision-making.

Data Engineer

Disney Streaming

New York, NY

Sept 2024May 2025

Owned cross-system data migration delivery with engineering, product, and analytics partners—focusing on integrity, performance, and safe cutover.

Key Achievements

  • Led planning, execution, and validation of cross-system data migrations with product, infrastructure, and analytics teams to minimize downtime and defects.
  • Built automated ETL validation suites (schema checks, row-level reconciliations, data-quality alerts) for consistency across legacy and target platforms.
  • Optimized batch and streaming pipelines after migration (partitioning, indexing, concurrency), improving end-to-end performance by ~30% and stabilizing SLAs.
  • Authored migration runbooks, technical documentation, and architecture diagrams to accelerate onboarding and de-risk later migration waves.
  • Partnered with upstream producers and downstream consumers to define and validate schema contracts, reducing breaking changes at migration cutover.

Projects

StreamVault (Mini OTT video streaming platform)

ST

StreamVault (Mini OTT video streaming platform)

2026

End-to-end OTT stack: FFmpeg adaptive HLS with AES-128, FastAPI manifest and DRM key flow, PostgreSQL, S3/CloudFront-style delivery, and a React + Shaka Player client.

FastAPIFFmpegHLSPostgreSQLAWS S3ReactTypeScriptShaka Player
  • Built a full OTT pipeline with FFmpeg: three adaptive HLS renditions (1080p/720p/480p, 6s segments), AES-128 encryption, and a FastAPI key-delivery path modeled on Widevine/FairPlay-style DRM flows.
  • Implemented manifest and content APIs (FastAPI, asyncpg, PostgreSQL) with signed CloudFront URLs for segments, mirroring production CDN usage (e.g. Crave, Disney+).
  • Shipped a React + TypeScript UI with Shaka Player: ABR playback, custom DRM key URI handling, and an S3-backed segment proxy—ingest through browser playback.
DocSage (Intelligent document processing & analytics)

DO

DocSage (Intelligent document processing & analytics)

2026

Local zero-cost “insight agent” over an IDP pipeline: PDF ingest, OCR, structured extraction, RAG, and tool-using LLM for SQL and analytics—FastAPI + Streamlit, Docker, PostgreSQL.

PythonFastAPIPostgreSQLFAISSOllamaDockerStreamlitRAG
  • Ingests PDFs, runs OCR, extracts structured fields, and persists to a relational database.
  • RAG with sentence-transformer embeddings and FAISS to ground answers in document text.
  • Tool-using LLM (SQL, metrics, RAG) for questions such as invoice aging and month-over-month trends from real DB queries.
  • Exposed via FastAPI and Streamlit; docker-compose with PostgreSQL to mirror Textract + RDS + OpenSearch + Bedrock style architecture at no cloud cost.
Whimsy (Social media application)

WH

Whimsy (Social media application)

2025

CMPUT 301 team project: short-form “moods” social app on Firebase with Android (Kotlin/Java), plus AI-driven imagery and responses.

FirebaseJavaKotlinXMLAndroidDALL·E
  • Full-stack social platform for moods with Firebase auth, storage, and sign-in/sign-up flows.
  • Social features: follow graph, real-time feeds, tagging, profile editing, and follow recommendations.
  • DALL·E image generation and automatic mood explanations via the `#generate-response` hashtag.
Retire Ready Inc — Marketing site

RE

Retire Ready Inc — Marketing site

2026

Production marketing site for a financial consulting and retirement planning business.

Next.js 16TypeScriptTailwind CSS v4Framer MotionVercel
  • Multi-page App Router, services, and contact experience
FORMULA 1 race outcome predictor & analytics dashboard

FO

FORMULA 1 race outcome predictor & analytics dashboard

2025

Predictive modeling on Grand Prix data with a Streamlit dashboard (not listed on the one-page resume PDF; kept as a portfolio build).

PythonPandasNumPyLightGBMscikit-learnSHAPFastF1Streamlit
  • Feature engineering and model interpretability (e.g. SHAP)
  • Streamlit UI for exploring predictions and team/driver views
ABCMS — Automotive body control module

AB

ABCMS — Automotive body control module

2026

Embedded C11 BCM-style module: CAN, FSMs, fault handling, and tests.

C11CMakeCANCppUTestSIL
  • Message-driven design and static memory constraints
DEAL-R — Event-sourced poker engine

DE

DEAL-R — Event-sourced poker engine

2026

Event-sourced Hold'em with deterministic replay, FastAPI, and PostgreSQL.

PythonFastAPIPostgreSQLWebSocketsPydantic
  • Pure reducer engine; idempotent server commands
Beyond the Apex — F1 analytics dashboard

BE

Beyond the Apex — F1 analytics dashboard

2025

F1 analytics and predictions with React, Vite, and Recharts.

ReactTypeScriptViteTailwindRechartsReact Router
  • Dashboards, exports, and live-data client stub
Alpha Centauri — Virtual try-on (CPU real-time)

AL

Alpha Centauri — Virtual try-on (CPU real-time)

2026

Webcam virtual try-on: pose, Kalman, TPS/affine warp, lighting, compositing.

PythonOpenCVMediaPipeNumPy
  • Modular pipeline and evaluation hooks
Lossless Legends — Virtual try-on (group)

LO

Lossless Legends — Virtual try-on (group)

2026

CMPUT 414 group project: real-time try-on and evaluation docs.

PythonOpenCVMediaPipePyTorch
  • Config-driven warping, optional GPU path
Seven Senses — Retreat website

SE

Seven Senses — Retreat website

2026

Marketing site: Next.js 16, Tailwind v4, Framer Motion.

Next.js 16TypeScriptTailwind CSS v4Framer Motion
  • Responsive marketing sections and motion

Skills

Frontend

HTML5/5
Shaka Player4/5
Tailwind CSS4/5
TypeScript / JavaScript (React)5/5

Backend

C / C++ / C114/5
Celery3/5
Django4/5
FastAPI5/5
FFmpeg (HLS / streaming)4/5
Firebase4/5
Go3/5
Java / Kotlin (Android)4/5
Python5/5

Database

MongoDB3/5
PostgreSQL / SQL5/5
SQLite4/5

Cloud

AWS5/5
Docker5/5
HLS / DASH & AES-128 DRM4/5
Kubernetes3/5
OpenStack / Ceph3/5

Tools

Cursor4/5
Git5/5
Jira4/5
Linux (kernel & userspace)4/5
SocketCAN / QEMU-KVM3/5

Other

LangChain / LangGraph3/5
ML: scikit-learn, LightGBM, ANN/CNN4/5
Retrieval-Augmented Generation (RAG)4/5
Streamlit4/5

Certifications

AWS Certified Solutions Architect - Associate

AWS

Google Cloud Associate Cloud Engineer

Google Cloud

Get In Touch

I'm always open to discussing new projects, creative ideas, or opportunities to be part of your visions.