Applied AI • ML • NLP • Analytics

Simarpreet Kaur

Building reliable ML systems and data products.

I work across the full pipeline: messy data → clean features → models → evaluation → deployable outputs. Interested in NLP, predictive modeling, and practical analytics that ship.

Data Science • ML Engineering

Building practical ML systems end-to-end — from data pipelines to evaluation and deployable outputs.

Python SQL PyTorch scikit-learn NLP

Focus

NLP • ML • Data Products

Stack

Python • SQL • PyTorch

Strength

Clean pipelines, strong evaluation, shipping-ready outputs

About

I’m a Computer Science graduate from McMaster University with a strong focus on applied data science and machine learning, particularly in building models and analyses that translate into real-world impact. My work emphasizes the full data science lifecycle from data cleaning and exploratory analysis to modeling, evaluation, and clear communication of results.

I bring a detail-oriented and analytical approach, with experience working under real-world constraints such as data quality issues, validation, and model reliability. I value reproducibility, thoughtful evaluation, and practical deployment over purely theoretical results.

In addition to my technical background, I offer 6+ years of customer-facing experience, which has shaped my communication, collaboration, and stakeholder-focused mindset. I’m comfortable explaining complex findings to non-technical audiences, working cross-functionally, and building solutions that are both technically sound and user-centered.

Experience

Data Science Prompt Engineer

Outlier

Oct 2025 – Present

  • Designed and refined prompts for end-to-end data science workflows: EDA, feature engineering, and modeling.
  • Produced gold-standard solutions in Python (Pandas / scikit-learn), validating correctness and edge cases.
  • Performed error analysis on model outputs to improve numerical accuracy and analytical reliability.

Selected Projects

Certifications

Technical Arsenal

ML / AI

sklearn scikit-learn
ml ML Modeling & Evaluation
📈 F1, ROC-AUC, Cross-Validation
🧠 NLP / Embeddings

Languages & Query

python Python
sql SQL
🐍 Scripting & Data Pipelines
🔎 Data Querying & Analysis

Data

pandas Pandas
numpy NumPy
jupyter Jupyter

Viz

matplotlib Matplotlib
seaborn Seaborn
📊 Tableau / BI

Get in Touch

Open to full-time roles. Reach out anytime.

© 2025 Simarpreet Kaur