We are looking for an experienced Data Scientist to build, evaluate, and productionize machine learning models that drive measurable business impact. You will lead end-to-end model development—from problem framing and data preparation to experimentation, deployment, and monitoring—while raising the bar on rigor, reproducibility, and velocity.
THIS IS A FULL-TIME ROLE WITH AVAILABILITY ON PST TIME ZONE (9AM-5PM MON-FRI)
End-to-End Model Development: Translate business objectives into ML problems; scope datasets; build, iterate, and validate models (supervised/unsupervised, time series, deep learning); take models from notebook to production.
Feature Engineering & Data Pipelines: Design reliable datasets and features; implement robust ETL/ELT in SQL/Python (and ideally Spark); partner with data engineering on feature stores, data quality checks, and lineage.
Experimentation & Evaluation: Define metrics and baselines; run structured experiments (CV, holdouts, A/B tests); perform error analysis, bias/fairness checks, and model explainability; document results and tradeoffs.
Model Deployment & Monitoring: Collaborate with engineering to package and ship models (batch, streaming, real-time); instrument monitoring for performance, drift, and business KPIs; establish retraining and rollback strategies.
Machine Learning & Statistics: Strong grasp of classic ML (regression, trees/GBMs, clustering), feature selection, model validation, and statistical testing.
Deep Learning: Hands-on experience with PyTorch or TensorFlow; practical understanding of modern architectures. LLM exposure (prompting, fine-tuning/LoRA, embeddings/RAG) is a plus.
Data Wrangling & SQL: Proficiency in Python (pandas, NumPy) and SQL; experience working with large datasets. Spark or similar distributed frameworks is a plus.
Experiment Tracking & Reproducibility: Comfortable with MLflow/Weights & Biases/DVC; versioning of code, data, and models; reproducible environments (conda/poetry).
Model Serving & APIs: Experience packaging models and exposing inference via services (e.g., FastAPI/Flask) and integrating with upstream/downstream systems.
Programming & Version Control: Strong Python fundamentals, testing, and Git workflows; ability to write clean, production-ready code.
Demonstrated impact shipping models to production with measurable business outcomes and post-deployment monitoring.
Experience collaborating with product/engineering to define problem statements, success metrics, and experiment designs; excellent communication of tradeoffs to non-technical stakeholders.
Ability to operate autonomously in a remote setting, manage multiple workstreams, and communicate clearly across time zones.
Our simple 3-step process:
Answer a few questions about your experience.
Upload your CV.
Record a brief video introduction (up to 2 minutes).
***All answers must be in English.***
FAQs:
What’s the next step? If successful, you’ll have one more interview before receiving a job offer if you're a good fit.
What happens after submission? We’ll review your application within 2–3 business days and contact you if you qualify for the next stage.
When would I start? As soon as possible, with flexibility to accommodate your circumstances.
How long does it take? About 5-10 minutes to complete.
We look forward to reviewing your application!