Machine Learning Engineer (LLMOps)
Own LLM training/fine‑tuning, retrieval pipelines, evaluation, and productionization.
All positions

Job Details

Machine Learning Engineer (LLMOps)

Remote • Full‑time

Own LLM training/fine‑tuning, retrieval pipelines, evaluation, and productionization.

You will collaborate with product engineers to deliver robust AI systems with clear SLAs.

Responsibilities

  • Build data pipelines, labeling workflows and evaluation suites
  • Fine‑tune models (SFT/LoRA), manage embeddings and retrieval
  • Productionize inference (vLLM/TensorRT‑LLM) and monitor drift
  • Implement safety/guardrails and cost/perf optimizations
  • Document playbooks and incident response procedures

Requirements

  • 3+ years in ML/Applied AI with production systems
  • Strong Python; experience with PyTorch/JAX and vector stores
  • MLOps/LLMOps tooling (Weights & Biases, Ray, Kubeflow, Airflow)

Nice to Have

  • Experience with RAG benchmarks and human evaluation
  • Kubernetes, GPUs and observability (Prometheus/Grafana)
  • Security/privacy for AI systems (PII handling)

What We Offer

  • Remote work and flexible PTO
  • Compute/GPU budget
  • Conference travel support

Hiring Process

  1. Intro call
  2. ML systems interview
  3. Practical exercise
  4. Offer

Compensation

Competitive salary + bonus; relocation support if desired