Senior MLOps Engineer

Full Time - Remote, posted by ITJobs.lk in AI & Machine Learning, IT Infrastructure & Cloud

Sri Lanka
Post Date: January 29, 2026
Salary: Negotiable

Job Description

Location: Fully Remote (Sri Lanka Based)

Type: Full-Time | Permanent

Timezone: Flexible (US or IST)

Industry: AI-Powered Ecommerce Infrastructure

The Opportunity

Our client, a leading IT firm in Sri Lanka, is seeking a Senior MLOps Engineer to join a high-impact team working on a cutting-edge US-based platform that is transforming ecommerce. The platform's AI-powered infrastructure turns static catalogs into intelligent product data for global brands.

You will own the deployment and optimization of model infrastructure, working directly with state-of-the-art open-source LLMs (Qwen, Llama, Mistral).

Key Responsibilities

Cost Optimization (Primary Focus): Architect strategies for inference cost reduction through batching, quantization, caching, and smart resource allocation.
Fine-Tuning Pipeline Management: Build and scale fine-tuning pipelines for open-source models (Qwen3, Llama, Mistral) using proprietary “golden datasets.”
Data Operations: Collaborate on the versioning and expansion of curated datasets; implement rigorous data quality and validation checkpoints.
Model Deployment: Maintain production ML pipelines capable of handling millions of SKU classifications with high reliability and low latency.
Infrastructure Architecture: Optimize ML environments on AWS (Bedrock, SageMaker, EC2/ECS) with future expansion into Azure; manage GPU clusters for training.
Performance Monitoring: Implement tracking for model drift, fine-tuning metrics, and system health to trigger retraining cycles.
Technical Leadership: Mentor the engineering team on MLOps best practices and drive critical architectural decisions.

Technical Requirements

Experience: 5+ years in MLOps, ML Engineering, or Platform Engineering.
LLM Expertise: Hands-on experience fine-tuning open-source LLMs (Qwen, Llama, Mistral) for production.
Dataset Management: Proven track record in managing training datasets, versioning, and pipeline automation.
Cloud (AWS Required): Deep expertise in Bedrock, SageMaker, Lambda, ECS/EKS, S3, and Step Functions.
Programming & Frameworks: Strong Python skills and mastery of PyTorch, Hugging Face Transformers, PEFT/LoRA, and DeepSpeed.
Optimization Techniques: Experience with LoRA, QLoRA, quantization (GPTQ, AWQ, GGUF), distillation, and VRAM optimization.
DevOps Tools: Expertise in Docker and Kubernetes.
Communication: Exceptional English communication skills to collaborate effectively with US-based stakeholders.

Nice to Have

Experience with Azure ML or Azure OpenAI services.
Familiarity with inference frameworks like vLLM, TensorRT-LLM, or TGI.
Experience with multi-GPU/multi-node distributed training.
Knowledge of ML observability tools (Weights & Biases, MLflow, Langfuse).

What’s on Offer

Competitive Salary & Equity: Highly attractive compensation package.
Global Impact: Work on state-of-the-art AI technology at a massive scale.
Remote-First: Work from anywhere in Sri Lanka with flexible hours.
Career Growth: High-visibility role with direct influence on the technical roadmap.

How to Apply

If you are a Senior Engineer ready to lead the next generation of MLOps for global ecommerce, we want to hear from you.

Apply now via ITJobs.lk or send your CV to hello@itjobs.lk with the subject: Senior MLOps Engineer – [Your Name]
