urgent

Senior MLOps Engineer

Full Time - Remote @ITJobs.lk in AI & Machine Learning , in IT Infrastructure & Cloud
  • Sri Lanka View on Map
  • Post Date : January 29, 2026
  • Salary: Negotiable
Email Job

Job Description

Location: Fully Remote (Sri Lanka Based)

Type: Full-Time | Permanent

Timezone: Flexible (US or IST)

Industry: AI-Powered Ecommerce Infrastructure

 

The Opportunity

Our client, a leading IT firm in Sri Lanka, is seeking a Senior MLOps Engineer to join a high-impact team working —a cutting-edge US-based platform transforming ecommerce.AI-powered infrastructure turns static catalogs into intelligent product data for global brands.

You will own the deployment and optimization of model infrastructure, working directly with state-of-the-art open-source LLMs (QWEN, Llama, Mistral).

 

Key Responsibilities

  • Cost Optimization (Primary Focus): Architect strategies for inference cost reduction through batching, quantization, caching, and smart resource allocation.

  • Fine-Tuning Pipeline Management: Build and scale pipelines for open-source models (QWEN3, Llama, Mistral) using proprietary “golden datasets.”

  • Data Operations: Collaborate on the versioning and expansion of curated datasets; implement rigorous data quality and validation checkpoints.

  • Model Deployment: Maintain production ML pipelines capable of handling millions of SKU classifications with high reliability and low latency.

  • Infrastructure Architecture: Optimize ML environments on AWS (Bedrock, SageMaker, EC2/ECS) with future expansion into Azure; manage GPU clusters for training.

  • Performance Monitoring: Implement tracking for model drift, fine-tuning metrics, and system health to trigger retraining cycles.

  • Technical Leadership: Mentor the engineering team on MLOps best practices and drive critical architectural decisions.

 

Technical Requirements

  • Experience: 5+ years in MLOps, ML Engineering, or Platform Engineering.

  • LLM Expertise: Hands-on experience fine-tuning open-source LLMs (QWEN, Llama, Mistral) for production.

  • Dataset Management: Proven track record in managing training datasets, versioning, and pipeline automation.

  • Cloud (AWS Required): Deep expertise in Bedrock, SageMaker, Lambda, ECS/EKS, S3, and Step Functions.

  • Programming & Frameworks: Strong Python skills and mastery of PyTorch, Hugging Face Transformers, PEFT/LoRA, and DeepSpeed.

  • Optimization Techniques: Experience with LoRA, QLoRA, quantization (GPTQ, AWQ, GGUF), distillation, and VRAM optimization.

  • DevOps Tools: Expertise in Docker and Kubernetes.

  • Communication: Exceptional English communication skills to collaborate effectively with US-based stakeholders.

Nice to Have

  • Experience with Azure ML or Azure OpenAI services.

  • Familiarity with inference frameworks like vLLM, TensorRT-LLM, or TGI.

  • Experience with multi-GPU/multi-node distributed training.

  • Knowledge of ML observability tools (Weights & Biases, MLflow, Langfuse).

 


What’s on Offer

  • Competitive Salary & Equity: Highly attractive compensation package.

  • Global Impact: Work on state-of-the-art AI technology at a massive scale.

  • Remote-First: Work from anywhere in Sri Lanka with flexible hours.

  • Career Growth: High-visibility role with direct influence on the technical roadmap.

 

 

How to Apply

If you are a Senior Engineer ready to lead the next generation of MLOps for global ecommerce, we want to hear from you.

Apply now via ITJobs.lk or send your CV to hello@itjobs.lk with the subject: Senior MLOps Engineer – [Your Name]

Other jobs you may like