728 x 90



Site Reliability Engineer

London

Competitive salary dependent on experience

Posted 2 weeks ago
  • Company

    Mistral AI
  • Location

    London
  • Company Size

    51-200 employees
  • Salary

    Competitive salary dependent on experience

About the job

Mistral AI is seeking experienced Site Reliability Engineers (SREs) to shape the reliability, scalability, and performance of its AI platform and customer-facing applications. In this hybrid role based in Paris or London, you will work closely with software engineers and research teams to ensure systems meet internal and external expectations. Responsibilities include designing and maintaining highly available infrastructures for web services and machine learning workloads, managing HPC clusters, operating and troubleshooting production systems, and implementing monitoring, alerting, and incident response mechanisms. You will also drive infrastructure automation and orchestration using tools like Kubernetes, Terraform, and Flux; collaborate with AI/ML researchers on safe and reproducible model training; and contribute to building a cloud-agnostic platform. Proficiency in Python, Go, or Bash, and experience with CI/CD, containerization, observability tools (Prometheus, Grafana, ELK Stack), and cloud computing are essential. Candidates should have a master’s degree in computer science or a related field and over 7 years of relevant experience. Mistral values individuals who are self-motivated, collaborative, and thrive in fast-paced environments. Experience with AI/ML infrastructure and HPC systems is a plus. Benefits include competitive salary and equity, food and transportation support, health insurance, generous parental leave, and visa sponsorship. Join Mistral AI to be part of a pioneering company transforming the future of AI.


Apply For this Job