Company
Mistral AILocation
LondonCompany Size
51-200 employeesSalary
Competitive salary dependent on experienceAbout the job
Mistral AI is seeking experienced Site Reliability Engineers (SREs) to shape the reliability, scalability, and performance of its AI platform and customer-facing applications. In this hybrid role based in Paris or London, you will work closely with software engineers and research teams to ensure systems meet internal and external expectations. Responsibilities include designing and maintaining highly available infrastructures for web services and machine learning workloads, managing HPC clusters, operating and troubleshooting production systems, and implementing monitoring, alerting, and incident response mechanisms. You will also drive infrastructure automation and orchestration using tools like Kubernetes, Terraform, and Flux; collaborate with AI/ML researchers on safe and reproducible model training; and contribute to building a cloud-agnostic platform. Proficiency in Python, Go, or Bash, and experience with CI/CD, containerization, observability tools (Prometheus, Grafana, ELK Stack), and cloud computing are essential. Candidates should have a master’s degree in computer science or a related field and over 7 years of relevant experience. Mistral values individuals who are self-motivated, collaborative, and thrive in fast-paced environments. Experience with AI/ML infrastructure and HPC systems is a plus. Benefits include competitive salary and equity, food and transportation support, health insurance, generous parental leave, and visa sponsorship. Join Mistral AI to be part of a pioneering company transforming the future of AI.
Apply For this Job