- Senior
- Office in Courbevoie
Cloud & AI Architect F/M
Information Systems
A pioneer in the field of public-transport development, Keolis is the preferred partner of public decision-makers who wish to harness shared mobility as a catalyst for their territory’s attractiveness and vitality.
As the world’s leading operator of automated metro and tramway networks, Keolis pursues an ambitious open-innovation policy with all of its subsidiaries and partners (Kisio, EFFIA, Keolis Santé and Cykleo) to strengthen its core business while creating new, tailor-made shared-mobility solutions: regional and intercity trains, buses and coaches, trolleybuses, pooled ride-hailing services, river and maritime shuttles, self-service bicycles, car-sharing, 100% electric autonomous shuttles, urban cable cars, and more.
In France, Keolis ranks second in the parking sector through its subsidiary EFFIA and has been the market leader in medical transport since the establishment of Keolis Santé in July 2017.
Seventy per cent owned by SNCF and thirty per cent by the Caisse de dépôt et placement du Québec (CDPQ), the Group employs 68,500 people in 16 countries and generated revenue of €6.1 billion in 2020. In 2019, 3.4 billion passengers used a shared-mobility service operated by Keolis.
Working at Keolis first and foremost means living rich and varied experiences. Our 68,500 employees bring meaning to their work, build unique career paths and grow through the encounters they make every day.
With more than 4,000 employees, the Île-de-France division oversees and manages nearly 30 Keolis subsidiaries within the Greater Paris region. Supported by a dedicated Digital & Information Systems Directorate (DDSI), we deliver enterprise IT, line-of-business applications, digital services and innovation that benefit travellers throughout Île-de-France.
From Tram-Train and autonomous shuttles to buses, coaches, demand-responsive transport and services for passengers with reduced mobility, Keolis Île-de-France operates public-transport solutions on behalf of Île-de-France Mobilités (formerly STIF) as well as major private clients (Aéroports de Paris, hotels, corporations, etc.).
As part of our ongoing growth, the DDSI is recruiting a Cloud & AI Architect (F/M).
Your Mission
Reporting to the Technical Direction team under Cloud Operation, you will, as Cloud & AI Architect, design, build out and ensure the operational readiness of the cloud infrastructures and AI platforms that underpin Keolis’ information system. You will be the bridge between Cloud Engineering, the Data Science / ML teams and our 24/7 Production Operations team. Your goal: raise the availability, performance, scalability and reliability of all passenger‑facing and back‑office services while enabling the safe, efficient and fast delivery of AI/ML capabilities.
You will contribute to corporate projects by drawing on a strong grasp of business imperatives and of the technical challenges posed by Big Data, the Internet of Things and Artificial Intelligence. You will manage the roadmap towards a well‑structured, production‑grade architecture for data and models while addressing emerging requirements from the business and our customers, with the ultimate goal of industrialising, operationalising and monetising the data and AI assets generated by the information system across all operational processes.
To that end, you will provide deep technical expertise in cloud and AI technologies, operating principles, model lifecycle management (MLOps) and responsible AI practices, while keeping a sharp focus on business value and the use of meaningful indicators. You will organise and manage the projects entrusted to you within this scope, under the CIO’s authority and in constant coordination with the IT Production, Transformation, Innovation, Data and Security teams, in accordance with the IS‑transformation roadmap.
Your sphere of collaboration will encompass subsidiary, regional and Group‑level teams, as well as Keolis’s external partners and selected AI vendors.
You will:
- Define operational objectives and ownership for critical transport applications and AI/ML services (model latency, throughput, prediction accuracy, data freshness, drift detection).
- Build and maintain automated runbooks, self‑healing workflows and observability dashboards for both infra and model performance in Azure.
- Drive reliability and governance features directly into the CI/CD and MLOps pipelines (Azure DevOps, Azure ML, MLflow) used by more than 40 product and data teams.
- Define and run post‑incident reviews and blameless retrospectives for infra, data and model incidents; turn insights into prioritized remediation and prevention plans.
- Champion best practices for Infrastructure‑as‑Code (Terraform), model deployment strategies (canary, blue/green, shadow), feature‑flagging and rollout governance.
- Coach Data Engineers and Operations staff to transition repetitive tasks into code, automate model retraining and deployment, and reduce manual toil.
- Partner with SecOps, Data Governance and Legal to embed guardrails for privacy, security, explainability and compliance (GDPR, model risk).
- Contribute to cost‑optimisation and capacity planning across PaaS, IaaS, managed AI services and data platforms (Azure ML, Synapse, Databricks, vWAN).
- Promote responsible AI: implement monitoring for bias, drift, data lineage, model explainability and lifecycle documentation.
- Evaluate and integrate generative AI and LLM capabilities when appropriate (OpenAI / Azure OpenAI, embeddings, retrieval‑augmented generation) ensuring safety and business alignment.
Our Tech Landscape
- Azure AD, VNets, vWAN, App Service, AKS, Functions, SQL MI, Event Hub, IoT Hub
- Azure ML, Azure OpenAI / Cognitive Services, Azure Synapse, Databricks, Data Lake Storage
- Hybrid Windows & Linux workloads, O365, Defender, Intune
- Azure DevOps Repos, Pipelines, Artifacts, Boards; MLflow, Azure Pipelines for MLOps
- HashiCorp Terraform & Packer, Bicep, Ansible
- Prometheus, Grafana, Azure Monitor, Log Analytics, Kusto (KQL)
- Python (pandas, scikit‑learn, PyTorch, TensorFlow), PowerShell, Go (nice‑to‑have)
- Model governance tools, feature stores, monitoring for data/model drift
What You’ll Need
Must‑Have Skills
- 4+ years in Cloud/Platform architecture, MLOps or AI engineering on Microsoft Azure
- Demonstrated experience operationalising ML models at scale and understanding of model lifecycle (training, validation, deployment, monitoring, retraining)
- Deep understanding of distributed systems, networking and data platforms
- Hands‑on experience building CI/CD and MLOps pipelines (YAML) in Azure DevOps / Azure ML
- Proven knowledge of monitoring/alerting, incident management and post‑mortems for infra and models
- Infrastructure‑as‑Code in Terraform or Bicep
- Familiarity with responsible AI practices: explainability, bias detection, data lineage and governance
- Fluent in English; French a plus
Nice‑to‑Have
- Exposure to public transport or industrial OT (operational technology) environments
- Experience with Databricks, Synapse Analytics, feature stores, or vector databases
- Experience with generative AI, LLMs, embeddings and retrieval‑augmented generation (RAG)
- Kubernetes certifications (CKA/CKS) or ML engineer certifications
- FinOps or cloud cost‑management experience
What We Offer
- Impact: Your work helps keep trams, buses and metros running for 1M+ citizens and enables smarter, safer services.
- Hybrid working model (2–3 days remote) & flexible hours.
- 32 days of holiday, pension plan, annual mobility pass, bike leasing.
- Individual training budget & Microsoft certification vouchers.
- A diverse, inclusive culture with regular tech talks, hack days and Communities of Practice.
Apply Now