Overview
Expleo is a trusted partner for your innovation journey. As a global engineering, technology and consulting service provider, we are ideally positioned to help you achieve your ambitions and future-proof your business. With a smart blend of bold thinking and reliable execution, we’re able to fast-track innovation through each step of your value chain.
We are strategically positioned to build value, with a global footprint across 30 countries. We are as global and local as you need us to be, with best-in-class pan-European technological centres and unique best-shoring capabilities.
We leverage a network of high value-adding affiliates in consulting and industrial excellence, and leading partners across multiple sectors to provide you with the most comprehensive services and solutions in an ever-changing environment.
Responsibilities
- Design, implement, and maintain Kubernetes-based platforms and DevOps infrastructure.
- Support and operate secure DevSecOps environments and CI/CD pipelines.
- Deploy and manage containerized workloads and platform components.
- Implement infrastructure as code and GitOps practices for platform provisioning and configuration.
- Configure and maintain observability solutions to monitor platform performance and reliability.
- Ensure platform and infrastructure security through container security tools, secrets management, and network policies.
- Manage supporting platform services such as storage, databases, and ML platform components.
- Collaborate with cross-functional engineering teams to support platform and application delivery.
- Produce and maintain technical documentation and communicate effectively with internal teams and customers.
- Identify issues, troubleshoot platform problems, and proactively improve reliability and security.
Essential skills
- 7+ years of experience in DevOps or Platform Engineering.
- 2+ years of experience in DevSecOps.
- Strong expertise in Kubernetes (AKS, upstream, or enterprise Kubernetes).
- Experience with on-premises Kubernetes (RKE2, K3s, or OpenShift).
- Experience with GPU-accelerated workloads using NVIDIA operators, GPU device plugins, or Run:AI.
- Experience with CI/CD tools such as Azure DevOps or GitHub Actions.
- Infrastructure as Code experience using Terraform, Helm, Kustomize, YAML, and GitOps practices.
- Experience with observability tools such as Prometheus and Grafana.
- Expertise in container security tools (e.g., CNI, Trivy, Aqua, Prisma, or similar).
- Experience with secrets management tools (e.g., Vault or Azure Key Vault).
- Strong Linux and networking knowledge (TLS, DNS, Ingress, OAuth/OIDC, VNet, Peering, VPN/Jump Host configuration).
- Experience managing MinIO, MLflow, and PostgreSQL (HA and backups).
- Strong scripting skills (Python or Bash; Go is optional).
- Expertise in Keycloak configuration.
- Experience with Kubernetes security practices (CIS-hardened Kubernetes, RBAC, NetworkPolicies, PodSecurityAdmission).
- Knowledge of Zero Trust networking principles.
- Experience with SAST, DAST, and SCA security tooling.
- Knowledge of encryption at rest and in transit, certificate management, and firewall configuration.
- Fluency in English.
Desired skills
- Experience supporting ML platforms (Kubeflow, MLflow, KServe).
- Knowledge of storage systems such as Ceph or NetApp.
- Experience working in regulated industries (automotive, aerospace, medical, energy, railway).
What do I need before I apply?
- This is a remote position, but candidates must already be living in Portugal (PT).