Job title: 75% remote: mk8s Operations Engineer (f/m/d)
Pay interval: Hourly
Pay rate: Negotiable
Location: Remote & Frankfurt am Main
Job posted: 19-03-2026
Job ID: 70623
Name: Angelika Arghiani
Phone number: +4915119501559
Email: Angelika.Arghiani@nemensis.de

Job description

For our client we are looking for a mk8s Operations Engineer (f/m/d).

Start: 04.05.2026
Duration: until 31.12.2026, with a long-term extension desired
Capacity: 100%
Location: 75% Remote, 25% Frankfurt or Berlin (1 week Frankfurt / 3 weeks remote in rotation), up to 50% onsite in peak times
Languages: English and German are both required (C1 level)

Role:
Local Operations manages the on-premises production platform, which serves as the primary host for all mission-critical business applications.
Local Operations is responsible for the following core areas:
• Platform Stability: Ensuring the high availability and performance of the on-premises private cloud environment.
• Application Hosting: Consulting on the seamless operation of Germany-specific production business applications.
• Incident Management: Resolving technical issues within standard business hours to minimize operational downtime.
• Lifecycle Maintenance: Executing routine updates, patches, and system optimizations within the local infrastructure.

Objectives:
- Consult on CI/CD pipelines and ensure operational readiness for deployments
- Ensure operational stability and responsiveness for the managed Kubernetes platform
- Reduce operational toil and improve service reliability
- Ensure platform operations adhere to security and compliance standards

Skills (must-have):
- At least 5 years of operational experience with self-managed Kubernetes clusters, self-managed services providing Kubernetes clusters, and production applications or systems running on Kubernetes in on-premises environments
- Deep understanding of networking concepts, including protocols, load balancing, and security.
- In-depth knowledge of and implementation experience with CI/CD processes, tooling (e.g., GitLab, Jenkins, Tekton, Argo Workflows, and Argo CD), concepts, and the associated quality and security assurance for software delivery
- Fundamental understanding of core operations processes (incident management, change management, problem management, IT Service Management) as well as SRE concepts
- Experience in gathering operational insights from monitoring or observability data, including SLI/SLA/SLO management and tracking.
- Hands-on experience in documenting procedures properly and enforcing clear runbooks or playbooks.
- Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Mimir, Loki)
