Jobtitel: 75% remote: Storage Operations Expert (f/m/d)
Zahlungsintervall: Stündlich
Lohnsatz: Verhandelbar
Ort: Frankfurt am Main, Remote
Job veröffentlicht: 12-06-2026
Job-ID: 76695
Name: Niklas Machens
Telefonnummer: +4915119501867
E-Mail: niklas.machens@nemensis.de

Stellenbeschreibung

For our client we are looking for a Storage Operations Expert (f/m/d).
 
Start: 01.07.2026
Duration: 31.12.2026++
Capacity: 100%
Location: 75% Remote, 25% Frankfurt (occasionally, sometimes Berlin)
1 week Frankfurt / 3 weeks remote in rotation, up to 50% onsite in peak times
Language: English is a must (C1), German is a must (C1)
 
Team:
The local operations team for Germany is responsible for running a production platform in Germany which will host all productive business applications for Germany.
 
Tasks:
- Provide Tier-3 operational ownership for Storage Products for Local Production (DE)
- Ensure operational readiness for deployments
- Ensure operational stability and responsiveness for the managed Kubernetes platform
- Reduce operational toil and improve service reliability
- Ensure platform operations adhere to security and compliance standards
 
Skills (must-have):
- 5+ years in IT storage operations / service delivery / platform operations with demonstrated leadership in missioncritical environments.
- Proven experience implementing/leading Incident, Problem, Change, Release governance in production.
- Experience supporting platform workloads that rely on shared storage services.
- Storage types: File Storage, Block Storage, Object Storage from Netapp (Ontap)
- Protocols/services: NFS; object storage operations (S3-like concepts).
- Kubernetes storage integration: CSI driver concepts and troubleshooting (PV/PVC lifecycle understanding).
- Virtualization (Storage): Experience operating storage virtualization in enterprise environments.
- ITSM / Collaboration: Jira Service Management (JSM), Jira, Confluence.
- Fundamental understanding of core operations processes (incident management, change management, problem management, IT Service Management) as well as SRE concepts
- Experience in gathering operational insights from monitoring or observability including SLI/SLA/SLO management and tracking.
- Hand-on experience in documenting procedures properly and enforcing clear runbooks or playbooks.
- Observability Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Mimir, Loki).
- Familiarity with enterprise DevOps toolchains is a plus (GitLab, JFrog Artifactory, Backstage, Harness).
- understanding of modern platform operations (Kubernetes/containers, automation, observability), sufficient to govern specialists.
- Platform delivery concepts: GitOps and IaC awareness (Terraform/OpenTofu, ArgoCD, Helm) to govern deployment/readiness standards
 
Skills (should-have):
- Experience operating in regulated / high-availability industries (banking, telco, public sector, healthcare).
- Experience with SRE practices (SLOs/SLIs, error budgets) and reliability management.
- Experience operating storage services that integrate with Kubernetes platforms.
- Familiarity with IaC-based provisioning and GitOps-driven operational patterns.