Site Reliability Engineering (SRE) Team – IKP

Site Reliability Engineering (SRE) Team – IKP

Posted Today by GIOS Technology

Negotiable
Undetermined
Onsite
Sheffield, England, United Kingdom

Summary: The bank is seeking a Site Reliability Engineering (SRE) Team – IKP to ensure the reliability, scalability, and security of the Identity and Key Management Platform within the OpenShift infrastructure migration project. This role emphasizes operational excellence, automation, and continuous improvement to meet stringent security and compliance requirements. The position requires a strong focus on maintaining high availability and resilience of services while integrating security practices. The role is based in Sheffield, UK, with an on-site requirement of three days per week.

Key Responsibilities:

  • Maintain high availability and resilience of IKP services across multi-cloud and on-prem environments.
  • Implement monitoring, alerting, and incident response for IKP components.
  • Develop automation for IKP deployment, scaling, and lifecycle management.
  • Integrate IKP processes into CI/CD pipelines for secure and efficient operations.
  • Ensure IKP configurations comply with the bank’s security standards and regulatory frameworks.
  • Manage encryption key lifecycle operations (generation, rotation, revocation).
  • Respond to and resolve IKP-related incidents promptly.
  • Conduct root cause analysis and implement preventive measures.
  • Work closely with IKP SMEs, Build Engineers, OpenShift teams, and ITSO for secure integration.
  • Support audits and compliance reviews.
  • Maintain operational runbooks and detailed documentation for IKP services.
  • Adhere to THE BANK’s IT governance and change management processes.

Key Skills:

  • Strong experience in Site Reliability Engineering for identity and key management systems.
  • Expertise in automation tools (Ansible, Terraform) and scripting (Python, Bash).
  • Familiarity with OpenShift, Kubernetes, and container security best practices.
  • Knowledge of cryptographic principles, PKI, and encryption standards.
  • Proficiency in monitoring tools (Prometheus, Grafana) and incident management frameworks.
  • Certifications such as CISSP, CISM, or cloud security certifications preferred.

Salary (Rate): undetermined

City: Sheffield

Country: United Kingdom

Working Arrangements: on-site

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

We are looking for Site Reliability Engineering (SRE) Team – IKP at Sheffield, UK - 3 days per week

Role Overview: The bank is seeking a Site Reliability Engineering (SRE) Team – IKP to ensure the reliability, scalability, and security of the Identity and Key Management Platform (IKP) within the OpenShift infrastructure migration project. This team will focus on operational excellence, automation, and continuous improvement of IKP services to meet the bank’s stringent security and compliance requirements.

Key Responsibilities:

  • Reliability & Performance: Maintain high availability and resilience of IKP services across multi-cloud and on-prem environments. Implement monitoring, alerting, and incident response for IKP components.
  • Automation & Efficiency: Develop automation for IKP deployment, scaling, and lifecycle management. Integrate IKP processes into CI/CD pipelines for secure and efficient operations.
  • Security & Compliance: Ensure IKP configurations comply with the bank’s security standards and regulatory frameworks. Manage encryption key lifecycle operations (generation, rotation, revocation).
  • Incident Management: Respond to and resolve IKP-related incidents promptly. Conduct root cause analysis and implement preventive measures.
  • Collaboration: Work closely with IKP SMEs, Build Engineers, OpenShift teams, and ITSO for secure integration. Support audits and compliance reviews.
  • Documentation & Governance: Maintain operational runbooks and detailed documentation for IKP services. Adhere to THE BANK’s IT governance and change management processes.

Required Skills & Qualifications:

  • Strong experience in Site Reliability Engineering for identity and key management systems.
  • Expertise in automation tools (Ansible, Terraform) and scripting (Python, Bash).
  • Familiarity with OpenShift, Kubernetes, and container security best practices.
  • Knowledge of cryptographic principles, PKI, and encryption standards.
  • Proficiency in monitoring tools (Prometheus, Grafana) and incident management frameworks.
  • Certifications such as CISSP, CISM, or cloud security certifications preferred.