Design, implement, and maintain highly reliable and scalable infrastructure and services using cloud platforms (e.g. GCP).
Automate repetitive tasks using tools such as Terraform, Ansible and SaltStack.
Collaborate with development and operations teams to ensure smooth deployment and operation of services using CI/CD pipelines (e.g. Gitlab).
Establish and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure system reliability using monitoring tools like Prometheus and Grafana.
Perform capacity planning and optimization to handle growth and scale.
Lead incident management and post-mortem processes to ensure continuous improvement. In addition to conducting root analysis of system failures.
Mit dem Klick auf “Job-E-Mail bestellen” stimmst du unseren AGBs, unseren Datenschutzbestimmungen und der Nutzung von Cookies zu. Du kannst dich jederzeit von unseren E-Mails & Services abmelden.
Mit dem Klick auf “Job-E-Mail bestellen” stimmst du unseren AGBs, unseren Datenschutzbestimmungen und der Nutzung von Cookies zu. Du kannst dich jederzeit von unseren E-Mails & Services abmelden.