Careers at​ Gloat

career page line shape 2

Site Reliability Engineer

Tel Aviv · Full-time · Senior

About The Position

We’re looking for an experienced, highly motivated SRE engineer, to utilize SRE methodologies and technologies in order to implement highly scalable and available production environments.

As an SRE engineer at Gloat, you will be part of a growing DevOps team. You will have the freedom to explore and implement the newest technologies. You will be responsible for implementing monitoring and alerting infrastructure and defining the correct measurements for a highly available production environment. You will learn new things every minute of every day and constantly be challenged. There will not be a single boring moment of work but the opposite; exciting, motivating, and stimulating. The DevOps team has the honor and responsibility to support some of the biggest enterprise clients in the world.



Responsibilities

  • Design and implement reliable, highly available and scalable production infrastructure.
  • Seek for new technologies, from POC through implementation.
  • Ensure high uptime and reliability of the production environment.
  • Perform root cause analysis for complex failures and offer modern solutions and tools.
  • Analyze performance and stability issues.
  • Work closely with DevOps, R&D, product, and support to define cross-organizational processes. 
  • Design, develop, and drive troubleshooting & mitigation tools as part of driving self-healing agenda.
  • Educate engineers on how to approach and debug production issues across services and levels of the stack.

Requirements

  • 3 + years SRE experience
  • Solid knowledge of Kubernetes
  • Proven Monitoring and alerting experience (ELK, Grafana, Prometheus, etc.)
  • Experience implementing services in one of the big clouds (AWS, Azure, GCP, etc.)
  • Experience with a complete programming language (Python, Java, Go, Ruby, etc.)
  • Scripting and automation skills (Bash, Python, etc.) .
  • Strong background knowledge in Linux Administration.
  • Strong networking skills
  • Experience with IAC tools such as Ansible, Terraform, etc.

Advantages:

  • Experience in multi cloud environments
  • Strong Security skills
  • Microservice architecture implementation experience
  • Experience with SaaS production infrastructure


Apply for this position

Let's get acquainted

This information helps us personalize your demo.
 

Let's get acquainted

This information helps us personalize your demo.