Join Us!

Help us build the next big ideas in today's cloud computing industry

Site Reliability Engineering

Tel Aviv · Full-time · Senior

About The Position

Spotinst is a dynamic, fast growing technological startup with headquarters located in Tel-Aviv and additional offices in San Francisco, New York, Washington D.C. and London. With innovative technology that is revolutionizing the cloud computing industry and a team of highly motivated and creative employees, our vision is to optimize the way DevOps and R&D teams consume cloud computing.

Spotinst is seeking an experienced SRE to be responsible for our production environments. As SRE at Spotinst you will "make things scale", you must love learning new technologies and figuring out the most efficient ways to manage our system. You will help us build, maintain, monitor and secure our cloud services infrastructure that powers our customer facing products.

  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Analyzes to improve the efficiency of data related communications functions for the organization's benefit
  • Manages planned or unplanned maintenance, troubleshooting or root cause analysis, and problem resolution
  • Seeks to minimize downtime or service disruptions. Identifies key performance and capacity metrics and monitors system performance
  • Confers with personnel to identify and resolve problems. Consults with directors, managers, supervisors, and vendors regarding IT-related initiatives
  • Performance Reporting – provide analysis and reporting against established performance standards to identify trends, potential issues and remedial actions required to bring about compliance
  • Day to day monitoring of platform components for resolution of user and technical issues
  • Leverage industry best practice solutions for automation of for business process functionality
  • Understand and comply with corporate IT policies and develop procedures to ensure timely execution
  • Defines and manages escalation procedures between operations and engineering
  • Manage on-call rotations across continents, using a follow-the-sun model


Basic Qualifications:

  • Minimum of 4 years of experience management of large scale, complex systems
  • Database (MySQL, Postgres) administration and operational experience
  • Experience with AWS or Cloud Service platforms
  • Knowledge of the Agile mindset and practices
  • Possession of excellent oral and written communication skills
  • Proactive approach
  • Problem solving skills

Preferred Qualifications:

  • Java experience
  • Strong team management and leadership experience

Apply for this position