סקירה כללית

^^משרה זו נלקחה מ Career^^we are building a dedicated SRE team to improve reliability and system resilience and We looking for a SRE Developer to help us monitor and maintain high system reliability in production. You will work closely with engineering, product, and data teams to ensure the performance, availability, and observability of our production systems. This role is ideal for someone who thrives at the intersection of software engineering and systems engineering and is passionate about uptime, automation, and performance. Responsibilities Design, implement, and maintain scalable and reliable infrastructure. Develop automation tools to reduce manual effort and improve system efficiency. Monitor system performance, uptime, and other KPIs to ensure high availability. Drive incident response, root cause analysis, and postmortems. Build and maintain CI/CD pipelines and infrastructure
• as
• code. Collaborate with development teams to ensure best practices in service design and deployment. Implement and advocate for SLOs, SLIs, and SLAs across services. Continuously improve observability, including logging, tracing, and metrics collection. Requirements Bachelor's degree in Computer Science /Completion of a DevOps course/ proven experience
• MUST Proven knowledge and hands
• on experience working with Docker
• MUST Basic experience or exposure to AWS cloud services. Knowledge of Kubernetes and containerized environments. Solid understanding of Linux/Unix systems and networking fundamentals. Proficiency in at least one programming or scripting language (e.g., Python, Go, Bash). Experience working with SQL and relational databases. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, ELK, Datadog).

דרישות המשרה

Design, implement, and maintain scalable and reliable infrastructure. Develop automation tools to reduce manual effort and improve system efficiency. Monitor system performance, uptime, and other KPIs to ensure high availability. Drive incident response, root cause analysis, and postmortems. Build and maintain CI/CD pipelines and infrastructure
• as
• code. Collaborate with development teams to en