Overview
You will be responsible for maintaining the availability, performance, and security of our AWS-based cloud infrastructure. Your role includes monitoring systems, responding to incidents, and taking ownership of incident management and troubleshooting. You will also help improve monitoring practices and play a key role in advancing automation initiatives across our environments, using Infrastructure as Code (IaC) to scale monitoring solutions effectively.

Responsibilities:
* Product Expertise: Develop and maintain a deep technical understanding of the product, including its features, functionality, and underlying architecture.
* Problem Solving: Take the lead in identifying and resolving product issues by applying advanced debugging and troubleshooting skills. Provide timely solutions to both customers and internal teams.
* Matrix Management: Collaborate effectively across multiple teams, including engineering, QA, and customer teams, ensuring alignment on product goals and initiatives. Manage cross-functional workflows and deliverables.
* Customer Relations: Build and maintain strong, positive relationships with customers inside and outside the organization, acting as the main point of contact for any product-related issues or inquiries.
* Bug and Issue Tracking: Identify, track, and prioritize product bugs, working closely with the development team to ensure efficient resolution.
* Documentation: Maintain clear documentation of debugging processes, technical issues, and resolutions for future reference and knowledge sharing.
* Training & Support: Mentor junior team members and provide training on troubleshooting and debugging techniques. Ensure that internal teams have the knowledge they need to handle product issues.
* Continuous Improvement: Analyze product performance data and user feedback to recommend improvements to the product and its features.
Requirements:
* Results-driven NOC Engineer with extensive hands-on experience across AWS services, including CloudWatch for monitoring and OnCall for incident response.
* Highly skilled at diagnosing and resolving complex cloud-infrastructure problems, while proactively implementing measures to reduce future incidents.
* Proficient in automation and Infrastructure-as-Code practices, using tools to build and maintain scalable, well-monitored environments.
* Known for clear communication and precise documentation under pressure, and for collaborating seamlessly with Operations and Engineering teams.
* Brings a problem-solving mindset and a passion for refining monitoring and automation strategies to improve reliability and efficiency.

Preferred Qualifications:
* Degree: Bachelor’s degree or higher in Computer Science, Engineering, or a related field.
* Certifications: Relevant technical certifications in product management, project management, or troubleshooting are a plus.
* Industry Experience: Experience in fintech companies is an advantage.
Job Requirements
* Product Expertise: Develop and maintain a deep technical understanding of the product, including its features, functionalities, and underlying architecture.
* Problem Solving: Take the lead in identifying and resolving product issues by applying advanced debugging techniques and troubleshooting skills. Provide timely solutions to both customers and internal teams.
* Matrix Management: Collaborat