About the Engineering (Tech) TeamThe Engineering (Tech) Team is responsible for all Feedzai product development. Together with Product Management and Data Science, we build the next generation of tools to catch fraud in real-time with a machine learning first approach.
Formed by engineers and managed by engineers, at Feedzai, you will find one of the most talented teams out there, from junior to senior engineers. Our work involves a wide range of technical challenges, such as building distributed systems that need to operate 24/7 with ultra-low latencies and solving UI/UX problems to help fraud analysts fight fraud more efficiently.
In addition, designing extensive databases from relational, NoSQL, and graphs, validate and develop new data science techniques and algorithms.
With Cloud at its core, the Platform Engineering area supports our product development life cycle, from development through testing and deployment to operations and maintenance, enabling a DevOps way of working. We are fast-paced and provide a safe, open, and collaborative environment that encourages us to lean in, try new things, and discover our potential with continuous learning for everyone.
You: Site Reliability Engineering (SRE) ManagerFeedzai is seeking an experienced and dynamic Site Reliability Engineering (SRE) Manager to lead our talented team of SREs. If you are passionate about system reliability, have a knack for leading teams, and are obsessed with making systems and processes better, we would love to hear from you.
Your Day to DayLead and manage a team of highly skilled Site Reliability Engineers.
Ensure the performance, reliability, and availability of Feedzai's systems.
Collaborate with cross-functional teams to define and enforce SRE best practices and processes.
Drive continuous improvement in system performance, reliability, and scalability.
Develop and maintain metrics to measure and improve system performance and reliability.
Provide leadership in incident management and post-mortem analysis.
Mentor and grow the SRE team, fostering a culture of reliability and excellence.
Ensure the team adheres to security and compliance requirements.
Promote a culture of continuous improvement and operational excellence.
Main Requirements: You Have & You Know-howAdvanced degree in Computer Science, Engineering, or a related field.Familiarity with Kubernetes and container orchestration.Understanding of security best practices in cloud environments.Proven experience leading and managing teams of SREs or similar roles.Extensive experience with complex distributed systems in the cloud.Strong background in performance-sensitive and critical systems.Excellent understanding of cloud platforms and services (e.g., AWS, Google Cloud, Azure).Demonstrated ability to improve system and process performance and reliability.Strong problem-solving skills and a proactive approach to identifying and addressing issues.Excellent communication and leadership skills.Experience with monitoring and observability tools.Knowledge of automation and infrastructure-as-code practices.Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
#J-18808-Ljbffr