At Iownit (Permanent), in Lisbon, Portugal
Salary: €49.600 - €70.000
Expires at: 2024-10-11
Remote policy: Full remote

Your impact:
We are actively seeking our first hire for the role of Senior Site Reliability Engineer (SRE) to join our Infrastructure team.
As a Senior Site Reliability Engineer, you will collaborate across teams to consistently enhance and maintain a scalable, reliable production environment for operating the next-generation capital markets platform for alternative investments.
In this role, you will:
Champion a culture of site reliability: instill best practices and a robust reliability culture within your team.
Drive technical excellence across your team, ensuring a high standard of reliability practices.
Collaborate seamlessly with cross-functional teams to identify and resolve system or product reliability issues.
Strengthen iownit's tech stack and deployments by fortifying our observability and monitoring pipelines, scaling systems, and advancing our release process.
Take charge of investigating and addressing incidents, driving postmortem actions to prevent recurrence.
Guarantee the stability of releases to client environments, upholding a seamless user experience.
Collaborate with team members and stakeholders to define clear and measurable service level agreements (SLAs) and service level indicators (SLIs), and establish realistic and achievable service level objectives (SLOs).
Document and share knowledge within the organization through internal forums and communities of practice.
Showcase expertise in reliability, scalability, performance, security, enterprise system architecture, and toil reduction, and implement these practices within applications or platforms.
You will get an opportunity to work in a team that keeps growing, innovating, and giving you room to be proactive and creative.

Main requirements
About you:
At least 2 years of experience supporting and maintaining AWS infrastructure (VPC, EC2, ECS, CloudWatch, Security Hub, IAM, S3, etc.).
Proficient with the infrastructure-as-code approach, using CloudFormation or Terraform.
Good understanding of network- and security-related configuration in AWS environments.
Can easily navigate CloudWatch logs through multiple application layers to identify the root cause of an issue.
Used to creating and executing playbooks to remediate standard issues.
Strong experience with modern container orchestration systems such as Docker Swarm or Kubernetes.
Some experience with scripting or programming languages (Bash, Python).
Strong bias for action and ownership.
Last but not least, we are here to bring value and a positive experience to our clients and team.
You will join a team working toward a common goal and sharing responsibility for the results!

Nice to have
Experience with AWS Site-to-Site VPN or other VPN services.
Experience with large-scale distributed systems is highly appreciated.
Practical experience identifying and mitigating DDoS attacks together with cybersecurity teams.
Can work with databases using SQL and performance monitoring tools.