Site Reliability Engineer (M/F)
A2IT Technology
26.06.2025 | | Referência: 2290604

PARTILHAR
Empresa:
A2IT Technology
Descrição da Função
A2IT Tecnologia is a Portuguese company specializing in information technology services, founded in 2006. We provide integrations and management of technological solutions, with competence centers available 24x7 and with nationwide coverage. We have offices and support centers in Portugal (Lisbon, Porto, Faro and the Islands), Brazil (Fortaleza and Belo Horizonte) and the United Arab Emirates (Dubai), guaranteeing comprehensive coverage to meet the needs of our clients. We have several partners and are GNS and ISO 9001 certified.
Location: Lisbon / Porto / Azambuja (Hybrid)
Key Responsibilities:
- Provide hands-on support to both technical and business teams;
- Monitor systems proactively to detect and respond to incidents and service degradation;
- Investigate integration issues, gather information, and collaborate with internal and external teams;
- Perform root cause analysis to prevent recurring issues;
- Prioritize multiple concurrent issues effectively with your team;
- Understand the business context and technical architecture of each system to better assess impact and urgency;
- Participate in on-call rotations to ensure platform stability;
- Contribute to the continuous improvement of monitoring, alerting, logging, and incident response processes;
- Act as a liaison between technical and non-technical stakeholders, adapting communication accordingly.
Required Qualifications:
- 3+ years of experience in Application Support or Site Reliability Engineering;
- Strong analytical mindset: identify patterns, differentiate between isolated errors and systemic issues;
- Experience with microservices operationalization;
- Proficient with tools like ELK stack, Prometheus, and Grafana;
- Familiarity with cloud environments, especially AWS;
- Experience using collaboration platforms such as Jira, Confluence, GitLab;
- Ability to understand complex systems architecture and how components interact within a broader ecosystem;
- Strong proactivity in identifying risks through logs and metrics and suggesting improvements to observability;
- Excellent communication skills, especially when engaging with non-technical stakeholders;
- Willingness to participate in on-call duty as needed;
- Fluency in English, both written and spoken.
Nice to Have:
- Hands-on coding experience with .NET Core, Python, or similar;
- Background in Retail or Logistics domains;
- Familiarity with Transport Management Systems (TMS) and logistics processes;
- Experience working with transport carriers (operational or functional knowledge).

Observações
Not Specified (Portugal)