Site Reliability Engineer

Job Location: Belgium
Job Category: Infrastructure
Job Type: Full Time

In the role of SRE, you will:

• Be a member of a dynamic team that operates and maintains the customer's mission-critical applications.

• Work with the newest, state-of-the-art cloud-native technologies, both in the cloud and on-premises.

• Deploy products and automate deployments.

• Monitor systems and implement instrumentation for better observability, using ML and other techniques to predict and avoid anomalies.

• Detect, identify, and analyze faults when they arise, help fix them, and work on solutions to prevent recurrence.

• Continuously improve service availability, scalability, performance, monitoring, and overall manageability.

• Collaborate with security experts, architects, and developers to build and improve a sustainable technical landscape.

• Continuously research and assess new approaches for potential use, and provide recommendations and subject matter expertise regarding trends, technology, tools, and services.

• Actively and continuously promote and contribute to improvements in operations and SRE practices, e.g., monitoring, observability, incident management, deployment, or change management.

• Contribute to all areas of solution architecture as a team member.

Must-have skills:

• Proactive attitude and passion for automation.

• Team player who can also work independently.

• Ability to collaborate online in an international environment.

• Ability to quickly acquire and apply knowledge of new methodologies and solutions.

• Calm, analytical, structured, and reliable during incident resolution.

• Strong ethics and respect for confidentiality.

Must-have qualifications (demonstrable experience required):

• Relevant bachelor’s degree or equivalent work experience in computer science or related field.

• A minimum of 3 years of experience relevant to the advertised tasks.

• Good understanding of and experience with SRE / Operations principles and frameworks, as well as DevOps and DevSecOps principles.

• Good understanding of security principles.

• Hands-on experience with Kubernetes (AKS, OpenStack) and Linux system administration (Red Hat), and a good understanding of networking protocols.

• Hands-on experience with automation, Infrastructure as Code, CI/CD, and Git (Terraform, Ansible, Jenkins).

• Experience in scripting (Bash, Python).

• Knowledge of and hands-on experience in operating and automating solutions with at least two of the following technologies/products or equivalents:

    • Observability (Elastic, Prometheus, Jaeger, OpenTelemetry)

    • Cloud services (Azure preferred)

    • Document database (MongoDB)

    • Relational database management system (PostgreSQL preferred)

    • Web servers, load balancers, forward and reverse proxies (Nginx, Squid)

    • Messaging system (Solace preferred)

    • Event streaming (Kafka preferred)

    • Issue tracking / agile project management tool (Jira, ServiceNow)

    • IAM (Okta)

    • Secret management (HashiCorp Vault)

• Experience with Static and Dynamic Application Security Testing is considered an advantage.

• Professional qualification in relevant fields is considered an advantage.
