
Remote-first
Fully flexible hours
Dog friendly
Job Description
As Site Reliability Engineer you'll be accountable for closely monitoring the availability of our platforms, performance and stability while closely working with software development teams in how to improve critical components.
Working with complex challenges while assuring uptime and reliability in different setups (AWS Cloud, AWS Outpost, OpenStack) allows you to use different skillsets in coding, algorithms and complexity analysis.
What will you be doing?
- Engage in and improve the whole lifecycle of services—from design, deployment, operation, and refinement.
- Take an active part in production problems root cause investigation, identification, and resolution (where necessary)
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Be an active part of performance and capacity testing;
- Optimize reliability monitoring & alerting;
- Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
- Iteratively perform Auditing of performance and reliability vulnerabilities;
- Define and revise Service Level Indicators (SLIs);
- Practice sustainable incident response and blameless postmortems.
We are looking for someone who:
- Has experience with Operating Systems & Networking knowledge;
- Has experience with programming languages such as Python, Java or Go;
- Has experience working with public cloud providers;
- Has experience working with microservices architectures;
- Has experience working with message queuing services and databases;
- Has experience with Configuration Management tools such chef and ansible;
- Has knowledge of Monitoring Solutions like Datadog and Splunk;
- Familiar with CD/CI pipelines comprising Jenkins, Git, Artifactory or others.
Company benefits
The FlexScore® is the result of a rigorous 2-step verification of a company’s flexibility
First we assess the flexibility options Blip provides and then we anonymously survey a statistically significant proportion of their employees to make sure Blip is as flexible as they say they are. Our assessment is based on the six key elements of flexibility: location, hours, autonomy, benefits, role modelling and work-life balance.
We ask the hard questions so you don’t have to.
Working at Blip
Company employees
430
Gender diversity (male:female)
70:30
Office locations
Porto, Portugal
Hiring Countries
Portugal

What employees are saying
"Blip is a pioneer in flexible working practises in Porto, Portugal. The company really cares for the workers' wellbeing and translates that in to effective work."
Delivery Manager at Blip