### We are looking for a Site Reliability Engineer with focus monitoring\n\nYou will be a key member of a tight-knit group of talented Engineers who are responsible for keeping ours and our customerโs Kubernetes clusters operational and healthy. Youโll also have a key role in the development of the product itself, working together with our Platform Engineers to deliver the greatest Kubernetes service possible.\n\nGiant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products. We are a diverse, fully remote (since 2014) and experienced team that is growing and spread across Europe - with a headquarters in Cologne.\n\n**Your Job**\n\n* You maintain, operate and upgrade our own and our customerโs Kubernetes clusters.\n* You will design, configure, build, and maintain our core infrastructure, from kernel parameters to the cloud provider templates.\n* You understand how servers and systems work and you tweak their behaviour to your needs.\n* You will be responsible for our monitoring, logging and alerting.\n* You will help resolve incidents on our own and our customerโs clusters.\n* You participate in the on-call support schedule\n* You are a go-to person in case our developers need advice regarding infrastructure.\n* You will automate everything, and you prefer kubernetes controllers and operators over Terraform and Ansible.\n* We (and the majority of our customers) are currently mostly distributed around Europe (around UTC), thus, your main time zone should be somewhere between +/-2UTC to ensure better communication.\n\n**Requirements**\n\n* You must have deep, hands-on knowledge of Kubernetes from both the end-user and the operational side.\n* Youโre comfortable debugging systems at all levels, from kernel fundamentals right up to workloads running on Kubernetes.\n* Youโre happy troubleshooting a wide variety of issues and youโre not afraid to parse thousands of lines of logs in pursuit of an answer.\n* You have good coding skills (preferably Go, but Python or similar is fine as well)\n* You have experience with maintaining infrastructure with code and you know the pros and cons of various automation tools (We use Terraform & Ansible but Chef, Puppet and the lot is also a good start).\n* You have experience with Prometheus, Grafana and Alertmanager, you understand what SLO and SLI means and how to make use of them.\n* You are fluent with Cloud Native Tools running on top of Kubernetes (prometheus, grafana, ingress controller, โฆ) you know how to use them and how to configure them.\n* You automate all the things by writing code. Using bash scripts makes you sad :)\n\n### Why work at Giant Swarm\n\nEvery new team member changes the team. \n\nWe love to learn from each other and people who know things we donโt are highly welcome. And even though we are almost 70 people we aim at putting the individual first when taking decisions, establishing processes, etc. Youโll find that from day one, your work will make a difference and will be highly valued. There are no meaningless tasks and youโll soon realize that the company is full of people who are passionate about their jobs. Our strong culture of failure helps us stay up to date and try new things.\n\nEven though weโve been fully remote since 2014, we still like to meet in person twice a year at our onsites (make sure you check out our Instagram ;) ) as well as at conferences and events (as soon as they start again). \n\nContinuous learning is important to us - we foster this through bi-yearly personal development talks, a budget for training/certifications/coaching as well as regular feedback talks. \n\nBecoming part of Giant Swarm means that, by extension, you also become part of the Cloud Native community. We actively contribute to upstream projects and our quarterly hackathons will give you space to work on out-of-the box projects. Occasionally, when we, as a team, want to fully focus on one project, we scratch all meetings and routines for a certain time to better focus during our hive-sprints.\n\n**Basics:**\n\n* We don't count holiday (our team members take between 25-35 days off on average)\n* Choose your own hard- and software\n* As a company who has almost, if not more, kids than employees, family-friendliness is crucial to us and paid parental leave is a no-brainer.\n* Healthcare compensation\n* Fixed monthly budget for buying cat pictures or your mobile phone contract/ co-working space if you are boring ;)\n* We aim to be fully transparent (finance, salaries, communication, etc.)\n\n\nWe failed in exactly describing our way to approach important company elements that can be described with โbuzzwordsโ such as agile mindset, cross functional teams, self-organization, value of the individual or trust & teamwork. However, we truly care about them, we live them and we constantly iterate on them. Some snippets about how we do this are posted in our blog but by far not all of them. \n\nPro tipp: Ask whenever something is unclear! \n\nSee more jobs at [Giant Swarm](https://www.giantswarm.io/careers) \n\nPlease mention the words **CLUSTER TYPICAL LIAR** when applying to show you read the job post completely (#RMjE2LjczLjIxNi4xMjU=). This is a feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.\n\n \n\n#Salary and compensation\n
$70,000 — $130,000/year\n
\n\n#Location\nRemote (GMT +2/-2)
๐ Please reference you found the job on Remote OK, this helps us get more companies to post here, thanks!
When applying for jobs, you should NEVER have to pay to apply. You should also NEVER have to pay to buy equipment which they then pay you back for later. Also never pay for trainings you have to do. Those are scams! NEVER PAY FOR ANYTHING! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. A good idea is to check the domain name for the site/email and see if it's the actual company's main domain name. Scams in remote work are rampant, be careful! Read more to avoid scams. When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.