Remote Senior Site Reliability Engineer ML Platforms
Are you passionate about building and maintaining large-scale production systems that support advanced data science and machine learning applications? Do you want to join a team at the heart of NVIDIA's data-driven decision-making culture? If so, we have a great opportunity for you!

NVIDIA is seeking a Senior Site Reliability Engineer (SRE) for the Data Science & ML Platform(s) team. The role involves designing, building, and maintaining services that enable real-time data analytics, streaming, data lakes, observability, and ML/AI training and inferencing. Responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability of the platform, and applying SRE principles to improve production systems and optimize service SLOs. The role also involves collaborating with our customers to plan and implement changes to the existing system while monitoring capacity, latency, and performance.

To succeed in this position, you need a strong background in SRE practices, systems, networking, coding, capacity management, cloud operations, continuous delivery and deployment, and open-source cloud enabling technologies like Kubernetes and OpenStack. You also need a deep understanding of the challenges and standard methodologies of running large-scale distributed systems in production, solving complex issues, automating repetitive tasks, and proactively identifying potential outages. Furthermore, excellent communication and collaboration skills, and a culture of diversity, intellectual curiosity, problem solving, and openness are essential.

As a Senior SRE at NVIDIA, you will have the opportunity to work on innovative technologies that power the future of AI and data science, and be part of a dynamic and supportive team that values learning and growth. The role provides the autonomy to work on meaningful projects with the support and mentorship needed to succeed, and contributes to a culture of blameless postmortems, iterative improvement, and risk-taking. If you are seeking an exciting and rewarding career that makes a difference, we invite you to apply now!

What you'll be doing:

* Develop software solutions to ensure reliability and operability of large-scale systems supporting machine-critical use cases.
* Gain a deep understanding of our system operations, scalability, interactions, and failures to identify improvement opportunities and risks.
* Create tools and automation to reduce operational overhead and eliminate manual tasks.
* Establish frameworks, processes, and standard methodologies to enhance operational maturity, team efficiency, and accelerate innovation.
* Define meaningful and actionable reliability metrics to track and improve system and service reliability.
* Oversee capacity and performance management to facilitate infrastructure scaling across public and private clouds globally.
* Build tools to improve our service observability for faster issue resolution.
* Practice sustainable incident response and blameless postmortems.

What we need to see:

* Minimum of 10 years of experience in SRE, Cloud platforms, or DevOps with large-scale microservices in production environments.
* Master's or Bachelor's degree in Computer Science or Electrical Engineering or CE or equivalent experience.
* Strong understanding of SRE principles, including error budgets, SLOs, and SLAs.
* Proficiency in incident, change, and problem management processes.
* Skilled in problem-solving, root cause analysis, and optimization.
* Experience with streaming data infrastructure services, such as Kafka and Spark.
* Expertise in building and operating large-scale observability platforms for monitoring and logging (e.g., ELK, Prometheus).
* Proficiency in programming languages such as Python, Go, Perl, or Ruby.
* Hands-on experience with scaling distributed systems in public, private, or hybrid cloud environments.
* Experience in deploying, supporting, and supervising services, platforms, and application stacks.

Ways to stand out from the crowd:

* Experience operating large-scale distributed systems with strong SLAs.
* Excellent coding skills in Python and Go and extensive experience in operating data platforms.
* Knowledge of CI/CD systems, such as Jenkins and GitHub Actions.
* Familiarity with Infrastructure as Code (IaC) methodologies and tools.
* Excellent interpersonal skills for identifying and communicating data-driven insights.

NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for exceptional people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 224,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society. Learn more about NVIDIA.

#Salary and compensation
No salary data published by company so we estimated salary based on similar jobs related to Python, DevOps, Cloud, Senior and Engineer jobs that are similar:

$60,000 - $135,000/year

#Benefits

* 401(k)
* Distributed team
* Async
* Vision insurance
* Dental insurance
* Medical insurance
* Unlimited vacation
* Paid time off
* 4 day workweek
* 401k matching
* Company retreats
* Coworking budget
* Learning budget
* Free gym membership
* Mental wellness budget
* Home office budget
* Pay in crypto
* Pseudonymous
* Profit sharing
* Equity compensation
* No whiteboard interview
* No monitoring system
* No politics at work
* We hire old (and young)

#Location
US, CA, Santa Clara
Please reference you found the job on Remote OK, this helps us get more companies to post here, thanks!
When applying for jobs, you should NEVER have to pay to apply. You should also NEVER have to pay to buy equipment which they then pay you back for later. Also never pay for trainings you have to do. Those are scams! NEVER PAY FOR ANYTHING! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. A good idea is to check the domain name for the site/email and see if it's the actual company's main domain name. Scams in remote work are rampant, be careful! Read more to avoid scams. When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.
Restaurant365 is a SaaS company disrupting the restaurant industry! Our cloud-based platform provides a unique, centralized solution for accounting and back-office operations for restaurants. Restaurant365's culture is focused on empowering team members to produce top-notch results while elevating their skills. We're constantly evolving and improving to make sure we are and always will be "Best in Class" ... and we want that for you too!

Restaurant365 is looking for an experienced Data Engineer to join our data warehouse team that enables the flow of information and analytics across the company. The Data Engineer will participate in the engineering of our enterprise data lake, data warehouse, and analytic solutions. This is a key role on a highly visible team that will partner across the organization with business and technical stakeholders to create the objects and data pipelines used for insights, analysis, executive reporting, and machine learning. You will have the exciting opportunity to shape and grow with a high-performing team and the modern data foundation that enables the data-driven culture to fuel the company's growth.

How you'll add value:
* Participate in the overall architecture, engineering, and operations of a modern data warehouse and analytics platforms.
* Design and develop the objects in the Data Lake and EDW that serve as core building blocks for the semantic layer and datasets used for reporting and analytics across the enterprise.
* Develop data pipelines, transformations (ETL/ELT), orchestration, and job controls using repeatable software development processes, quality assurance, release management, and monitoring capabilities.
* Partner with internal business and technology stakeholders to understand their needs and then design, build and monitor pipelines that meet the company's growing business needs.
* Look for opportunities for continuous improvements that automate workflows, reduce manual processes, reduce operational costs, uphold SLAs, and ensure scalability.
* Use an automated observability framework for ensuring the reliability of data quality, data integrity, and master data management.
* Partner closely with peers in Product, Engineering, Enterprise Technology, and InfoSec teams on the shared enterprise needs of a data lake, data warehouse, semantic layer, transformation tools, BI tools, and machine learning.
* Partner closely with peers in Business Intelligence, Data Science, and SMEs in partnering business units to translate analytics and business requirements into SQL and data structures.
* Responsible for ensuring platforms, products, and services are delivered with operational excellence and rigorous adherence to ITSM process and InfoSec policies.
* Adopt and follow sound Agile practices for the delivery of data engineering and analytics solutions.
* Create documentation for reference, process, data products, and data infrastructure.
* Embrace ambiguity and other duties as assigned.

What you'll need to be successful in this role:
* 3-5 years of engineering experience in enterprise data warehousing, data engineering, business intelligence, and delivering analytics solutions
* 1-2 years of SaaS industry experience required
* Deep understanding of current technologies and design patterns for data warehousing, data pipelines, data modeling, analytics, visualization, and machine learning (e.g. Kimball methodology)
* Solid understanding of modern distributed data architectures, data pipelines, API pub/sub services
* Experience engineering for SLA-driven data operations with responsibility for uptime, delivery, consistency, scalability, and continuous improvement of data infrastructure
* Ability to understand and translate business requirements into data/analytic solutions
* Extensive experience with Agile development methodologies
* Prior experience with at least one: Snowflake, BigQuery, Synapse, Databricks, or Redshift
* Highly proficient in both SQL and Python for data manipulation and assembly of Airflow DAGs
* Experience with cloud administration and DevOps best practices on AWS and GCP and/or general cloud architecture best practices, with accountability for cloud cost management
* Strong interpersonal, leadership and communication skills, with the ability to relate technical solutions to business terminology and goals
* Ability to work independently in a remote culture and across many time zones and outsourced partners, likely CT or ET

R365 Team Member Benefits & Compensation
* This position has a salary range of $94K-$130K. The above range represents the expected salary range for this position. The actual salary may vary based upon several factors, including, but not limited to, relevant skills/experience, time in the role, business line, and geographic location. Restaurant365 focuses on equitable pay for our team and aims for transparency with our pay practices.
* Comprehensive medical benefits, 100% paid for employee
* 401k + matching
* Equity Option Grant
* Unlimited PTO + Company holidays
* Wellness initiatives

#BI-Remote

$90,000 - $130,000 a year

R365 is an Equal Opportunity Employer and we encourage all forward-thinkers who embrace change and possess a positive attitude to apply.

#Salary and compensation
No salary data published by company so we estimated salary based on similar jobs related to Design, SaaS, InfoSec, Python, Accounting, DevOps, Cloud, API and Engineer jobs that are similar:

$60,000 - $110,000/year

#Location
Remote
Hi there! Thanks for stopping by =]

Are you actively looking for a new opportunity? Or just checking the market? Well... you might just be in the right place!

We're looking for an SRE/DevOps engineer to join our team in Tbilisi. This team of experts enables our development teams to efficiently build and run great software to excite our customers. Currently the SRE team is operating 5 different product platforms, mostly running on AWS Cloud infrastructure and utilizing a variety of technologies including Kubernetes, Elasticsearch, Kafka, Lambdas, etc.

You'll be focussing on the strategic expansion and scalability improvements of our flagship product, while being part of the global team that keeps the lights on across all platforms.

As an SRE you'll work closely together with engineers and product managers around the world in order to provide tailored and innovative solutions for the market.

Lightspeed is happy to offer relocation for this role.

What you'll be responsible for:

* Initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development
* Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
* Design and architect operational solutions with the specific goal of increasing the standardization, automation, repeatability, cost-efficiency and consistency of operational tasks
* Work with developers and other SREs to design and build scalable, reliable, and cost-efficient cloud infrastructure
* Write and maintain architectural, stakeholder, policy and process documentation
* Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
* Collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs
* Provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems (you will be on call for periods of time)

Must have skills:

* Kubernetes/Docker
* Bash or Python or Ruby or any other backend language (programming skills)
* Cloud experience, preferably AWS or GCP
* Good experience provisioning and managing infrastructures with high availability constraints
* Good communication skills in English and Russian

Nice to have:

* Terraform, Config Management (Puppet/Chef/Ansible/Salt)
* Knowing how to work with Data & Linux systems (Elasticsearch/Kafka/MySQL or any other database)
* CI/CD

What's in it for you

* Lots of autonomy. Flexible work culture, and the possibility of remote work.
* Everyone matters. Day by day, we improve all our products and processes. We have a global team and flexible work culture that provides many opportunities to grow and develop your career.
* We care. We provide MacBooks for our team members so they can take them anywhere and work from any place. We provide training and educational materials to keep everyone updated.
* We have a concierge service. If you need to meet someone at the airport, get delivery, or even change tires on your car, we have a team that will do that for you.
* We are reliable. Lightspeed was founded in 2004 and remains profitable and self-sustaining. Lightspeed is a public company, and we provide RSUs for our employees. We have a head office in Montreal, Canada, and now we are opening offices in Tbilisi and Yerevan. We also have more than 25 offices worldwide, from France to New Zealand.

Who We Are

Powering the businesses that are the backbone of the global economy, Lightspeed's one-stop commerce platform helps merchants innovate to simplify, scale, and provide exceptional customer experiences. Our cloud commerce solution transforms and unifies online and physical operations, multichannel sales, expansion to new locations, global payments, financial solutions, and connection to supplier networks.

Founded in Montréal, Canada in 2005, Lightspeed is dual-listed on the New York Stock Exchange (NYSE: LSPD) and Toronto Stock Exchange (TSX: LSPD). With teams across North America, Europe, and Asia Pacific, the company serves retail, hospitality, and golf businesses in over 100 countries.

#Salary and compensation
No salary data published by company so we estimated salary based on similar jobs related to Design, Python, DevOps, Cloud, Ruby, Senior, Engineer, Linux, Backend and Digital Nomad jobs that are similar:

$70,000 - $120,000/year

#Location
Tbilisi, Tbilisi, Georgia
This job post is closed and the position is probably filled. Please do not apply. Work for Stride Health and want to re-open this job? Use the edit link in the email when you posted the job!
Closed by robot after apply link errored w/ code 404 3 years ago
About the role
The Stride Engineering team is on the lookout for a motivated Cloud Platform Engineer to help take us to the next level. We are a small engineering team that collaborates very closely. You will drive operational excellence by implementing and supporting DevOps practices such as fast feedback cycles using monitoring, increased delivery cadence by improving our Continuous Delivery processes and pipelines, and increasing the reliability and scalability of our systems through automation and infrastructure as code. You'll work closely with other engineers to help them build safe and reliable systems.

Our technology stack consists of NodeJS and Python backend services along with Redis, ElasticSearch, DynamoDB and Postgres data stores. All of this is running in AWS on EC2, containerized in ECS or on Lambda. We use Terraform to manage our infrastructure resources. Ideally, you'll have experience building and managing cloud-ready systems on AWS.

We are building the world's first benefits platform designed specifically for independent contractors and part-time employees. You will be a cornerstone to ensuring that the platform is reliable and scalable. If you're a seasoned engineer who wants to make an impact, the kind of which only a fast-growing startup can provide, we would love for you to join us on this adventure. The nature of work is changing, forever. Be a part of it.

About Us
Stride provides the world's first benefits platform designed specifically for independent workers, helping them save time and money on insurance, taxes and hundreds of thousands of products and services. Since launching in 2014, Stride has helped over 2 million workers save over $2 billion.

Stride partners with the world's leading platforms and employers of non-benefited workers including DoorDash, Uber, Postmates, MasterCard, Amazon, Aon, Keller Williams and others to help them provide their workers access to affordable health and wealth benefits and perks. The company is backed by Venrock, New Enterprise Associates and F-Prime Capital Partners.

We support a flexible work structure with an office in San Francisco, along with multiple hubs across the US. You also have the option to establish your own fully remote office.

More about Us (Press):
Born Digital: "Mastercard, Stride Partner for Gig Worker Benefits"
Fast Company: "Ahead of its IPO, Postmates starts offering no-strings perks for couriers"
The New York Times: "It's Not Just You: Picking a Health Insurance Plan Is Really Hard"

Perks & Benefits
At Stride we work hard, sweat the details, and enjoy life away from the computer, too. We are a diverse group that celebrates and supports our differences.
- Competitive salary
- Equity package
- Health, dental, and vision plans
- Flexible work arrangement with the ability to work remotely and meet face to face with co-workers on a regular basis (as conditions permit)
- $100/month work from home stipend
- $75/month stipend for wellness programs
- Commuter benefits
- Flexible vacation time
- Parental leave
- A culture of learning and development
- And more!

#Salary and compensation
No salary data published by company so we estimated salary based on similar jobs related to Cloud, Engineer, Backend, DevOps, Python and Medical jobs that are similar:

$75,000 - $120,000/year

# How do you apply?

This job post has been closed by the poster, which means they probably have enough applicants now. Please do not apply.
This job post is closed and the position is probably filled. Please do not apply. Work for Sketch and want to re-open this job? Use the edit link in the email when you posted the job!
Over a million designers use Sketch to transform their ideas into incredible products, every day. Would you like to join us and help take the infrastructure that supports this leading design tool to the next level? We're looking to expand our team with a full-time **Site Reliability Engineer**.

At Sketch, we work with a unique technology blend: a cloud platform and macOS and iOS applications. Our cloud stack is based on a mix of serverless and traditional server applications built on Elixir and Go, along with other cloud services like RDS PostgreSQL, S3, SQS, ... Most pieces are deployed on AWS ECS and automated through Terraform; we use Chef for configuration management where it's needed. Our SRE team usually codes with Python whenever we need to write a small program or script.

As a Site Reliability Engineer at Sketch, you will focus on shaping our cloud infrastructure and making sure all the pieces work well together: development environments, metrics processing and observability, security policies, network design, deployment strategies, high availability, etc. You will work closely with backend, frontend, and Mac developers and product managers to guarantee product-focussed, smooth engineering processes.

As an example of one complex project we have worked on lately, we recently migrated our production database from MariaDB to PostgreSQL using streaming replication to minimise the potential downtime and have replicated environments to adapt and test our backend APIs properly.

**About you**

We look for someone who has experience with different stacks (mainly Linux based), technologies and production models and has participated actively in building essential pieces of a cloud platform.

Someone who knows how to conduct a technical operation that potentially affects users and at the same time can code small applications and scripts to automate the platform and also debug problems in other people's code.

You care about security, code quality, scalability, performance, and simplicity. Above all, you seek operational excellence and apply the best engineering practices possible. Not everything that you or your team do can be perfect, but you make sure that you always know the trade-offs. You back your decisions with arguments. You don't care for hype and always try to find the best solution and technology for the job and its context.

**Essentials**

* Professional experience managing Linux-based and cloud-native distributed systems
* Experience coding with high-level programming languages like Python
* Experience with Infrastructure as Code tools such as Terraform, and configuration management tools to automate manual operations
* A good understanding of the HTTP protocol and the behavior of production web services
* Excellent communication skills and good written and spoken English
* You're based in European / African timezones.

**About Sketch**

Sketch is a 100% remote company, and your colleagues are distributed around the globe. Being remote adds great flexibility, and helps us build a more diverse team. We put respect for each other above everything else.

Besides being remote we work asynchronously as often as we can. This means that our team communicates mostly using Slack and GitHub. When we need it, we also have video calls.

Our Technology team has more than 60 people today, split between Mac, Backend, Frontend, Infrastructure and QA. In particular, the Infrastructure team has 6 members. We work in multidisciplinary squads: people from different roles, including members of the Product team, work together on solving problems and delivering functionality to our users.

**We care about your well-being and your professional success, so we offer you**

* Flexibility to organize your own time, no set hours
* As many vacation days as you need
* Whatever training you need to develop in your job
* The laptop you need
* The option to work anywhere in European/African timezones
* Company equity
* Paid family leave
* An annual company meetup

Please mention the words **UPDATE BANANA UNUSUAL** when applying to show you read the job post completely (#RMjE2LjczLjIxNi4xNDY=). This is a feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

#Benefits

* Async

#Location
Worldwide

# How do you apply?

This job post has been closed by the poster, which means they probably have enough applicants now. Please do not apply.
This job post is closed and the position is probably filled. Please do not apply. Work for Netdata Inc and want to re-open this job? Use the edit link in the email when you posted the job!
Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team.\n\n\nAs a Senior SRE/DevOps engineer you will focus on supporting our netdata cloud offerings, augmenting our existing development infrastructure by implementing the automations necessary to catalyze further development of both our open-source project and our commercial offerings and last, but certainly not least, participating in the development of Netdata by making sure it's a first class citizen in various operating environments (e.g. orchestrated containers, IoT devices etc.)\n\nYour work will include building CI/CD pipelines, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms; as well as our community, open-source contributors and team-members with your deep knowledge of systems and troubleshooting skills.\n\n\n**Responsibilities**\n\n* Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.\n* Develop automation tools to catalyse existing development or operational processes.\n* Evaluate, architect and develop technology options for our infrastructure and systems.\n* Troubleshoot, maintain, enhance and augment our platform.\n* Automate tasks wherever possible.\n* Stay up-to-date on emerging technologies.\n\n**Job Requirements**\n\n**Required experience**\n\n* A bachelor's degree in Computer Science or equivalent\n* 3+ years of experience on CI/CD tools (Travis, Gitlab, AWS, Azure, etc) and methodologies\n* Minimum 3 years of Linux systems development and/or administration.\n* Minimum 2 years of experience with at least one 
scripting language, coupled with related automation projects\n* Previous experience with cloud-based technologies and surrounding operational processes\n* Self motivated, conscientious, with a problem-solving, hands-on mindset.\n* Perfectionist where it matters, but also pragmatic, with effective time management skills.\n* Team player, eager to help.\n* Excellent analytical skills.\n* Excellent command of spoken and written English.\n\n**Preferred experience**\n\n* Minimum 2 years of Go, Javascript and C development experience in demanding environments.\n* Expert on Continuous Integration, with long experience in Test Automation\n* 5+ years of shell scripting experience, on at least 2 languages (BASH, python, perl, ruby, etc.)\n* Minimum 2 years of experience with Google Cloud app engine and surrounding operational processes\n* Experience on configuration management and tools to support it (Ansible, puppet, etc.)\n* Experience with monitoring solutions and service assurance in general.\n* A linux, cross-distribution artisan. 
A good amount of knowledge of Windows system administration\n* Open source contributor\n* Agile Development Methodology\n\n\n**Why join Netdata**\n\n* We are a team of industry veterans and senior engineers who prioritize performance and ease of use above all else.\n* We embrace remote work and great work-life balance.\n* We are solving hard problems that affect thousands of organisations worldwide.\n* We are deeply committed to Open Source and love our community.\n* We care deeply about system performance.\n\n**When you join Netdata, you can expect**\n\n* A competitive salary.\n* A generous stock plan.\n* To join a venture-backed startup working with some of the most sophisticated investors in Silicon Valley.\n* To be part of our world-class team and interact with an amazing community.\n* To see first-hand how to grow and succeed in an engineering-first, open source-based company.\n* To find a culture that rewards doers.\n\n*Netdata is an Equal Opportunity Employer. We are committed to providing an inclusive work environment free of discrimination and harassment for everyone, regardless of race, color, religion, national or ethnic origin, sex, age, sexual orientation, gender identity, disability, marital status, military service or other non-merit factor.*\n\n \n\n#Location\nWorldwide
This job post is closed and the position is probably filled. Please do not apply.
\nCrunch.i, part of the YouGov PLC is a market-defining company in the analytics SaaS marketplace. We’re a company on the rise. We’ve built a revolutionary platform that transforms our customers’ ability to drive insight from market research and survey data. We offer a complete survey data analysis platform that allows market researchers, analysts, and marketers to collaborate in a secure, cloud-based environment, using a simple, intuitive drag-and-drop interface to prepare, analyze, visualize and deliver survey data and analysis. Quite simply, Crunch provides the quickest and easiest way for anyone, from CMO to PhD, with zero training, to analyze survey data. Users create tables, charts, graphs and maps. They filter, and slice-and-dice survey data directly in their browser.\n\nOur start-up culture is casual, respectful of each other’s varied backgrounds and lives, and high-energy because of our shared dedication to our product and our mission. We are loyal to each other and our company. We value work/life balance, efficiency, simplicity, and fantastic customer service! Crunch has no offices and fully embraces a 100% remote culture. We have 40 employees spread across 5 continents. Remote work at Crunch is flexible and largely independent, yet highly cooperative.\n\nWe are hiring a DevOps Lead to help expand our platform and operations excellence. We are inviting you to join our small, fully remote team of developers and operators helping make our platform faster, more secure, and more reliable. You will be self-motivated and disciplined in order to work with our fully distributed team.\n\nWe are looking for someone who is a quick study, who is eager to learn and grow with us, and who has experience in DevOps and Agile cultures. At Crunch, we believe in learning together: we recognize that we don’t have all the answers, and we try to ask each other the right questions. 
As Crunch employees are completely distributed, it’s crucial that you can work well independently, and keep yourself motivated and focused.\n\nOur Stack:\n\nWe currently run our in-house production Python code against Redis, MongoDB, and ElasticSearch services. We proxy API requests through NGINX, load balance with ELBs, and deploy our React web application to AWS CloudFront CDN. Our current CI/CD process is built around GitHub, Jenkins, BlueOcean including unit, integration, and end to end tests and automated system deployments. We deploy to auto-scaling Groups using Ansible and Cloud-Init.\n\nIn the future, all or part of our platform may be deployed via DroneCI, Kubernetes, nginx ingress, Helm, and Spinnaker.\n\nWhat you'll do:\n\nAs a Leader:\n\n\n* Manage and lead a team of Cloud Operations Engineers who are tasked with ensuring our uptime guarantees to our customer base.\n\n* Scale the worldwide Cloud Operations Engineering team with the strategic implementation of new processes and tools.\n\n* Hire and ramp exceptional Cloud Operations Engineers.\n\n* Assist in scoping, designing and deploying systems that reduce Mean Time to Resolve for customer incidents.\n\n* Inform executive leadership and escalation management personnel of major outages\n\n* Compile and report KPIs across the full company.\n\n* Work with Sales Engineers to complete pre-sales questionnaires, and to gather customer use metrics.\n\n* Prioritize projects competing for human and computational resources to achieve organizational goals.\n\n\n\n\nAs an Engineer:\n\n\n* Monitor and detect emerging customer-facing incidents on the Crunch platform; assist in their proactive resolution, and work to prevent them from occurring.\n\n* Coordinate and participate in a weekly on-call rotation, where you will handle short term customer incidents (from direct surveillance or through alerts via our Technical Services Engineers).\n\n* Diagnose live incidents, differentiate between platform issues versus 
usage issues across the entire stack; hardware, software, application and network within physical datacenter and cloud-based environments, and take the first steps towards resolution.\n\n* Automate routine monitoring and troubleshooting tasks.\n\n* Cooperate with our product management and engineering organizations by identifying areas for improvement in the management of applications powering the Crunch infrastructure.\n\n* Provide consistent, high-quality feedback and recommendations to our product managers and development teams regarding product defects or recurring performance issues.\n\n* Be the owner of our platform. This includes everything from our cloud provider implementation to how we build, deploy and instrument our systems.\n\n* Drive improvements and advancements to the platform in areas such as container orchestration, service mesh, request/retry strategies.\n\n* Build frameworks and tools to empower safe, developer-led changes, automate the manual steps and provide insight into our complex system.\n\n* Work directly with software engineering and infrastructure leadership to enhance the performance, scalability and observability of resources of multiple applications and ensure that production hand off requirements are met and escalate issues.\n\n* Embed into SRE projects to stay close to the operational workflows and issues.\n\n* Evangelize the adoption of best practices in relation to performance and reliability across the organization.\n\n* Provide a solid operational foundation for building and maintaining successful SRE teams and processes.\n\n* Maintain project and operational workload statistics.\n\n* Promote a healthy and functional work environment.\n\n* Work with Security experts to do periodic penetration testing, and drive resolution for any issues discovered.\n\n* Liaise with IT and Security Team Leads to successfully complete cross-team projects, filling in for these Leads when necessary.\n\n* Administer a large portfolio of SaaS tools 
used throughout the company.\n\n\n\n\nQualifications:\n\n\n* Team Lead experience of an on-call DevOps, SRE, or Cloud Operations team (at least 2 years).\n\n* Experience recruiting, mentoring, and promoting high performing team members.\n\n* Experience being an on-call DevOps, SRE, or Cloud Operations engineer (at least 2 years).\n\n* Proven track record of designing, building, sizing, optimizing, and maintaining cloud infrastructure.\n\n* Proven experience developing software, CI/CD pipelines, automation, and managing production infrastructure in AWS.\n\n* Proven track record of designing, implementing, and maintaining full CI/CD pipelines in a cloud environment (Jenkins experience preferred).\n\n* Experience with containers and container orchestration tools (Docker, Kubernetes, Helm, traefik, Nginx ingress and Spinnaker experience preferred).\n\n* Expertise with Linux system administration (5 yrs) and networking technologies including IPv6.\n\n* Knowledgeable about a wide range of web and internet technologies.\n\n* Knowledge of NoSQL database operations and concepts.\n\n* Experience in monitoring, system performance data collection and analysis, and reporting.\n\n* Capability to write small programs/scripts to solve both short-term systems problems and to automate repetitive workflows (Python and Bash preferred).\n\n* Exceptional English communication and troubleshooting skills.\n\n* A keen interest in learning new things.\n\n\n \n\n#Salary and compensation\n
The company published no salary data, so we estimated the salary from similar jobs related to DevOps, Executive, React, English, Elasticsearch, Cloud, NoSQL, Python, API, Sales, SaaS, Engineer, Nginx and Linux:\n\n
$70,000 — $120,000/year\n
\n\n#Benefits\n
401(k)\n\nDistributed team\n\nAsync\n\nVision insurance\n\nDental insurance\n\nMedical insurance\n\nUnlimited vacation\n\nPaid time off\n\n4 day workweek\n\n401k matching\n\nCompany retreats\n\nCoworking budget\n\nLearning budget\n\nFree gym membership\n\nMental wellness budget\n\nHome office budget\n\nPay in crypto\n\nPseudonymous\n\nProfit sharing\n\nEquity compensation\n\nNo whiteboard interview\n\nNo monitoring system\n\nNo politics at work\n\nWe hire old (and young)\n\n
This job post is closed and the position is probably filled. Please do not apply.
\nNetdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team. \n\nAs a Senior SRE/DevOps engineer you will focus on supporting our netdata cloud offerings, augmenting our existing development infrastructure by implementing the automations necessary to catalyze further development of both our open-source project and our commercial offerings and last, but certainly not least, participating in the development of Netdata by making sure it's a first class citizen in various operating environments (e.g. orchestrated containers, IoT devices etc.)\n\nYour work will include building CI/CD pipelines, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms; as well as our community, open-source contributors and team-members with your deep knowledge of systems and troubleshooting skills.\n\nResponsibilities\n\n\n* Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.\n\n* Develop automation tools to catalyze existing development or operational processes.\n\n* Evaluate, architect and develop technology options for our infrastructure and systems.\n\n* Troubleshoot, maintain, enhance and augment our platform; candidates will be expected to participate in an on-call rota.\n\n* Automate tasks wherever possible.\n\n* Stay up-to-date on emerging technologies.\n\n\n \n\n#Salary and compensation\n
The company published no salary data, so we estimated the salary from similar jobs related to DevOps, Admin, Senior, Engineer, Sys Admin, Cloud and Python:\n\n
$70,000 — $120,000/year\n
\n\n#Benefits\n
401(k)\n\nDistributed team\n\nAsync\n\nVision insurance\n\nDental insurance\n\nMedical insurance\n\nUnlimited vacation\n\nPaid time off\n\n4 day workweek\n\n401k matching\n\nCompany retreats\n\nCoworking budget\n\nLearning budget\n\nFree gym membership\n\nMental wellness budget\n\nHome office budget\n\nPay in crypto\n\nPseudonymous\n\nProfit sharing\n\nEquity compensation\n\nNo whiteboard interview\n\nNo monitoring system\n\nNo politics at work\n\nWe hire old (and young)\n\n
This job post is closed and the position is probably filled. Please do not apply.
Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in JavaScript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team.\n\nAs a Senior SRE/DevOps engineer you will focus on supporting our Netdata cloud offerings and augmenting our existing development infrastructure by implementing the automations necessary to catalyse further development of both our open-source project and our commercial offerings. This includes building upon our existing CI/CD, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms, as well as our community, open-source contributors and team members, with your deep knowledge of systems and troubleshooting skills.\n\n**Why join Netdata**\n* We are a team of industry veterans and senior engineers who prioritize performance and ease of use above all else.\n* We embrace remote work and great work-life balance.\n* We are solving hard problems that affect thousands of organisations worldwide.\n* We are deeply committed to Open Source and love our community.\n* We care deeply about system performance.\n\n**When you join Netdata, you can expect**\n* A competitive salary.\n* A generous stock plan.\n* To join a venture-backed startup working with some of the most sophisticated investors in Silicon Valley.\n* To be part of our world-class team and interact with an amazing community.\n* To see first-hand how to grow and succeed in an engineering-first, open source-based company.\n* To find a culture that rewards doers.\n\n*Netdata is an Equal Opportunity Employer. 
We are committed to providing an inclusive work environment free of discrimination and harassment for everyone, regardless of race, color, religion, national or ethnic origin, sex, age, sexual orientation, gender identity, disability, marital status, military service or other non-merit factor.*\n\n# Responsibilities\n
* Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.\n* Develop automation tools to catalyse existing development or operational processes.\n* Evaluate, architect and develop technology options for our infrastructure and systems.\n* Troubleshoot, maintain, enhance and augment our platform; candidates will be expected to participate in an on-call rota.\n* Automate tasks wherever possible.\n* Stay up-to-date on emerging technologies. \n\n# Requirements\n**Required experience**\n* A bachelor's degree in Computer Science or equivalent\n* 3+ years of experience on CI/CD tools (Travis, Gitlab, AWS, Azure, etc) and methodologies\n* Minimum 3 years of Linux systems development and/or administration.\n* Minimum 2 years of experience with at least one scripting language, coupled with related automation projects\n* Previous experience with cloud-based technologies and surrounding operational processes\n* Self motivated, conscientious, with a problem-solving, hands-on mindset.\n* Perfectionist where it matters, but also pragmatic, with effective time management skills.\n* Team player, eager to help.\n* Excellent analytical skills.\n* Excellent command of spoken and written English.\n \n\n**Preferred experience**\n* Minimum 2 years of Go, Javascript and C development experience in demanding environments.\n* Expert on Continuous Integration, with long experience in Test Automation\n* 5+ years of shell scripting experience, on at least 2 languages (BASH, python, perl, ruby, etc.)\n* Minimum 2 years of experience with Google Cloud app engine and surrounding operational processes\n* Experience on configuration management and tools to support it (Ansible, puppet, etc.)\n* Experience with monitoring solutions and service assurance in general.\n* A linux, cross-distribution artisan. 
A good amount of knowledge of Windows system administration\n* Open source contributor\n* Agile Development Methodology\n\n \n\n#Salary and compensation\n
The company published no salary data, so we estimated the salary from similar jobs related to DevOps, Admin, Senior, Engineer, Sys Admin, Cloud and Python:\n\n
$70,000 — $120,000/year\n
\n\n#Location\nGMT-3 to GMT+5
This job post is closed and the position is probably filled. Please do not apply.
\nAt Canonical it is our mission to make open source software available to people everywhere. We believe the best way to fuel innovation is to give the innovators the technology they need. As a Systems Reliability Engineer (SRE) for the Information Services (IS) team you'll play a key role in driving this mission and helping to define the future of free software.\n\nWhy this job is important\n\nIS supports and maintains all of Canonical’s production services and IS team members use real-life operational experiences to contribute to product improvements. The IS team at Canonical runs the services used by over 60 million Ubuntu users. As an SRE you’ll be in a unique position that will allow you to provide critical feedback to developers by writing code, submitting bugs, and working with others within the company to ensure that Canonical products are as good as they can be. You will also be able to develop and submit fixes and enhancements directly.\n\nWhat you will learn at this job\n\nSREs work closely with development teams to build and maintain the extraordinary infrastructure required to run all of Canonical and Ubuntu’s systems and services. The scope of our responsibility combined with the overall size of our environment means that our SREs face new challenges every day. You can expect to gain hands-on experience in the following areas:\n\n\n* Software development in Python and Go in order to automate repetitive tasks\n\n* Continuous integration and continuous deployment using a combination of open source and Canonical developed tools\n\n* Operating clouds at scale using OpenStack, Ceph, MAAS and Juju\n\n* Deploying, troubleshooting, and optimising services running on both private and public clouds using open source software like Ubuntu, Apache, HAProxy, PostgreSQL, and Squid.\n\n\n\n\nCanonical’s IS team embraces autonomy and to that end has instituted Self Directed (SD) time. 
A portion of your work week is set aside to allow you to work on what you think will most benefit the IS team specifically and Canonical in general.\n\nKey Responsibilities\n\n\nSREs rotate through three roles:\n\n* Maintaining all core services, networks, and infrastructure (including public and private clouds). The ability to work under pressure and demonstrate sound problem solving skills in a fast-paced and complex environment are key here.\n\n* Working directly with a variety of development teams within Canonical in a devops role to test, deploy, monitor and maintain services running on our production clouds. This will require an overlap of development and administration skills, as you help write and review code you will then use to deploy and maintain services using Canonical's cloud products.\n\n* Larger project work, currently focused on large scale cloud deployments and overall process improvements. This role gives SREs the ability to utilize development and architecting skills in a focused manner that is unique to Canonical.\n\n \n\n#Salary and compensation\n
The company published no salary data, so we estimated the salary from similar jobs related to Admin, Engineer, Sys Admin, DevOps, Cloud and Python:\n\n
$70,000 — $120,000/year\n
\n\n#Benefits\n
401(k)\n\nDistributed team\n\nAsync\n\nVision insurance\n\nDental insurance\n\nMedical insurance\n\nUnlimited vacation\n\nPaid time off\n\n4 day workweek\n\n401k matching\n\nCompany retreats\n\nCoworking budget\n\nLearning budget\n\nFree gym membership\n\nMental wellness budget\n\nHome office budget\n\nPay in crypto\n\nPseudonymous\n\nProfit sharing\n\nEquity compensation\n\nNo whiteboard interview\n\nNo monitoring system\n\nNo politics at work\n\nWe hire old (and young)\n\n
This job post is closed and the position is probably filled. Please do not apply.
At Canonical it is our mission to make open source software available to people everywhere. We believe the best way to fuel innovation is to give the innovators the technology they need. As a Systems Reliability Engineer (SRE) for the Information Services (IS) team you'll play a key role in driving this mission and helping to define the future of free software.\n\nSREs work closely with development teams to build and maintain the extraordinary infrastructure required to run all of Canonical and Ubuntu’s systems and services. The scope of our responsibility combined with the overall size of our environment means that our SREs face new challenges every day. From developing automated processes for faster, more reliable deployments to building large and scalable cloud environments, every day at Canonical is an opportunity to learn something new and collaborate with some of the most talented technical minds in the industry.\n\nIS supports and maintains all of Canonical’s production services and IS team members use real-life operational experiences to contribute to product improvements. As an SRE you’ll be in a unique position that will allow you to provide critical feedback to developers by writing code, submitting bugs, and working with others within the company to ensure that Canonical products are as good as they can be. You will also be able to develop and submit fixes and enhancements directly.\n\n \n\nKEY RESPONSIBILITIES & ACCOUNTABILITIES\n\n \n\nSREs rotate through three roles:\n\n1. Maintaining all core services, networks, and infrastructure (including public and private clouds). The ability to work under pressure and demonstrate sound problem solving skills in a fast-paced and complex environment are key here.\n\n2. Working directly with a variety of development teams within Canonical in a devops role to test, deploy, monitor and maintain services running on our production clouds. 
This will require an overlap of development and administration skills, as you help write and review code you will then use to deploy and maintain services using Canonical's cloud products.\n\n3. Larger project work, currently focused on large scale cloud deployments and overall process improvements. This role gives SREs the ability to utilize development and architecting skills in a focused manner that is unique to Canonical.\n\n \n\nREQUIRED SKILLS & EXPERIENCE\n\n \n\n* You have prior experience working in a large, highly available environment\n* You are willing to be flexible and adaptable, with the ability to learn new things quickly.\n* You have strong development skills (Python, Go, Ruby, etc.) with experience writing code.\n* You are heavily focused on automation, preferably with experience in building and maintaining self-service tools.\n* You have an authoritative understanding of, and experience with, the administration of infrastructure services such as DNS, DHCP, SSH, Apache/Nginx, HAProxy, Squid/Varnish, PostgreSQL/MySQL etc.\n* You have practical knowledge of IP networking and routing\n* You have a strong security focus, including knowledge of network, operating system and application level practices\n* You have familiarity with software development and code review practices, including use of DVCS (e.g. 
git or bzr)\n* You have experience deploying, administering and maintaining services in a cloud computing environment\n* You are able to communicate clearly in English, especially using email and IRC\n* You have a college degree in a relevant technical field or equivalent experience.\n* You are self-driven and able to troubleshoot, ask others when appropriate and find answers\n* You are motivated, organised, and willing and able to work well remotely within a distributed team\n* You are able to participate in our weekend on-call rotation, approximately 1 weekend every 18 weeks\n\nDESIRED SKILLS & EXPERIENCE\n\n* You have prior experience administering OpenStack\n* You have familiarity with Juju and MAAS\n* You have familiarity with Ubuntu or Debian\n* You have prior experience with configuration management tools (Puppet, Chef, CFEngine, etc.)\n* You have prior experience maintaining and configuring routers and firewalls (Cisco, iptables)\n\n\nCanonical is an equal opportunity employer.\n\nExtra tags: DNS, DHCP, SSH, Python, Go, Ruby\n\n \n\n#Salary and compensation\n
The company published no salary data, so we estimated the salary from similar jobs related to Python, Ruby, Admin, Engineer, Sys Admin, DevOps, Cloud and Git:\n\n
$70,000 — $120,000/year\n
\n\n#Benefits\n
Distributed team\n\n