\nWe are looking for people willing to work a 10am - 7pm PST schedule or later. This role can be fully remote. \n\nAbout the role:\n\nThe High Performance Computing Operations team is responsible for the day-to-day provisioning, management and uptime of CoreWeaveโs ever-expanding fleet of server nodes. Playing a central role in CoreWeaveโs growth strategy, this team is on the front line for configuration, updates and remote troubleshooting of our highest tier of supercomputing clusters and their networking, delivery platforms and tools dependencies. You will be in a daily battle with the forces of entropy to maximize the number of nodes CoreWeave can deliver to customers.\n\nWe are seeking curious, creative and persistent problem solvers to join our HPC Operations team to help us drive batches of server nodes through our provisioning and validation processes while efficiently and effectively troubleshooting node or cluster problems as they arise. This individual will join a team of committed engineers working to deploy nodes as fast as they can be racked and turned on. \n\nKey Responsibilities:\n\n\n* Install, configure, and maintain large-scale high-performance supercomputing clusters running state-of-the-art GPUs\n\n* Troubleshoot hardware and software issues; escalate and coordinate as needed with data center, network and platform teams to drive resolution\n\n* Monitor and analyze system performance and take appropriate remediation actions for cloud health\n\n* Approach your work with flexibility and optimism anticipating shifting business and technical priorities\n\n* Create and maintain documentation of team processes, knowledge and best practices for system management\n\n* Think critically about your day-to-day work and work collaboratively to improve team processes and efficiency\n\n\n\n\nSuccessful candidates typically share the following skills and experience:\n\n\n* 2 or more years of experience troubleshooting or administering data center or on-prem infrastructure (servers, storage, network or a mix)\n\n* Strong understanding of Linux system administration and networking concepts\n\n* Ability to troubleshoot hardware and software issues and perform system maintenance tasks consistently and reliably\n\n\n\n\nIdeal candidates may also have experience in one or more of these:\n\n\n* Software development or scripting languages (bash, python, powershell, etc)\n\n* Grafana, prometheus, promsql queries or similar observability platforms\n\n* Data center environments including server racks, HVAC systems, fiber trays\n\n* Kubernetes administration\n\n\n\n\nOur compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $80,000/year in our lowest geographic market up to $110,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.\n\n \n\n#Salary and compensation\n
No salary data published by company so we estimated salary based on similar jobs related to Cloud, Node, Engineer and Linux jobs that are similar:\n\n
$52,500 — $95,000/year\n
\n\n#Benefits\n
๐ฐ 401(k)\n\n๐ Distributed team\n\nโฐ Async\n\n๐ค Vision insurance\n\n๐ฆท Dental insurance\n\n๐ Medical insurance\n\n๐ Unlimited vacation\n\n๐ Paid time off\n\n๐ 4 day workweek\n\n๐ฐ 401k matching\n\n๐ Company retreats\n\n๐ฌ Coworking budget\n\n๐ Learning budget\n\n๐ช Free gym membership\n\n๐ง Mental wellness budget\n\n๐ฅ Home office budget\n\n๐ฅง Pay in crypto\n\n๐ฅธ Pseudonymous\n\n๐ฐ Profit sharing\n\n๐ฐ Equity compensation\n\nโฌ๏ธ No whiteboard interview\n\n๐ No monitoring system\n\n๐ซ No politics at work\n\n๐ We hire old (and young)\n\n
\n\n#Location\nLas Vegas, Nevada, United States
๐ Please reference you found the job on Remote OK, this helps us get more companies to post here, thanks!
When applying for jobs, you should NEVER have to pay to apply. You should also NEVER have to pay to buy equipment which they then pay you back for later. Also never pay for trainings you have to do. Those are scams! NEVER PAY FOR ANYTHING! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. A good idea is to check the domain name for the site/email and see if it's the actual company's main domain name. Scams in remote work are rampant, be careful! Read more to avoid scams. When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.