Sourcegraph is hiring a
Remote ML Engineer IC3
Location\n\n๐ While we are an all-remote company and hire almost anywhere in the world, we have a preference for someone to reside in the following locations for this role. However, if you feel qualified, we welcome you to apply regardless of location. No matter what, working hours must overlap with PST for at least 20 hours/week.\n\nPreferred locations:\n\n\n* Hybrid - San Francisco\n\n\n\nWhy this job is exciting\n\nWe recently created a machine learning team at Sourcegraph, aimed at creating the most powerful coding assistant in the world. Many companies are trying, but Sourcegraph is uniquely differentiated by our rich code intelligence data and powerful code search platform. In the world of prompting LLMs, context is everything, and Sourcegraphโs context is simply the best you can get: IDE-quality, global-scale, and served lightning fast. Our code intelligence, married with modern AI, is already providing a remarkable alpha experience, and you can help us unlock its full potential.\n\nWe are looking for an experienced full-stack ML engineer with demonstrated industry experience in productionizing large-scale ML models in industrial settings. And if you happen to have an entrepreneurial streak, youโre in luck: We have an enterprise distribution pipeline, so whatever you build can be deployed straight to enterprise customers with some of the largest code bases in the world, without all the go-to-market hassle youโd encounter in a startup.\n\nYou will be an engineer at Sourcegraph doing R&D, and pushing the boundaries of what AI can do, as an IC on our ML team. You will have the full power of Sourcegraphโs Code Intelligence Platform at your disposal, and youโll be working on a coding assistant to multiply dev productivity to unprecedented levels.\n\n๐
Within one month, you willโฆ\n\n\n* Start building a trusting relationship with your peers, and learning the company structure.\n\n* Be set up to do local development, and be actively prototyping.\n\n* Dive deep into how AI and ML is already used at Sourcegraph and identify ways to improve moving forward.\n\n* Develop simulated datasets using Gym style frameworks across a number of Cody use cases.\n\n* Experiment with changes to Cody prompts, context sources and evaluate the changes with offline experimentation datasets.\n\n* Ship a substantial new feature to end users.\n\n\n\n\n๐
Within three months, you willโฆ\n\n\n* Building out feature computation, storage, monitoring, analysis and serving systems for features required across our Cody LLM stack\n\n* Be contributing actively to the worldโs best coding assistant.\n\n* Developing distributed training & experiment infrastructure over Code AI datasets, and scaling distributed backend services to reliably support high-QPS low latency use cases.\n\n* Be following all the relevant research, and conducting research of your own.\n\n\n\n\n๐
Within six months, you willโฆ\n\n\n* Be fully ramped up and owning key pieces of the assistant.\n\n* Be ramped up on other relevant parts of the Sourcegraph product.\n\n* Be helping design and build what might become the biggest dev accelerator in 20 years.\n\n* Owning a number of ML systems, and building core data and model metadata systems powering the end-to-end ML lifecycle.\n\n* Be developing a highly scalable, high-QPS inference service providing low latency performance using a mix of CPU and GPU hardware to most efficiently utilize resources.\n\n* Be driving the technical vision and owning a couple of major ML components, including their modeling and ML infra roadmap.\n\n\n\nAbout you\n\nYou are an experienced full-stack ML engineer with demonstrated industry experience in formulating ML solutions, developing end-to-end data orchestration pipelines, deploying large-scale ML models, and experimenting offline and online to drive business impact for Cody users. You want to be part of a world-class team to push the boundaries of AI, with a particular focus on leveraging Sourcegraphโs code intelligence to leapfrog competitors.\n\n\n* You have 5-8 years of industry experience\n\n* You are a backend focused ML engineer who has worked on the entire ML lifecycle\n\n* You have deployed ML models to production to users and have developed feature pipelines\n\n* You understand the nuances of ML for users to move metrics forward\n\n\n\n\nYour working hours overlap with 8am-4pm PT for at least 20 hours per week so we have time to collaborate synchronously when necessary.\nLevel\n\n๐ This job is an IC3. You can read more about our job leveling philosophy in our Handbook.\nCompensation\n\n๐ธ We pay you an above-average salary because we want to hire the best people who are fully focused on helping Sourcegraph succeed, not worried about paying bills. As an open and transparent company that values competitive compensation, our compensation ranges are visible to every single Sourcegraph teammate.\n\nTo determine your salary, we use a number of market and data-driven salary sources, along with your location zone, and target the high-end of the range to ensure weโre always paying above market regardless of where you live in the world. Both U.S. and international locations are divided into one of four zones, determined by the cost of labor index for each area. The starting salary for a successful candidate will be based on level, job-related skills, experience, qualifications, and location zone. Please note that these salary ranges may be adjusted in the future.\n\n๐ฐThe target compensation for this role is $185,000 USD base.\n\nPlease speak with a recruiter for additional information regarding zone locations.\n\n๐ In addition to our cash compensation, we offer equity (because when we succeed as a company, we want you to succeed, too) and generous perks & benefits.\nInterview process\n\nBelow is the interview process you can expect for this role (you can read more about the types of interviews in our Handbook). It may look like a lot of steps, but rest assured that we move quickly and the steps are designed to help you get the information needed to determine if weโre the right fit for youโฆ Interviewing is a two-way street, after all! \n\nWe expect the interview process to take 5.5 hours in total.\n\n๐ Introduction Stage - we have initial conversations to get to know you betterโฆ\n\n\n* [30m] Recruiter Screen\n\n* [45m] Technical Deep Dive\n\n\n\n\n๐งโ๐ป Team Interview Stage - we then delve into your experience in more depth and introduce you to members of the team, including cross-functional partnersโฆ\n\n\n* [60m] ML Depth Interview\n\n* [60m] ML Breadth & ML Systems\n\n* [15m + async] Pairing Exercise\n\n\n\n\n๐ Final Interview Stage - we move you to our final round, where you gain a better understanding of our business and values holisticallyโฆ\n\n\n* [30m] Values\n\n* [30m] Leadership with co-founder \n\n* We check references and conduct your background check\n\n\n\n\nPlease note - you are welcome to request additional conversations with anyone you would like to meet, but didnโt get to meet during the interview process. \n\n#Salary and compensation\n
No salary data published by company so we estimated salary based on similar jobs related to Design, Recruiter, Engineer and Backend jobs that are similar:\n\n
$57,500 — $92,500/year\n
\n\n#Benefits\n
๐ฐ 401(k)\n\n๐ Distributed team\n\nโฐ Async\n\n๐ค Vision insurance\n\n๐ฆท Dental insurance\n\n๐ Medical insurance\n\n๐ Unlimited vacation\n\n๐ Paid time off\n\n๐ 4 day workweek\n\n๐ฐ 401k matching\n\n๐ Company retreats\n\n๐ฌ Coworking budget\n\n๐ Learning budget\n\n๐ช Free gym membership\n\n๐ง Mental wellness budget\n\n๐ฅ Home office budget\n\n๐ฅง Pay in crypto\n\n๐ฅธ Pseudonymous\n\n๐ฐ Profit sharing\n\n๐ฐ Equity compensation\n\nโฌ๏ธ No whiteboard interview\n\n๐ No monitoring system\n\n๐ซ No politics at work\n\n๐
We hire old (and young)\n\n
\n\n#Location\nSan Francisco Bay Area, California, United States