Creative Commons


verified closed
🌏 Worldwide

Tags: data, engineering, engineer, senior

creativecommons.org

This job post is closed and the position is probably filled. Please do not apply.
Creative Commons is building a “front door” to the growing universe of openly licensed and public domain content through CC Search and the CC Catalog API. The Senior Data Engineer reports to the Director of Engineering and is responsible for CC Catalog, the open source catalog that powers those products. This project will unite billions of records for openly licensed and public domain works and metadata, across multiple platforms, diverse media types, and a variety of user communities and partners.

**Diversity & inclusion**

We believe that diverse teams build better organizations and better services. Applications from qualified candidates from all backgrounds, including those from under-represented communities, are very welcome. Creative Commons works openly as part of a global community, guided by collaboratively developed codes of conduct and anti-harassment policies.

**Work environment and location**

Creative Commons is a fully distributed organization with no central office. You must have reasonable mobility for travel to twice-annual all-staff meetings and the CC Global Summit (a total of 3 trips per year). We provide a subsidy towards high-speed broadband access. A laptop or desktop computer and necessary resources are supplied.

# Responsibilities

**Primary responsibilities**

Architect, build, and maintain the existing CC Catalog, including:

* Ingesting content from new and existing sources of CC-licensed and public domain works.
* Scaling the catalog to support billions of records and various media types.
* Implementing resilient, distributed data solutions that operate robustly at web scale.
* Automating data pipelines and workflows.
* Collaborating with the Backend Software Engineer and Front End Engineer to support the smooth operation of the CC Catalog API and CC Search.

Augment and improve the metadata associated with content indexed into the catalog using one or more of the following: machine learning, computer vision, OCR, data analysis, web crawling/scraping.

Build an open source community around the CC Catalog, including:

* Restructuring the code and workflows so that community contributors can identify new sources of content and add new data to the catalog.
* Guiding new contributors and potentially participating in projects such as Google Summer of Code as a mentor.
* Writing blog posts, maintaining documentation, reviewing pull requests, and responding to issues from the community.

Collaborate with other outside communities, companies, and institutions to further Creative Commons’ mission.

# Requirements

* Demonstrated experience building and deploying large-scale data services, including database design and modeling, ETL processing, and performance optimization
* Proficiency with Python
* Proficiency with Apache Spark
* Experience with cloud computing platforms such as AWS
* Experience with Apache Airflow or other workflow management software
* Experience with machine learning or interest in picking it up
* Fluent in English
* Excellent written and verbal communication skills
* Ability to work independently, build good working relationships, and actively communicate, contribute, and speak up in a remote work structure
* Curiosity and a desire to keep learning
* Commitment to consumer privacy and security

Nice to have (but not required):

* Experience with contributing to or maintaining open source software
* Experience with web crawling
* Experience with Docker

Be sure to mention the words **GAUGE TINY PEASANT** when applying to show you read the job post completely. This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

# Salary and compensation

/year

# Location

🌏 Worldwide


# How do you apply?

This job post has been closed by the poster, which means they probably have enough applicants now. Please do not apply.