๐Ÿ‘ฉโ€๐Ÿ’ป Join Remote OK ๐Ÿ‘‹  Log in
General
Remote OK Frontpage ๐Ÿ Remote jobs ๐ŸŒ—  Dark mode ๐Ÿ‘ฉโ€๐Ÿ’ป Hire remote workers ๐Ÿšจ Post a remote job ๐Ÿฑ Compact mode โœ๏ธ Remote work blog new
Top jobs
๐Ÿฆพ  AI Jobs
โฐ Async jobs ๐ŸŒŽ Distributed team ๐Ÿค“ Engineer jobs ๐Ÿ’ผ Executive jobs ๐Ÿ‘ต Senior jobs ๐Ÿค“ Developer jobs ๐Ÿ’ฐ Finance jobs โ™พ๏ธ Sys Admin jobs โ˜•๏ธ JavaScript jobs ๐Ÿ‘ Backend jobs
Companies
๐Ÿšจ Post a remote job ๐Ÿ“ฆ Buy a job bundle ๐Ÿท Ask for a discount Safetywing Health insurance for teams Safetywing Health insurance for nomads
Feeds
๐Ÿ›  Remote Jobs API ๐Ÿชš  RSS feed ๐Ÿช“  JSON feed

Hacker News mode  Hacker News mode

Safe for work mode  Safe for work mode

Other
๐ŸŸข  Uptime (99.99%) ๐Ÿ“ˆ  Pageviews (2.44M/mo) ๐Ÿ“Š Remote work stats new ๐Ÿ‘ท Top remote companies ๐Ÿ’ฐ Highest paying remote jobs ๐Ÿงช State of remote work new
๐ŸŒ  Become a digital nomad
โœจ  Applicant AI
๐Ÿ”ฎ  Web3 Jobs
๐Ÿ“ธ  Photo AI
๐Ÿก  Interior AI
๐Ÿ‡ต๐Ÿ‡น  Get Portuguese residency new
Post a remote job Log in

lithiferous

Remote worker in Krakow, Poland with 6+ years of experience - Last seen ago

Data Engineer who worked in industries: medical, e-commerce, education, retail. Former finance student, who is passionate about automating routine and delivering good data for decision-making.

Details concerning my experience / achievements are presented below.
1010data (May 2021 - June 2022)
โ€ข Moved existing production computation to AWS EMR, Redshift
โ€ข Migrated corporate processing system to Airflow 2 / Google Api v4
โ€ข Supported, optimised distributed ETL workflow in Scala, Spark
โ€ข Code review and workload distribution among peers

First Line Software (Dec 2020 - Nov 2021)
โ€ข Memory & performance bottleneck optimization in Spark, graphX, cats
โ€ข Delivering graph processing algorithms for VAT tax fraud detection
โ€ข International medical product development support for sustainable code
โ€ข End-to-end OMOP CDM medical data processing delivery

Uchi (Apr 2020 โ€“ Sep 2020)
โ€ข Migrated over 25 data pipelines from 4 departments to SparkSQL
โ€ข Engaged in architecture planning, design and implementation for DWH
โ€ข Delivered code optimization for streaming / disributed processing Scala
โ€ข Active monitoring with Ansible, Swarm to support continuous data delivery

OutOfCloud (Nov 2019 โ€“ Apr 2020)
โ€ข Ad-hoc research for customer journey, segmentation, cohort analysis
โ€ข Designed DWH architecture to support Analysts with baseline metrics
โ€ข Responsible for designing ETL solution for communication campaigns through Python libs (bigquery, numpy, pandas, gspread)

X5 Retail Group (Jul 2018 - May 2019)
โ€ข Optimized \& contributed to corporate ETL solution (python, sqlalchemy)
โ€ข Engaged with descriptive report automation via pyspark, matplotlib
โ€ข Hive metastore datamart maintenance for batch processing
โ€ข Sources quality amelioration with ML practices (xgboost, clustering)
โ€ข Ad-hoc research for statistical analysis on time-series data


Skilled in scala python rust sql bash c plus plus hdfs hive spark git avro emr docker airflow postgres kafka clickhouse s3 vertica mongodb redshift bigquery azure cloud databricks 
Fluent in englishfrenchrussian
Preferred annual pay (min) $100,000/year
Last seen 1 year ago
Signed up 1 year ago
Badges ๐Ÿ‘จโ€๐Ÿ’ป Remote worker

๐ŸŽ– Early adopter

Employment

2021 - 2022: Senior Data Engineer @ 1010data

2020 - 2021: Hadoop Scala Developer @ First Line Software

2020 - 2021: Data Engineer @ Uchi.ru

2019 - 2020: Lead Data Analyst @ OutOfCloud

2018 - 2019: Data Engineer @ X5 Retail Group

Education

2015 - 2019: Finance @ ร‰cole de Commerce Rennes

2013 - 2015: International Baccalaureate @ Imatran Yhteislukio

722ms