Data Engineer who worked in industries: medical, e-commerce, education, retail. Former finance student, who is passionate about automating routine and delivering good data for decision-making.
Details concerning my experience / achievements are presented below.
1010data (May 2021 - June 2022)
โข Moved existing production computation to AWS EMR, Redshift
โข Migrated corporate processing system to Airflow 2 / Google Api v4
โข Supported, optimised distributed ETL workflow in Scala, Spark
โข Code review and workload distribution among peers
First Line Software (Dec 2020 - Nov 2021)
โข Memory & performance bottleneck optimization in Spark, graphX, cats
โข Delivering graph processing algorithms for VAT tax fraud detection
โข International medical product development support for sustainable code
โข End-to-end OMOP CDM medical data processing delivery
Uchi (Apr 2020 โ Sep 2020)
โข Migrated over 25 data pipelines from 4 departments to SparkSQL
โข Engaged in architecture planning, design and implementation for DWH
โข Delivered code optimization for streaming / disributed processing Scala
โข Active monitoring with Ansible, Swarm to support continuous data delivery
OutOfCloud (Nov 2019 โ Apr 2020)
โข Ad-hoc research for customer journey, segmentation, cohort analysis
โข Designed DWH architecture to support Analysts with baseline metrics
โข Responsible for designing ETL solution for communication campaigns through Python libs (bigquery, numpy, pandas, gspread)
X5 Retail Group (Jul 2018 - May 2019)
โข Optimized \& contributed to corporate ETL solution (python, sqlalchemy)
โข Engaged with descriptive report automation via pyspark, matplotlib
โข Hive metastore datamart maintenance for batch processing
โข Sources quality amelioration with ML practices (xgboost, clustering)
โข Ad-hoc research for statistical analysis on time-series data
๐ Nationality | ๐ต๐ฑ Poland |
๐ก Residency | ๐ต๐ฑ Poland |
๐ Location | ๐ต๐ฑ Poland |
Remote OK | rok.co/@lithiferous |
GitHub | lithiferous |
https://www.linkedin.com/in/degterev/ | |
Skilled in | scala python rust sql bash c plus plus hdfs hive spark git avro emr docker airflow postgres kafka clickhouse s3 vertica mongodb redshift bigquery azure cloud databricks |
Fluent in | englishfrenchrussian |
Preferred annual pay (min) | $100,000/year |
Last seen | 1 year ago |
Signed up | 1 year ago |
Badges |
๐จโ๐ป Remote worker ๐ Early adopter |
2021 - 2022: Senior Data Engineer @ 1010data
2020 - 2021: Hadoop Scala Developer @ First Line Software
2020 - 2021: Data Engineer @ Uchi.ru
2019 - 2020: Lead Data Analyst @ OutOfCloud
2018 - 2019: Data Engineer @ X5 Retail Group
2015 - 2019: Finance @ รcole de Commerce Rennes
2013 - 2015: International Baccalaureate @ Imatran Yhteislukio