Job Description
The salary range for this position is (contract of employment):
mid: 12 300 - 17 600 PLN in gross terms
senior: 16 100 - 23 200 PLN in gross terms
A hybrid work model that incorporates solutions developed by the leader and the team
As a Senior Data Engineer you will play a vital role in leveraging the potential of AI technology in Allegro by building solutions for efficient management and quality assurance of Machine Learning (ML) datasets.
We are looking for people with:
- Degree in Computer Science, Mathematics or another STEM field
- 4+ years hands-on experience in Python and its data processing toolset (pandas, NumPy, Jupyter), text processing libraries (spacy, nltk) and techniques (e.g. regular expressions), interactive reports (Looker Studio, Tableau, Power BI etc.)
- Proficiency in using development tools (profiling, git, issue tracking, etc.)
- DevOps experience (CI/CD, testing, automation etc.)
- Experience in writing advanced and efficient SQL queries
- Experience in using GCP tools for data processing e.g. BigQuery, Dataproc etc.
- Experience in workflow automation solutions, e.g. Airflow
- Understanding of AI related concepts (ML Ops, modeling, evaluation etc.)
- Demonstrated ability to use metrics to back up assumptions and evaluate outcomes
- Pro-activity in seeking, clarifying and understanding information from end users and stakeholders leading up to an understanding and ownership of deliverables
The following are also a plus:
- Experience in building, evaluating or deploying AI-based solutions
- Experience in ML research in R&D or academia. Peer-reviewed publications.
- Experience in building and leading technical teams of data scientists, data engineers, ML-engineers or researchers.
- Experience in working with noSQL databases e.g. MongoDB, neo4j
What we offer
- A hybrid work model that you will agree on with your leader and the team. We have well-located offices (with fully equipped kitchens and bicycle parking facilities) and excellent working tools (height-adjustable desks, interactive conference rooms)
- Annual bonus up to 10% of the annual salary gross (depending on your annual assessment and the company's results)
- A wide selection of fringe benefits in a cafeteria plan – you choose what you like (e.g. medical, sports or lunch packages, insurance, purchase vouchers)
- English classes that we pay for related to the specific nature of your job
- An internal educational platform, MindUp (including training courses on work organization, means of communications, motivation to work and various technologies and subject-matter issues)
- If you want to learn more, check it out
In your daily work you will handle the following tasks:
- Designing and building data processing pipelines enabling efficient and scalable management of ML (machine learning) datasets, as well as automatic execution of repetitive tasks
- Extracting, transforming and propagating data from multiple sources (e.g GCP BigQuery, various APIs) into various target applications e.g. in-house and external annotation platforms, data catalogs, analytics tools, etc.
- Designing and implementing solutions for change management, monitoring and quality control of ML datasets content and structure e.g. automatic test suites for quality control, KPI dashboards, custom metrics and analytics reports
- Working closely with the Machine Learning Research and Product Management teams on R&D initiatives related to ML data sets quality
- Creating and maintaining documentation of all processes related to ML datasets and Quality Evaluation processes management
- Educating non-technical users on the usage of self-service data management tools, e.g. Jupyter notebooks, Airflow jobs
Why is it worth working with us:
- You will be part of an interdisciplinary team of Localization, Product Management and Machine Learning experts working on various AI-based solutions driving the future of Allegro technology
- You will have a key role in the process of ensuring a quality AI-based solutions
- You will have access to AI education programs and hands-on training delivered by leading AI researchers in Poland
This may also be of interest to you
https://podcast.allegro.tech/ - Allegro Tech Podcast
https://ml.allegro.tech/ - Machine Learning in Allegro
Send in your CV and see why it is #dobrzetubyć (#goodtobehere)