Data Engineer
Warszawa, PL, 00-841
About the job:
We are looking for a Junior Data Engineer to join our Data Engineering team, which builds and powers the advertising analytics ecosystem at Allegro. We operate as a center of knowledge and engineering standards, treating the highly technical analytics teams as our primary internal clients. This means our work is a balance between building robust data solutions directly for them, managing infrastructure, and designing our own data / GenAI products.
Our analytics clients drive advanced data initiatives - from building automated user profiling systems to developing their AI agents. We are here to clear away technical blockers, provide scalable engineering foundations, and ensure the entire ecosystem runs seamlessly.
What you will be doing:
-
Pipeline Development & Product Ownership: Design, build, and maintain high-volume data processing pipelines, both as tailored solutions for your analytics clients and as part of your team's core data products.
-
Building AI Agents: Actively develop and orchestrate AI agents to deliver innovative GenAI products and solutions.
-
Data Modeling & Looker: Streamline our data architecture by building and maintaining a clean, accessible semantic layer in Looker.
-
Infrastructure & Orchestration Management: Manage and optimize our data infrastructure, focusing heavily on our orchestration setup using Airflow and GCP Composer.
-
Engineering Excellence: Promote development best practices through educational code reviews, clean code, TDD, and CI/CD workflows.
-
Structured Client Support: Act as a trusted technical partner for our analytics clients when complex architectural challenges arise. To protect our deep-focus development time, we use a structured, rotational on-duty support system so that you can focus on coding without constant interruptions.
Important things for you:
-
Flexible working hours in the hybrid model (4/1) - working hours start between 8:00 a.m. and 09:30 a.m. We also have 30 days of occasional remote work.
- Our team is based in Warsaw.
We are looking for people who:
-
Develop and maintain data ingestion and processing pipelines for large volumes of data using BigQuery
-
Experience working with SQL and Python, utilizing data processing tools like PySpark.
-
Have knowledge of GCP (especially Dataflow and Composer)
-
Are familiar with data orchestration tools like Airflow.
-
Design and streamline data architecture that powers analytical products
-
Help drive the adoption of AI in analytics and unlock new possibilities
-
Use good practices (clean code, code review, TDD, CI/CD)
-
Demonstrate ability to leverage AI/ML tooling to enhance data pipelines and analytical products.
-
Are eager for personal development and keeping their knowledge up to date
-
Possess a great communication skills, positive attitude and team-working skills
-
Know English at B2 level
What's in it for you:
-
Well-located offices (with e.g. fully equipped kitchens, bicycle parking, terraces full of greenery) and excellent work tools (e.g., raised desks, ergonomic chairs, interactive conference rooms).
-
A 16" or 14" MacBook Pro or corresponding Dell with Windows (if you don't like Macs) and all the necessary accessories
-
A wide selection of fringe benefits in a cafeteria plan - you choose what you like (e.g., medical, sports or lunch packages, insurance, purchase vouchers).
-
English classes that we pay for related to the specific nature of your job.
-
A training budget, inter-team tourism (see more here), hackathons, and an internal learning platform where you will find multiple trainings.
-
An additional day off for volunteering, which you can use alone, with a team, or with a larger group of people connected by a common goal.
-
Social events for Allegro people - Spin Kilometers, Family Day, Fat Thursday, Advent of Code, and many other occasions we enjoy.
And that's just the beginning! You can read more about the benefits here.
#goodtobehere means that:
-
You will join a team you can count on - we work with top-class specialists who have knowledge- and experience-sharing in their DNA.
-
You will love our level of autonomy in team organization, the space for continuous development, and the opportunity to try new things. You get to choose which technology solves the problem and you are responsible for what you create.
-
You will value our Developer Experience and the full platform of tools and technologies that make creating software easier. We rely on an internal ecosystem based on self-service and widely used tools such as Kubernetes, Docker, GitHub and GitHub Actions. Thanks to this, you can contribute to Allegro from your very first days on the job.
-
You will create solutions that will be used (and loved!) by your friends, family and millions of our customers.
-
You will meet the Allegro Scale, which starts with over 1000 microservices, an open-source data bus (Hermes) with 300K+ rps, a Service Mesh with 1M+ rps, tens of petabytes of data, and production-used machine learning.
Send us your CV and… see you at Allegro!