Companies Makro PRO Senior Data Engineer (Pipelines & Data Quality) - Data Platform

About the role

Makro PRO · Onsite

We are seeking an experienced Senior Data Engineer to own the pipeline-standardization and data-quality program for the enterprise lakehouse. This role ships compliance gates that block non-compliant deployments, stands up the data-quality framework, builds the dashboards business users trust, and drives measurable reductions in data incidents across the retail data estate.

Key Responsibilities:

  • Design, ship, and operate a pipeline-compliance checker that validates naming, metadata, config schema, DQ-rule declarations, and cluster-policy reference on every new deployment.
  • Deploy a data-quality framework (Great Expectations, Databricks DQ Rules, or equivalent) across new production pipelines; build a domain onboarding template; configure alert routing by severity.
  • Build and publish the Data Quality Dashboard — quality health by domain, source, table; near-real-time refresh; freshness, completeness, accuracy.
  • Establish Source Change Management agreements with key source systems (SLA contracts, change-request process, automated schema-change alerting); map source lineage end-to-end.
  • Lead the migration playbook to bring the legacy pipeline estate to standard; mentor engineers executing migration; own the playbook, not every migration.
  • Drive data-incident reduction through prevention (compliance gate, DQ framework, DCM, lineage), not reactive firefighting; lead incident response and post-mortems for major DQ failures.
  • Partner with platform engineering on Event Stream domain-event schemas and data-product contracts.
  • Author runbooks, code review at senior level, and contribute to engineering culture.

Requirements

  • Bachelor's degree in Computer Science, Data Engineering, or a related discipline.
  • 5+ years designing, building, and operating production data pipelines on a major lakehouse or warehouse (Databricks, Snowflake, BigQuery).
  • Strong PySpark and SQL; understands Spark performance tuning at production scale.
  • Deep experience with data-quality frameworks (Great Expectations, dbt tests, Soda, Monte Carlo) — has defined SLAs, set thresholds, tuned alert noise.
  • Built and operated medallion / multi-layer lakehouse architectures with explicit transformation layers.
  • Solid Git / CI experience for data code; opinions on testing data transformations.
  • Comfortable defining and enforcing standards (naming, partitioning, retention, PII tagging) and reviewing PRs against them.
  • Cloud platform experience (Azure preferred; AWS / GCP transferable).

Preferred Qualifications

  • Streaming experience (Spark Structured Streaming, Delta Live Tables, Flink, Kafka Streams).
  • Data modeling discipline (Kimball, Data Vault 2.0) with clear rationale; Unity Catalog production experience (lineage, tags, RLS).
  • Retail data exposure — POS, inventory, replenishment, loyalty — and BI optimization for Power BI consumption.
  • Vendor certifications such as Databricks Data Engineer Professional or Azure Data Engineer Associate.
Ready to apply to Makro PRO?
Apply to Makro PRO

Similar jobs

Sign up for suggestions tailored to the jobs you open and the searches you save.

Apply now
🤖

Whoa — hold up

JobsRadar was built for real people having a rough time in their job search — not for automated requests. You're clicking way too fast and you're now temporarily blocked.

Come back later. If you're genuinely job hunting, we've got your back — just act like a human.

Catch your next role the second it’s posted.

Create a free account and we’ll watch the boards for you — the instant a job matches your search, it lands in your inbox or Telegram. No digging, no refreshing.

Create free account

Free forever · takes 30 seconds · already have one?

Get the worldwide-remote edge.

Join our Telegram channel for the stuff that helps you land the role — salary benchmarks, the weekly market pulse, and new-feature drops. No spam, just signal.

Join the channel — it's free