About this role
We are looking for a Senior Data Engineer to join our growing data platform team. You will own the design, build, and reliability of our cloud-native data lakehouse โ from raw ingestion through to analytics-ready Gold tables. You will work closely with data analysts, analytics engineers, and product stakeholders to deliver trusted data at speed, while championing data quality and observability as first-class concerns.
This role sits at the intersection of data engineering and platform engineering โ you will be expected to think in architectures, not just pipelines.
What You Will Do
Data Platform & Pipeline Engineering
โธ Design, build, and maintain scalable ETL/ELT pipelines using Azure Data Factory (ADF) and Apache Airflow, processing structured and semi-structured data across the Medallion architecture (Bronze โ Silver โ Gold).
โธ Implement incremental load patterns, change data capture (CDC), and event-driven ingestion to ensure data freshness across the platform.
โธ Build and optimise Snowflake data warehouse objects โ tables, views, dynamic tables, streams, tasks, and stored procedures โ for performance and cost efficiency.
โธ Develop modular, tested dbt models aligned to each Medallion layer, enforcing consistent naming conventions, documentation, and lineage across all transformations.
Data Quality & Observability
โธ Embed automated data validation at every Medallion layer using Elementary (dbt's observability layer), ensuring anomaly detection, freshness checks, and schema drift alerts are in place before data reaches consumers.
โธ Define and enforce data contracts between producers and consumers โ row count checks, null rate thresholds, referential integrity, and value domain validation.
โธ Build and maintain data quality dashboards to give engineering and business stakeholders real-time confidence in platform health.
Azure Cloud Infrastructure
โธ Manage and optimise Azure Data Lake Storage Gen2 (ADLS) โ folder structures, lifecycle policies, access tiers, and partition strategies.
โธ Build and maintain Azure Functions and Azure Logic Apps for lightweight event-driven processing, orchestration triggers, and operational automation.
โธ Manage secrets, credentials, and environment-specific configuration securely using Azure Key Vault โ no hardcoded credentials in pipelines or code.
โธ Contribute to infrastructure-as-code practices for provisioning Azure data services (Terraform or Bicep preferred).
Collaboration & Delivery
โธ Translate ambiguous business requirements into well-defined data models and pipeline designs, working with analysts and stakeholders to validate assumptions before build.
โธ Participate in code reviews, enforce standards, and mentor junior engineers on data engineering best practices.
โธ Support CI/CD adoption for pipeline and dbt model deployment across Dev / Test / Prod environments.
What We Are Looking For
Must-Have
โธ Snowflake: Snowflake
โ Advanced SQL โ window functions, CTEs, recursive queries, query profiling
โ Snowflake-native features: streams, tasks, snowpipe, dynamic tables, row-level security
โ Virtual warehouse tuning and credit cost optimisation
โธ dbt + Elementary: dbt + Elementary
โ Writing, testing, and documenting production dbt models
โ Elementary integration for data observability and anomaly detection
โ dbt incremental strategies, snapshots, and semantic layer
โธ Azure Cloud: Azure Cloud
โ Azure Data Factory โ pipeline authoring, triggers, parameterisation, linked services
โ ADLS Gen2 โ zone/folder design, lifecycle management, Parquet/Delta partitioning
โ Azure Key Vault โ secret management, managed identities
โ Azure Functions / Logic Apps โ event-driven triggers and lightweight automation
โธ Airflow: Airflow
โ DAG authoring, task dependencies, XCom, sensors, and connection management
โ Airflow deployment and monitoring in cloud-hosted environments
โธ Python: Python
โ Data pipeline scripting, PySpark basics, REST API integration
โ Unit testing pipeline logic and transformation functions
โธ Data Quality & Medallion Architecture: Medallion Architecture:
โ Hands-on experience implementing Bronze / Silver / Gold Medallion architecture
โ Data validation checks at each layer โ not just at the final Gold layer
โ Schema evolution handling and SCD Type 2 dimension management
โธ 4+ years of professional data engineering experience with at least 2 years on Azure cloud data platforms.
Nice-to-Have
โธ Exposure to Snowflake Cortex, dbt Semantic Layer, or Boomi Data Hub for AI-assisted data enrichment within pipeline layers.
โธ Experience integrating LLM-based quality checks or AI-assisted anomaly detection into data workflows.
โธ Familiarity with Microsoft Fabric and OneLake as a complementary or future-state platform.
โธ Knowledge of data mesh or data product thinking and how it maps to Medallion layer ownership.
โธ Experience with Terraform or Bicep for Azure infrastructure provisioning.