Data Engineer (Databrick + Pyspark)

capcoยท Data & Analytics
Apply Now โ†—
๐Ÿ“ India; India - Pune

About this role

ย 

Job Title: Data Engineer (PySpark / Databricks)

Experience: 5โ€“9 Years Location: Pune (Hybrid โ€“ Capco Office)

Job Summary

We are looking for a skilled Data Engineer with strong expertise in PySpark, Databricks, and modern data engineering practices. The ideal candidate will have hands-on experience in building scalable data pipelines, working with large datasets, and leveraging cloud-based data platforms.

Key Responsibilities Design, develop, and maintain scalable ETL/ELT data pipelines Work extensively with PySpark and Apache Spark for large-scale data processing Build and manage workflows using Apache Airflow Develop and optimize data solutions on Databricks (Jobs, Delta Lake) Work with cloud-based data lakes (S3 or equivalent) Write efficient and complex SQL queries for data transformation and analysis Run and manage Spark workloads on EMR Serverless or other managed Spark platforms Ensure data quality, reliability, and performance optimization of pipelines Must Have Skills Strong hands-on experience with PySpark and Apache Spark internals Experience with Databricks (Jobs, Delta Lake) Proficiency in Apache Airflow for workflow orchestration Solid experience building ETL/ELT pipelines at scale Strong SQL skills and experience with Data Warehouse (DWH) systems Experience running Spark workloads on EMR Serverless or managed Spark platforms Hands-on experience with cloud data lakes (S3 or equivalent) Good to Have Skills Experience with Delta Lake / Apache Iceberg Exposure to streaming frameworks (Spark Structured Streaming, Kafka) Familiarity with CI/CD pipelines for data engineering workflows Knowledge of data governance, cataloging, and lineage tools

Frequently Asked Questions

Is the salary disclosed for the Data Engineer (Databrick + Pyspark) position at capco?
The salary for this Data Engineer (Databrick + Pyspark) role at capco is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Data Engineer (Databrick + Pyspark) position at capco located?
This Data Engineer (Databrick + Pyspark) role at capco is based in India; India - Pune. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Which team or department does the Data Engineer (Databrick + Pyspark) at capco belong to?
This Data Engineer (Databrick + Pyspark) position is part of the Data & Analytics department at capco. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Data Engineer (Databrick + Pyspark) position at capco?
Click the "Apply Now" button on this page. You will be redirected to capco's official application portal hosted on greenhouse where you can submit your application directly.
When was the Data Engineer (Databrick + Pyspark) job at capco posted?
This Data Engineer (Databrick + Pyspark) position at capco was posted on Apr 14, 2026. Apply as soon as possible โ€” early applications are often reviewed first.
Data Engineer (Databrick + Pyspark)
capco
Apply for this role โ†—

You'll be redirected to capco's official application page on Greenhouse.