Databricks engineer - Mumbai
Job Location: Mumbai (only)
Experience: 4 to 7 years
Databricks engineer Roles and Responsibilities:
Responsible for data management activities related to the migration of on-prem sources to cloudsystemsusing Databricks architecture andsolutions. In this role, you willberesponsible forthe development and maintenance of data pipelines and analytics solutions in a cloud-based environment.
Desired Candidate Profile:
-Minimum 5 years of professional experience with working knowledge in a Data and Analytics role with a Global organization
-5 to 8 years of experience in working with Databricks tech stacks
-Experience in leading development of Data and Analytics products, from Requirement Gathering State to Driving User Adoption
-Develop andoptimizeETL processes using Databricks and related tools like Apache Spark
-Design efficient data processing systems and pipelines using Databricks, APIs, and other cloud services
-Candidate with strong data transformation experience on Unity Catalog, Delta Tables, DLT
-Strongproficiencyin writing andoptimizingSQL queries and working with databases
-Ability toacquirespecialized domain knowledge required to be more effective in all work activities
-BI & Data-warehousing concepts area must.
-Design, develop, andmaintainscalable ETL/ELT pipelines usingPySparkonDatabricks.
-Ingest and transform data from multiple structured and unstructured sources including cloud storage (Azure Data Lake, AWS S3, etc.).
-OptimizeSpark jobs for performance and cost-efficiency on the Databricks platform.
-Collaborate with data scientists, analysts, and stakeholders to understand data requirements and deliver high-quality solutions.
-Implement best practices in data engineering, including modular coding, unit testing, and version control (e.g., Git).
-Automate data workflows and schedule jobs usingDatabricks Workflowsor external orchestration tools (e.g., Airflow, Azure Data Factory).
-Ensure data quality, integrity, and governance in all data pipelines.
-Participate in code reviews, performance tuning, and system monitoring.
-Document solutions, processes, and configurations.