Position:
Data Engineer
Experience:
5–8 Years
Location:
Noida, Hyderabad
Mode:
Work from Office (5 Days Working)
Notice Period:
Immediate
Position Overview
We are seeking a highly skilled Data Engineer with proven experience in designing and
implementing scalable data platforms using Databricks. The ideal candidate will bring deep
expertise in building modern data architecture, integrating multiple data sources, and enabling
analytics and machine learning workflows. This role requires strong technical proficiency,
problem-solving ability, and collaboration across cross-functional data and product teams.
Key Responsibilities
-
Design, develop, and maintain robust data pipelines and architectures on Databricks.
-
Build efficient ETL/ELT processes to ingest data from SQL Server, MongoDB, and
InfluxDB into Databricks Delta Lake.
-
Implement real-time and batch data ingestion strategies using tools such as Kafka,
Azure Event Hubs, or equivalent.
-
Optimize data storage, query performance, and computing efficiency for scalability
and cost-effectiveness.
-
Develop data models that enable BI reporting, advanced analytics, and machine
learning use cases.
-
Collaborate with Data Scientists, Analysts, and Product Teams to define data
requirements and ensure availability of high-quality datasets.
-
Implement data governance, lineage, and security best practices across the platform.
-
Monitor, troubleshoot, and enhance data workflows for performance and reliability.
Required Skills & Qualifications
-
Proven hands-on experience in Databricks, including Delta Lake, Spark, PySpark, and
SQL.
-
Strong expertise in data integration from multiple sources — SQL Server, MongoDB,
InfluxDB, and others.
-
Experience in ETL/ELT pipeline design and orchestration using Airflow, Azure Data
Factory, or similar tools.
-
Solid understanding of data modeling, data warehousing concepts, and query
optimization techniques.
-
Experience in real-time data streaming using Kafka, Azure Event Hubs, or equivalent
technologies.
-
Strong programming skills in Python and SQL for data processing and automation.
-
Proficiency in Azure Cloud Services, especially for data engineering and storage.
-
Familiarity with Change Data Capture (CDC) methodologies.
-
Strong analytical mindset, attention to detail, and ability to work in fast-paced, agile
environments.
Preferred Qualifications
-
Experience handling time-series data or using InfluxDB for data ingestion and analysis.
-
Exposure to machine learning pipelines or ML model deployment within Databricks.
-
Knowledge of data governance frameworks such as Unity Catalog, Azure Purview,
or equivalent.
-
Understanding of CI/CD pipelines for data workflows and infrastructure automation.
What We Offer
-
Opportunity to work on cutting-edge data engineering projects using modern cloud and
analytics technologies.
-
A collaborative environment with data-driven innovation at its core.
-
Continuous learning opportunities in cloud, AI, and advanced analytics domains.
-
Competitive compensation and growth opportunities.