Data Engineer (PySpark Cloudera) (Pune)

03 Apr

Zorba Consulting India

Pune

03 Apr

Zorba Consulting India

Pune

Job Summary

We are looking for experienced Data Engineers with strong expertise in PySpark and Cloudera Data Platform to design, develop, and optimize scalable data pipelines. The ideal candidate should have hands-on experience with distributed data systems, cloud platforms (AWS), and modern data architecture, along with a strong understanding of data governance and cataloging tools.

Key Responsibilities

- Design, build, and maintain scalable batch and real-time data pipelines using PySpark
- Work with Cloudera Data Platform (CDP) components such as CDE, CDW, Ozone, and Airflow
- Manage and optimize data workflows, ensuring high performance and reliability
- Implement data governance, security, and access control using Apache Ranger
- Develop and maintain data models, Hive Metastore, and large-scale distributed datasets
- Collaborate with cross-functional teams to deliver data solutions for analytics and reporting
- Work with AWS services like EMR, S3, MWAA, Glue Catalog,

and Lake Formation
- Ensure proper data partitioning, bucketing, and optimization using formats like Iceberg and Parquet
- Integrate data cataloging and lineage using Atlan

Required Skills & Qualifications

- 6+ years of experience in Data Engineering
- Solid hands-on experience with PySpark
- Deep understanding of modern data platforms and distributed data systems
- Experience with Cloudera Data Platform (CDP) ecosystem
- Proficiency in SQL and data modeling concepts
- Experience with AWS data services (EMR, S3, MWAA, Glue, Lake Formation)
- Strong knowledge of Hive Metastore and big data architectures
- Experience with file formats (Iceberg, Parquet) and optimization techniques
- Familiarity with data governance, cataloging, and lineage tools (Atlan)

📌 Data Engineer (PySpark Cloudera) (Pune)
🏢 Zorba Consulting India
📍 Pune

Reply to this offer

Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.

MANAGER - HR (Pune)

29 Mar

Synergytech Automation

Pune

29 Mar
Synergytech Automation
Pune

- Work Location-Pune - Schedule: Full-time - Job Posting Date: 27-April-2024 •Qualification: - MBA/MSW in Human Resource Management with 3-4 years of Experience •Responsibilities - Identifyin [...]

DESIGN ENGINEER (BIW) (Pune)

30 Mar

Synergytech Automation

Pune

30 Mar
Synergytech Automation
Pune

- Work Location-Pune - Schedule: Full-time - Job Posting Date: 27-April-2024 •Qualification - Degree/Diploma in Mechanical Engineering with 4-5 years Experience in Designing of material handling s [...]

Opening For Sales Executive Reputed Fmcg Industry Pune, Maharashtra, In K Priya

30 Mar

SEVEN CONSULTANCY

Pune

30 Mar
SEVEN CONSULTANCY
Pune

JOB DETAILS 1.Market Research / Scenario 2.Demand Generation 3.Rates / Quotation 4.Payment Follow up 5.Dispatch Follow up 6.Govt Compliance 7.Handle the sales & marketing for the [...]

Retail Technology - Technical Analyst (Pune)

01 Apr

Barclays

Pune

01 Apr
Barclays
Pune

Be a part of a place where challenges are measured in billions qubits and nanoseconds Build your career in an environment where we re advancing machine learning leveraging blockchains and harnessing F [...]

Data Engineer (PySpark Cloudera) (Pune)

Data Engineer (PySpark Cloudera) (Pune)

Reply to this offer

MANAGER - HR (Pune)

MANAGER - HR (Pune)

DESIGN ENGINEER (BIW) (Pune)

DESIGN ENGINEER (BIW) (Pune)

Subscribe to this job alert:

Enter Your E-mail address to receive the latest job offers for: data engineer (pyspark cloudera) (pune) / pune

Opening For Sales Executive Reputed Fmcg Industry Pune, Maharashtra, In K Priya

Opening For Sales Executive Reputed Fmcg Industry Pune, Maharashtra, In K Priya

Retail Technology - Technical Analyst (Pune)

Retail Technology - Technical Analyst (Pune)

Subscribe to this job alert:

Enter Your E-mail address to receive the latest job offers for: data engineer (pyspark cloudera) (pune) / pune