Data Engineer (Nellore)

Data Engineer (Nellore)

02 Apr
|
Cactus Communications
|
Nellore

02 Apr

Cactus Communications

Nellore

Overview:

CACTUS is a remote-first organization and we embrace an accelerate from anywhere culture. You may be required to travel to our Mumbai office based on business requirements or for company/team events.

We are looking for a Data Engineer to build and maintain the robust data foundations required for high-impact AI/ML projects. In this role, you will design scalable data pipelines, develop sophisticated ETL processes, and ensure the integrity of datasets sourced from diverse platforms. If you are passionate about optimizing data flow performance and implementing governance practices that align with national standards like the DPDPA, this role offers the chance to play a vital part in shaping secure, data-driven solutions.

Responsibilities:

- Design, implement, and maintain robust data pipelines supporting AI/ML models.
- Develop ETL processes for ingesting data from multiple sources including APIs, databases, and flat files.
- Ensure data integrity, lineage, and compliance with metadata standards defined by NeGD.
- Collaborate with Data Science and AI/ML teams to optimize datasets for model consumption.
- Implement data versioning and quality validation routines.
- Monitor data flow performance and optimize for latency and throughput.
- Apply data governance practices aligned with MeitY’s Responsible AI framework and
- DPDPA (2023).

Requirements:

- B.Tech / M.Tech in Computer Science, Information Systems, or Data Engineering.
- Certification in Big Data / Cloud Data Platforms (AWS, Azure, GCP) preferred.




- 4–7 years in designing and implementing scalable data pipelines and integration frameworks.
- Robust understanding of ETL, data quality, and schema design in distributed systems.
- Experience in integrating structured, semi-structured, and unstructured data for AI/ML projects.

Technical Competencies:

- Programming: Python, SQL, Scala.
- Data Tools: Apache Airflow, Kafka, Spark, NiFi.
- Databases: PostgreSQL, MongoDB, BigQuery, Snowflake.
- ETL & Warehousing: Talend, AWS Glue, Azure Data Factory.
- Data Management: Delta Lake, DataBricks, Hive.
- Cloud Data: AWS (S3, RDS, Lambda), Azure (Data Factory, Storage), GCP (BigQuery, Cloud Storage).
- Tools: Docker, Git, data modelling tools, basic infrastructure automation.
- Streaming: Apache Kafka, AWS Kinesis, real-time data processing
- Best Practices: Data validation, error handling, and pipeline observability.

About Cactus:

Established in 2002, Cactus Communications (cactusglobal.com) is a leading technology company that specializes in expert services and AI-driven products which improve how research gets funded, published, communicated, and discovered. Its flagship brand Editage offers a comprehensive suite of researcher solutions, including expert services and cutting-edge AI products like Mind the Graph, Paperpal, and R Discovery. With offices in Princeton, London, Singapore, Beijing, Shanghai, Seoul, Tokyo, and Mumbai and a global workforce of over 3,000 experts, CACTUS is a pioneer in workplace best practices and has been consistently recognized as a great place to work.

📌 Data Engineer (Nellore)
🏢 Cactus Communications
📍 Nellore

Reply to this offer

Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.

Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: data engineer (nellore) / nellore
Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: data engineer (nellore) / nellore