Data Engineer (Performance Tuning, ETL, Hive, Cloudera/Spark)

Exasoft

Singapore

7-9 Years

This job is no longer accepting applications

Posted a month ago

Job Description

Senior Big Data Performance Engineer

Role Overview

We are seeking a highly experienced Senior Big Data Performance Engineer with 7+ years of hands-on expertise in large-scale data platforms, Spark/Hadoop tuning, and enterprise-grade data pipelines. The role is responsible for conducting platform-wide performance analysis, optimizing AML data pipelines, and ensuring the stability, scalability, and efficiency of Cloudera-based applications and integrations.

Technology Stack: Development & Tuning

Apache Spark (Scala, Python)

Spark performance optimization & troubleshooting

Hive query and table optimization

HBase data modeling & tuning

Scope of Work

Perform application-aware performance analysis for AML data pipelines.

Analyze and optimize pipelines running on Cloudera, Spark, Hive, HBase, JBoss, and MariaDB.

Tune Spark, Hive, and HBase jobs, queries, and tables for performance and scalability.

Review ETL job design and data pipeline architecture for efficiency, resilience, and scalability.

Identify misconfigurations or misuse causing performance degradation or recurring issues.

Assess and recommend tuning for Cloudera, Phoenix, YARN, and related platform configurations.

Support large-scale data ingestion, transformation, and downstream data delivery.

Participate in joint Root Cause Analysis (RCA) with internal and vendor teams.

Support performance benchmarking and validation of tuning recommendations.

Support application and system integrations with Cloudera and AML platforms.

Build and refine data validation dashboards and data growth monitoring tools.

Troubleshoot slow-running jobs, data skew, shuffle issues, garbage collection (GC) overhead, YARN resource pressure, and cluster constraints.

Improve performance of API/UI components interacting with Big Data pipelines.

Deliverables

Formal assessment report covering pipeline, application, and platform interaction analysis.

Identified performance bottlenecks and inefficiencies.

Documented risks, constraints, and recurring issue patterns.

Tuning and optimization recommendations for Spark, Hive, HBase, and Cloudera/Phoenix configurations.

Corrective and preventive action plan categorized into immediate remediation, configuration changes, and long-term improvements.

Performance benchmarking and validation results post-tuning.

RCA documentation for recurring application-impacting issues.

Governance-ready documentation aligned with CRQ and JIRA standards.

About Company

Exasoft

Job Source: www.linkedin.com

Job ID: 141756863

Last Updated: 26-02-2026 11:16:52 AM
