
Search by job, company or skills
Job Title: Data Engineer
• Create EDW ETL/ELT codes using Teradata SQL, Informatica, Apache Spark, QueryGrid, Trino to perform various transformations and load into Teradata based data warehouse or datamarts
- Performance tuning of highly complex applications to reduce resource
- Create framework using GCFR, shell scripts, BTEQ and shell scripts to automate end to end. Also to integrate with control
- Build in-house CDC framework using Java to process OR logs from SAP(Oracle) and ingestion the data into Teradata. Create framework around this CDC ingestion to make it configurable
•Install and build Master Data Management (MDM) application for various user uploads along with data validations and approval workflow
- Strong hands on experience with the ETL(Informatica Power Center) and Teradata based ETL development, similarly Hadoop ecosystem (Hive, Impala, Spark, Kafka, Iceberg, Ranger, Atlas, Nifi, Flink etc.,), and data pipeline orchestration.
- Create EDW ETL/ELT codes using Teradata SQL, Informatica, apache Spark, QueryGrid, Trino to perform various transformation and load into Teradata based data warehouse or datamarts
- Performance tuning of highly complex applications to reduce resource
- Create framework using GCFR, shell scripts, BTEQ and shell scripts to automate end to end. Also to integrate with control
- Build in-house CDC framework using Java to process OR logs from SAP(Oracle) and ingestion the data into Teradata. Create framework around this CDC ingestion to make it configurable
- Install and build Master Data Management(MDM) application for various user uploads along with data validations and approval workflow Usage
Must Have:
Job ID: 148638325
Skills:
Apache Flink, Adf, Pyspark, Azure Databricks, Data Modeling, Sql, MLops, Apache Kafka, Devops Tools, Talend, Airflow, Data Processing, Git workflows, AWS Kinesis, ETL pipelines
Skills:
Java, Data Modeling, Hadoop, Pyspark, Scala, Data Warehouse, AI ML, Data Mart, Retail Analytics, Python, LLM Model, Data Quality Checks, Building Data Pipelines, Data Catalog Tools, FSLM, Data Mapping, ML Model Operationalization, Feature Pipeline, Cloud Ready Data Solutions, Cloudera CDP, Spark Based Ingestion Framework
Skills:
Java, Bteq, Ranger, Hadoop, Apache Spark, Kafka, Teradata Sql, Informatica, Impala, Hive, ATLAS, Flink, GCFR shell scripting, Iceberg, QueryGrid, Trino, NiFi
Skills:
Java, Distributed Computing, Hadoop, Linux, Scala, Spark, Big Data Technologies, Python, Go, Clickhouse, Flink
Skills:
Informatica Power Center, Bteq, Java, Ranger, Hadoop Ecosystem, Apache Spark, Kafka, Impala, ELT, Hive, Etl, GCFR shell scripts, ATLAS, Flink, Teradata, Iceberg, QueryGrid, Trino, Nifi
We don’t charge any money for job offers