AmpsTek

Machine Learning Engineer

10-12 Years
  • Posted 20 hours ago

Job Description

• Design and develop highly scalable, real-time systems using Hadoop ecosystem components (Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink, and NiFi).

• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting to ingest multimodal data (image, audio, video, unstructured documents) in both batch and real-time modes.

• Develop full-stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).

• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).

• Tune and optimize the performance of data applications on Hadoop to ensure efficient resource utilization.

• Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit-learn, XGBoost), including model deployment.

Total Experience

10+ yrs

Relevant Experience

6+ yrs

Mandatory skills

• Hadoop ecosystem (Spark, Hive, Kafka, Flink, NiFi, Iceberg, Trino)

• Java, Python, Spark (batch & real-time processing)

• Data ingestion & transformation frameworks

• Performance tuning on Hadoop platforms

• Shell scripting

• Real-time data processing systems

• ML model operationalization (CML / Spark ML)

Job ID: 150686333

Similar Jobs

Singapore, Ubi

Skills:

Java, Ranger, Hadoop, Kafka, React, Hive, XGBoost, Spark, Shell scripting, Flask, Python, MLlib, Flink, Ozone, Iceberg, Trino, NiFi, Cloudera Machine Learning

Singapore

Skills:

Java, Apache Flink, Hadoop Ecosystem, Apache Spark, Apache NiFi, shell scripting, XGBoost, Apache Kafka, Python, Apache Hive, Apache Iceberg, scikit-learn, Spark MLlib

Singapore

Skills:

Java, Hadoop, Scala, DevOps, Spark, Python, ETL, ML algorithms, data management technology, Relational Databases, cloud-based AI platforms, non-relational databases

Singapore

Skills:

PyTorch, TensorFlow, Python, World Models, Sequential Modelling

Singapore, Kallang

Skills:

Kafka, SQL, TensorFlow, GCP, PyTorch, Docker, Spark, Databricks, Azure, Kubernetes, Python, AWS, NoSQL databases, scikit-learn