
KEY RESPONSIBILITIES AND SKILL SET
- Design and develop highly scalable, real-time systems using Hadoop ecosystem components (Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink, and NiFi).
- Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting to ingest multimodal data (image, audio, video, unstructured documents) in both batch and real-time modes.
- Develop full-stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
- Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
- Tune and optimize the performance of data applications on Hadoop to ensure optimal resource utilization.
- Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit-learn, XGBoost), including model deployment.
KEY SKILLS
Job ID: 148950449
Skills:
JAX, PyTorch, Python, SFT, GPU-based training and inference system, Distillation, DPO, LoRA, QLoRA
Skills:
Java, Ranger, Hadoop, Scala, Kafka, scikit-learn, React, NLP, Hive, XGBoost, Spark, Shell scripting, Flask, Keras, Python, Hugging Face, Flink, Ozone, Iceberg, NLQ, Trino, NiFi
Skills:
FFmpeg, TensorFlow, Git, PyTorch, Docker, OpenCV, Python, Weights & Biases, Linux environments, TensorRT, MLflow, ONNX, quantization
Skills:
Ranger, Kafka, scikit-learn, React, NLP, XGBoost, Shell scripting, Flask, Python, Java, Hadoop, Scala, Hive, Spark, Keras, Hugging Face, Flink, NLQ, Spark MLlib, Trino, Ozone, Iceberg, Gen AI, NiFi, Cloudera Machine Learning
Skills:
Python