
Search by job, company or skills

Duration: 1 year Contract
Location: Central
Job Description
Design and deliver scalable real-time data and machine learning solutions by building robust ingestion and transformation frameworks across Hadoop ecosystems. Enable end-to-end ML model operationalization and performance optimization, while supporting multi-modal data processing and development of engineering tools and applications.
Key Responsibilities & Skillset
• Design and develop highly scalable, Real time systems using Hadoop ecosystem components(Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink and Nifi)
• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multi model data(image, audio, video, unstructured documents) with both batch and real-time.
• Develop full stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
• Perform performance tuning and optimization of data applications on Hadoop to ensure optimal resource utilization.
• Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit learn, XGBoost), including model deployment.
• Design and develop highly scalable, Real time systems using Hadoop ecosystem components(Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink and Nifi)
• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multi model data(image, audio, video, unstructured documents) with both batch and real-time.
• Develop full stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
• Perform performance tuning and optimization of data applications on Hadoop to ensure optimal resource utilization.
Salary up to 8000 SGD
About CLPS RiDiK
RiDiK is a global technology solutions provider and a subsidiary of CLPS Incorporation (NASDAQ: CLPS), delivering cutting-edge end-to-end services across banking, wealth management, and e-commerce. With deep expertise in AI, cloud, big data, and blockchain, we support clients across Asia, North America, and the Middle East in driving digital transformation and achieving sustainable growth. Operating from regional hubs in 10 countries and backed by a global delivery network, we combine local insight with technical excellence to deliver real, measurable impact. Join RiDiK and be part of an innovative, fast-growing team shaping the future of technology across industries.
We will review applications on a rolling basis until 5 Jun 2026, and early submissions are encouraged. Please note that only shortlisted candidates will be contacted. Thank you for your understanding.
Job ID: 148946167
Skills:
Java, Aws Lambda, Azure Functions, Docker, Azure, Azure Machine Learning, Kubernetes, Python, Machine Learning Algorithms, AWS, data preprocessing, Google Cloud AI Platform, Google Cloud Functions, Amazon SageMaker, Google Cloud AutoML, Google GenAI services, OpenAI, feature engineering, GPT-3
Skills:
Pytorch, Python, GPU workflows, Evaluation, Inference optimisation, Fine-tuning, Model training
Skills:
Tensorflow, Java, Gcp, Pytorch, Azure, Python, AWS, Scikit-learn
Skills:
Tensorflow, Version Control Systems, Pytorch, Data Architecture, Data Warehousing, Python, Cloud Computing, Machine Learning Algorithms, data lakes, SageMaker, model evaluation techniques, DevOps practices, Statistical Modeling, ETL processes
Skills:
BigQuery, Pyspark, Sql, Tensorflow, Pandas, Pytorch, Gcp, Matplotlib, Azure, Python, AWS, agentic AI frameworks, AgentEval, Code Roo, Claude, Code Cursor, Gemini CLI, AutoGen, LangGraph, Cline, Google ADK, Windsurf
We don’t charge any money for job offers