Search by job, company or skills

ANUTTACON PTE. LTD.

LLM Post-Training Researcher

Early Applicant
  • Posted 3 days ago
  • Be among the first 10 applicants
2-5 Years
SGD 18,000 - 32,000 per month

Job Description

Technical Staff, LLM Post-Training


Key Responsibilities:

  • Implement state-of-the-art RLHF (Reinforcement Learning with Human Feedback) or RLAIF (Reinforcement Learning with AI Feedback) algorithms, such as DPO and PPO, to enhance game and role-play characters.
  • Conduct data analysis and data cleaning to improve post-training data quality.
  • Research and apply advanced reasoning techniques, such as chain-of-thought reasoning, to enhance AI agent capabilities.
  • Develop reward model for RLHF.


Qualifications:

  • Master's or PhD in Computer Science, AI, Machine Learning, Linguistics, Statistics, or a related technical field.
  • Proven experience in NLP, LLM research, or machine learning projects.
  • Strong creativity and problem-solving skills.
  • Proficiency in Python and deep learning frameworks such as PyTorch, TensorFlow, or Hugging Face.
  • Excellent programming skills, including familiarity with data structures and algorithms. Competitions such as ACM/ICPC, USACO/NOI/IOI, Top Coder, or Kaggle are a plus.
  • Effective communication and collaboration skills, with a passion for exploring new technologies and driving technological innovation.

More Info

Industry:Other

Function:Ai/Machine Learning

Job Type:Permanent Job

Date Posted: 25/06/2025

Job ID: 120127119

Report Job

Hi , want to stand out? Get your resume crafted by experts.

Last Updated: 25-06-2025 10:06:21 PM
Home Jobs in Singapore LLM Post-Training Researcher