About The Team
We are seeking a highly skilled AI Engineer specializing in Large Language Model to contribute to the development of a scalable and optimized South-East Asia foundation language model. In this role, you will collaborate with multi-region teams, adopting the advanced AI technologies and applying AI models and strategies to enhance business models. Your primary focus will be on following and crafting the advanced AI algorithms for South-East Asia language in the ecommerce domain.
Job Description
- Contribute to the research and implementation of pre-training and alignment algorithms include ultra-large-scale multilingual pre-training technology, Mixture-of-Experts model training, Instruction Pretraining, SFT, and RLHF.
- Contribute to the explanation and safety improvement of AI, especially in trustworthy Large language models.
- Follow the frontier technologies and make comparisons about the advanced technologies to apply in business scenarios.
- Conduct experiments to test the performance of different AI models, identifying areas for improvement and exploring new directions for enhancement.
- Work collaboratively in a team environment, applying expertise in statistics, scripting, and relevant programming languages.
Requirements
- Doctorate degree in Computer Science, Information Technology, Programming & Systems Analysis, or other related disciplines
- Excellent coding skills, data structure and basic algorithm skills, proficiency in Python/Pytorch coding.
- Minimum 1 year of research experience in basic principles and training methods of industry-leading LLM (such as GPT, LLaMA).
- Have research experience in text generation or dialogue systems
- Excellent problem analysis and solving skills, able to deeply solve problems in large model training and application.
- Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.