Responsibilities
About the team Seed Global Data is a team focused on producing international data for LLMs. For the training of large models, data is the lifeline of model quality and the Global Data team is working closely with technical, product, and operations teams to ensure effective data production strategies and execution management. As a key member of our LLM Global Data Team, the LLM Training Operations Analyst will play a pivotal role in managing the intricate processes involved in training large language models (LLMs) with diverse coding datasets. This role focuses on overseeing and improving operational workflows, primarily for safety-related projects, ensuring they are delivered with high quality and efficiency. Job Responsibilities - Design and execute data synthesis projects to support model safety training across text, image, audio, and other modalities. Coordinate with internal teams to scope training needs, establish timelines, and deliver scalable, high-quality synthetic safety datasets aligned with evolving model needs. - Systematically analyze evaluation outputs to identify failure modes, behavioral gaps, and underrepresented cases. Translate these findings into targeted safety data synthesis strategies that strengthen model robustness and performance. - Work closely with model development, policy, and evaluation teams to ensure synthesized data meets quality standards and safety alignment. Partner with cross-functional teams to integrate data synthesis efforts into broader model safety training pipelines. - Conduct research on the latest model training methodologies and synthetic data best practices from academia and industry. Proactively identify gaps in existing safety training data strategies and propose innovative approaches to improve scalability, efficiency, and coverage. Please note that this role may involve exposure to potentially harmful or sensitive content, either as a core function, through ad hoc project participation, or via escalated cases. This may include, but is not limited to, text, images, or videos depicting: - Hate speech or harassment - Self-harm or suicide-related content - Violence or cruelty - Child safety - Support resources and resilience training will be provided to support employee well-being.
Qualifications
Minimum Qualifications - Bachelor's degree or higher, preferably in Artificial Intelligence, Political Science, Journalism, International Relations, Regional Studies, or a related discipline. - Exceptional proficiency in both English and Mandarin, with strong written and verbal communication skills to collaborate effectively with internal teams and stakeholders across English- and Mandarin-speaking regions. - Strong data analytical and operational skills, with the ability to interpret qualitative and quantitative data and translate insights into actionable executions. - Demonstrated strong project management skills, with experience leading cross-functional initiatives in dynamic, fast-paced environments. - Creative problem-solving mindset, with the ability to work under ambiguity and leverage tools and technology to improve processes and outputs. Preferred Qualifications - Prior experience in AI safety, Trust & Safety, Risk Consulting, or Risk Management. Experience working at or with AI companies is highly desirable. - Prior prompt engineering and prompt writing skills, with demonstrated ability to design effective prompts for diverse use cases. - Intellectually curious, self-motivated, detail-oriented, and collaborative. - Strong interest in emerging technologies, user behavior, and the societal impact of AI systems, with enthusiasm for applying insights from real-world case studies in a high-impact setting.