Software Tester

8-15 Years

Save

Early Applicant

Job Description

Role: Gen AI Testing SME

Experience - 8-15 yrs

Responsibilities

Gen AI Testing SME

Test Strategy & Planning

Define comprehensive testing strategies tailored for Gen AI models (LLMs, diffusion models, etc.).
Identify key testing dimensions: accuracy, relevance, coherence, bias, toxicity, hallucination, and safety.
Develop test plans for different stages: pre-training, fine-tuning, prompt engineering, and deployment.

Test Case Design & Automation

Design test cases for both deterministic and non-deterministic outputs.
Create benchmark datasets and golden sets for evaluation.
Develop automated testing pipelines using tools like LangChain, PromptLayer, or custom frameworks.

Evaluation Metrics & Analysis

Define and apply appropriate evaluation metrics (e.g., BLEU, ROUGE, perplexity, factual consistency).
Analyze model outputs for hallucinations, bias, and harmful content.
Conduct A/B testing and human-in-the-loop evaluations.

Prompt & Scenario Testing