Job Description :
We are seeking highly analytical and detail-oriented professionals with hands-on experience inRed Teaming, Prompt Evaluation, andAI / LLM Quality Assurance.
The ideal candidate will help us rigorously test and evaluate AI-generated content to identify vulnerabilities, assess risks, and ensure compliance with safety, ethical, and quality standards.
Key Responsibilities :
ConductRed Teaming exercisesto identify adversarial, harmful, or unsafe outputs from large language models (LLMs).
Evaluate and stress-test AI prompts across multiple domains (e.g., finance, healthcare, security) to uncover potential failure modes.
Develop and apply test cases to assessaccuracy, bias, toxicity, hallucinations, and misuse potential in AI-generated responses.
Collaborate with data scientists, safety researchers, and prompt engineers to report risks and suggest mitigations.
Performmanual QAand content validation across model versions, ensuring factual consistency, coherence, and guideline adherence.
Create evaluation frameworks and scoring rubrics for prompt performance and safety compliance.
Document findings, edge cases, and vulnerability reports with high clarity and structure.
Requirements :
Proven experience inAI red teaming, LLM safety testing, or adversarial prompt design.
Familiarity withprompt engineering, NLP tasks, and ethical considerations in generative AI.
Strong background inQuality Assurance, content review, or test case development for AI / ML systems.
Understanding of LLM behaviors, failure modes, and model evaluation metrics.
Excellent critical thinking, pattern recognition, and analytical writing skills.
Ability to work independently, follow detailed evaluation protocols, and meet tight deadlines.
Preferred Qualifications :
Prior work with teams like OpenAI, Anthropic, Google DeepMind, or other LLM safety initiatives.
Experience in risk assessment, red team security testing, or AI policy & governance.
Background in linguistics, psychology, or computational ethics is a plus.
Next Steps
To proceed further in the evaluation process, you will need to complete two assessments :
Assessment Test
Evaluates your linguistic and analytical skills
Link : ?
enc=oUTZVsr / Pnz / 0Xygc2EK32MdtinqnjC9vy8RU3Ha4EOAPwT2LJJQDD68MkY6jszYhhsYYecqmKWja8eKXV801gezikielezikiel
Versant English Proficiency Test
Focuses on assessing your spoken and written English proficiency
AC1 or C2 levelis required to qualify
Once both assessments are successfully completed, you will be eligible for onboarding.
Language test
Action Required : XConnect Registration
You will also receive an invitation to our internal job platform,XConnect.
Please take a few minutes to register and complete your profile.
All project onboarding, communication, and documentation are managed through this platform.
If interested, kindly share your resume at :
Content • Belo Horizonte, Minas Gerais, Brasil