Nonprofit
Published 2 months ago
AI Safety & Evaluation Lead – LLM Systems
Remote; volunteers can be anywhere in the world
Details
Available Times:
Weekends (daytime, evenings)
Time Commitment:
A few hours per week
Commitment Details:
4–6 hours per week, flexible schedule
Recurrence:
Recurring
Volunteers Needed:
2
Cause Areas:
Community Development, Education, Health & Medicine, International Relations, Volunteering
Participation Requirements:
Background Check
Age Requirement:
21+
Description
This is currently a volunteer position. Contributors will gain hands-on experience building real-world AI systems for public health impact.
About the Role
In this unpaid volunteer role, you will oversee evaluation, safety, and responsible AI governance for LLM-powered healthcare chatbot systems.
Commitment
- 4–6 hours per week
- Remote
- Flexible schedule
Qualifications
- Background in data science, NLP, machine learning, or AI research
- Experience with LLM evaluation, retrieval-augmented generation (RAG), and prompt engineering
- Understanding of model validation and bias mitigation
- Interest in healthcare ethics and responsible AI
Responsibilities
- Design an evaluation framework for chatbot outputs
- Develop medical safety test cases
- Measure hallucination rates and factual accuracy
- Implement guardrails and bias monitoring
- Document governance practices
Associated Location
PO Box 1232, Palo Alto, California, US
