About Pageshift

Pageshift is a Research Lab committed to pushing the frontier of AI storytelling and creativity. We are envisioning a world in which most entertainment is personalized and AI-generated. Our goal is to build the underlying story engine that powers it all. To do this, we are not afraid to explore new ways and create novel categories of model capability.

About the role

You are expected to process role play data into structured datasets by building extraction and validation pipelines. You will manually audit samples to maintain a high quality bar. You are also expected to build evaluation pipelines that test our model generations for correct role play behavior and surface actionable failure cases.

What we're looking for:
- Passion for role playing
- Cares about data and data quality
- Willing to manually read a bunch of data
- Understanding of role play, world building and character development
- Basic understanding of LLM prompting, or willingness to learn
- Basic understanding of Python and data processing, or willingness to learn

Nice to have:
- Have built some prior datasets
- Have done some theory-based analysis of stories
- Have a deep understanding of the limitations of LLMs when it comes to the understanding of stories and role play

Your responsibilities:
- Develop and validate LLM-driven data processing pipelines for role play
- Build LLM-based validation pipelines for aspects of role play
