LLM evaluation | Andy Halterman
Andy Halterman
Andy Halterman
Posts
Papers
Code
Teaching
CV
Light
Dark
Automatic
LLM evaluation
Codebook LLM Evaluation Framework: Systematic Approach to Large Language Models in Political Science Text Analysis
New framework for evaluating LLMs on political science codebooks with behavioral testing, zero-shot assessment, and instruction tuning guidance.
Cite
×