LLM evaluation | Andy Halterman

LLM evaluation

Codebook LLM Evaluation Framework: Systematic Approach to Large Language Models in Political Science Text Analysis

New framework for evaluating LLMs on political science codebooks with behavioral testing, zero-shot assessment, and instruction tuning guidance.