Introducing FaithBench: Measuring What AI Gets Wrong About Theology
AI models score 90%+ on general knowledge benchmarks but collapse on theological questions. FaithBench is the first dedicated benchmark framework for evaluating AI theological accuracy across Christian traditions.