Evaluating Generative AI for Legal Research: A Benchmarking Project
LLRX
MAY 29, 2024
So to test the abilities of these products, librarians can use prompt engineering to figure out how to get desired results (controlling statutes, key cases, drafts of a memo, etc.). It is difficult to test Large-Language Models (LLMs) without back-end access to run evaluations.
Let's personalize your content