Use of test collections and evaluation measures to assess the
effectiveness of information retrieval systems has its origins in
work dating back to the early 1950s. Across the nearly 60 years
since that work started, use of test collections is a de facto
standard of evaluation. This monograph surveys the research
conducted and explains the methods and measures devised for
evaluation of retrieval systems, including a detailed look at the
use of statistical significance testing in retrieval
experimentation. It reviews more recent examinations of the
validity of the test collection approach and evaluation measures as
well as outlining trends in current research exploiting query logs
and live labs. At its core, the modern-day test collection is
little different from the structures that the pioneering
researchers in the 1950s and 1960s conceived of. This tutorial and
review shows that despite its age, this long-standing evaluation
method is still a highly valued tool for retrieval research.
General
Is the information for this product incomplete, wrong or inappropriate?
Let us know about it.
Does this product have an incorrect or missing image?
Send us a new image.
Is this product missing categories?
Add more categories.
Review This Product
No reviews yet - be the first to create one!