Intro to Predictive Coding: Overview & Interpretation of Terminology June 2014 | Page 30

Glossary

Random sampling – the statistical process of choosing objects randomly, meaning that each object has an equal chance of being selected. Random sampling can be used to train predictive coding systems and to evaluate their efficacy.

Recall – the proportion of responsive documents in the entire collection that have been retrieved.

Relevance feedback – a class of machine learning techniques where users indicate the relevance of items that have been retrieved for them and the machine learns thereby to improve the quality of its recommendations.

Richness – the proportion of responsive documents in a collection.

Sampling – the process of selecting a subset of items from a population and inferring from the characteristics of the sample what the characteristics of the population are likely to be.
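
The definitions of random sampling, richness, and recall above can be illustrated with a short Python sketch. It is only a minimal illustration under stated assumptions: the function names, the 1,000-document collection, and the rule deciding which documents are responsive are all hypothetical and are not taken from any particular predictive coding product.

```python
import random


def simple_random_sample(items, k, seed=None):
    """Draw k items without replacement; each item has an equal chance of selection."""
    rng = random.Random(seed)
    return rng.sample(list(items), k)


def richness(labels):
    """Proportion of responsive documents (True) among the given labels."""
    labels = list(labels)
    if not labels:
        raise ValueError("richness is undefined for an empty collection")
    return sum(1 for is_responsive in labels if is_responsive) / len(labels)


def recall(responsive_ids, retrieved_ids):
    """Proportion of all responsive documents that were retrieved."""
    responsive = set(responsive_ids)
    if not responsive:
        raise ValueError("recall is undefined when nothing is responsive")
    return len(responsive & set(retrieved_ids)) / len(responsive)


# Hypothetical collection: 1,000 document ids, where (by assumption)
# every tenth document is responsive.
collection = list(range(1000))
responsive_ids = {doc_id for doc_id in collection if doc_id % 10 == 0}

# Richness of the whole collection, computed from the known labels.
true_richness = richness(doc_id in responsive_ids for doc_id in collection)

# Sampling: estimate richness from a random sample instead of reviewing
# every document. The estimate varies with the sample drawn.
sample = simple_random_sample(collection, 200, seed=1)
estimated_richness = richness(doc_id in responsive_ids for doc_id in sample)

# Recall for a hypothetical retrieval that returned the first 500 documents.
retrieved_ids = set(range(500))
retrieval_recall = recall(responsive_ids, retrieved_ids)
```

In this sketch the richness of the full collection is 0.1 and the recall of the hypothetical retrieval is 0.5, because half of the responsive documents fall within the first 500 ids. The sample-based richness is only an estimate of the true value and will generally differ from it somewhat, which is the sense in which sampling infers the likely characteristics of the population.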