Intro to Predictive Coding: Overview & Interpretation of Terminology June 2014 | Page 9

Chapter Two 9 Technologies and What They Contribute to Predictive Coding 1. Latent Semantic Analysis. A mathematical approach that seeks to summarize the meaning of words by looking at the documents that share those words. LSA builds up a mathematical model of how words are related to documents and lets users take advantage of these computed relations to categorize documents. 2. Probabilistic Latent Semantic Analysis. A second mathematical approach that seeks to summarize the meaning of words by looking at the documents that share those words. PLSA builds up a mathematical model of how words are related to documents and lets users take advantage of these computed relations to categorize documents. 3. Support Vector Machine. A mathematical approach that seeks to find a line that separates responsive from nonresponsive documents so that, ideally, all of the responsive documents are on one side of the line and all of the nonresponsive ones are on the other side. 4. Nearest Neighbor Classifier. A classification system that categorizes documents by finding an already classified example that is very similar (near) to the document being considered. It