APPENDIX CSAS Patents in Text Analytics
Computer-Implemented System and Method for Text-Based Document Processing
This is foundational patent work that established the basis for treating textual products in a quantitative fashion to construct factorizations of documents (which would be later extended to produce text topics) as well as document clusters.
Patent number: 6996575
Abstract: A computer-implemented system and method for processing text-based documents. A frequency-of-terms data set is generated for the terms appearing in the documents. Singular value decomposition is performed upon the frequency of terms data set to form projections of the terms and documents into a reduced dimensional subspace. The projections are normalized, and the normalized projections are used to analyze the documents.
Type: Grant
Filed: May 31, 2002
Date of Patent: February 7, 2006
Assignee: SAS Institute Inc.
Inventors: James A. Cox, Oliver M. Dain
Method and System for Responding to User-Input Based on Semantic Evaluations of User-Provided Expressions
This is foundational patent work that extended the text parsing and semantic processing engine that SAS uses to break down incoming text into its part of speech components.
Patent number: 7809724
Abstract: A method for processing user input includes the step of receiving, during a session, via one of a plurality of media gateways, from a user, an expression having a semantic structure. The semantic structure of the expression is evaluated. ...
Get Text as Data now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.