APPENDIX CSAS Patents in Text Analytics

Computer-Implemented System and Method for Text-Based Document Processing

This is foundational patent work that established the basis for treating textual products in a quantitative fashion to construct factorizations of documents (which would be later extended to produce text topics) as well as document clusters.

Patent number: 6996575

Abstract: A computer-implemented system and method for processing text-based documents. A frequency-of-terms data set is generated for the terms appearing in the documents. Singular value decomposition is performed upon the frequency of terms data set to form projections of the terms and documents into a reduced dimensional subspace. The projections are normalized, and the normalized projections are used to analyze the documents.

Type: Grant

Filed: May 31, 2002

Date of Patent: February 7, 2006

Assignee: SAS Institute Inc.

Inventors: James A. Cox, Oliver M. Dain

Method and System for Responding to User-Input Based on Semantic Evaluations of User-Provided Expressions

This is foundational patent work that extended the text parsing and semantic processing engine that SAS uses to break down incoming text into its part of speech components.

Patent number: 7809724

Abstract: A method for processing user input includes the step of receiving, during a session, via one of a plurality of media gateways, from a user, an expression having a semantic structure. The semantic structure of the expression is evaluated. ...

Get Text as Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.