Skip to Main Content
Working with Text
book

Working with Text

by Emma Tonkin, Gregory J.L Tourte
July 2016
Intermediate to advanced content levelIntermediate to advanced
344 pages
10h 11m
English
Chandos Publishing
Content preview from Working with Text
Appendix B

Databases and Vocabularies

B.1 Sample Data Sets

Various data sets have been made available under text mining-friendly licences. Of these, a proportion have become a popular choice of data set for testing approaches to text and data mining. One benefit of using standard data sets is the ability to benchmark directly against other approaches to the same problem, which permits standardisation in evaluation.

In this section we list a number of standard data sets and competit ions, referencing the areas in which they are primarily used. This list is not exhaustive, and is intended to serve as an introduction. We indicate the availability and licencing of each data set where relevant. Although data sets are often used for more than one ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Communicate with Teams More Effectively

Communicate with Teams More Effectively

Charles Humble

Publisher Resources

ISBN: 9781780634302