Book description
Provides readers with the methods, algorithms, and means to perform text mining tasks
This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet (www.perl.org). It covers mining ideas from several perspectives--statistics, data mining, linguistics, and information retrieval--and provides readers with the means to successfully complete text mining tasks on their own.
The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools for analyzing text. It then builds upon this foundation to explore:
- Probability and texts, including the bag-of-words model
- Information retrieval techniques such as the TF-IDF similarity measure
- Concordance lines and corpus linguistics
- Multivariate techniques such as correlation, principal components analysis, and clustering
- Perl modules, German, and permutation tests
Each chapter is devoted to a single key topic, and the author carefully and thoughtfully introduces mathematical concepts as they arise, allowing readers to learn as they go without having to refer to additional books. The inclusion of numerous exercises and worked-out examples further complements the book's student-friendly format.
Practical Text Mining with Perl is ideal as a textbook for undergraduate and graduate courses in text mining and as a reference for a variety of professionals who are interested in extracting information from text documents.
Table of contents
- COVER
- SERIES TITLE
- TITLE
- COPYRIGHT PAGE
- DEDICATION
- LIST OF FIGURES
- LIST OF TABLES
- PREFACE
- ACKNOWLEDGMENTS
- CHAPTER 1: INTRODUCTION
- CHAPTER 2: TEXT PATTERNS
- CHAPTER 3: QUANTITATIVE TEXT SUMMARIES
- CHAPTER 4: PROBABILITY AND TEXT SAMPLING
- CHAPTER 5: APPLYING INFORMATION RETRIEVAL TO TEXT MINING
- CHAPTER 6: CONCORDANCE LINES AND CORPUS LINGUISTICS
- CHAPTER 7: MULTIVARIATE TECHNIQUES WITH TEXT
- CHAPTER 8: TEXT CLUSTERING
- CHAPTER 9: A SAMPLE OF ADDITIONAL TOPICS
- APPENDIX A: OVERVIEW OF PERL FOR TEXT MINING
- APPENDIX B: SUMMARY OF R USED IN THIS BOOK
- REFERENCES
- INDEX
Product information
- Title: Practical Text Mining with Perl
- Author(s):
- Release date: August 2008
- Publisher(s): Wiley
- ISBN: 9780470176436