
Text Data: Flattening, Filtering, and Chunking
3.5 References
Bird, Steven, Ewan Klein, and Edward Loper. Natural Language Processing with Python [M].
Sebastopol, CA: O’Reilly Media, 2009.
Dunning, Ted. Accurate Methods for the Statistics of Surprise and Coincidence [J]. ACM Journal
of Computational Linguistics, special issue on using large corpora 19:1 (1993): 61–74.
Khan Academy. Hypothesis Testing and p-Values [EB/OL].
https://www.khanacademy.org/math/probability/statistics-inferential/hypothesis-testing/v/hypothesis-testing-and-p-values.
Manning, Christopher D. and Hinrich Schütze. Foundations of Statistical Natural Language
Processing [M]. Cambridge, MA: MIT Press, 1999.