Index
A
A Christmas Carol
abbreviations
adverbs
anagrams dictionary
anasquares
apostrophes
arguments
B
backquote
bag-of-words model
Bayesian inference
Bayesian model
bias
bigrams
bioinformatics
block of code
C
Caesar cipher
caret
centroids
classification
cluster means
clustering
clustering vector
coin tossing
collocations
commas
concordances
A Christmas Carol
Die Leiden des jungen Werthers
Enronsent
The Call of the Wild
context
array
scalar
string
corcordances
The Call of the Wild
corpora
corpus
corpus linguistics
corpus linguistics and sampling
corpus
EnronSent
correlation matrix
correlations
correlations and cosines
correlations and covariances
counting
covariance
CPAN
CRAN
crossword puzzles
crwth
cryptanalysis
D
dashes
dendrogram
Dickens
A Christmas Carol
dimension
dimensionless
DNA
dot product
doublets
E
eigenvalues
eigenvectors
end punctuation
Eszett
ETAOIN SHRDLU
events
exclamation points
F
factor analysis
false positives
filehandle
files comma-separated variables
flat file
frequencies
bigram
letter
letters
word
word lengths
words
G
Goethe
Die Leiden des jungen Werthers
H
hangman
hapax legomena
histogram
histograms
hyphens
I
independence
inner product
interpolation
array
inverse document frequency (IDF)
isograms
K
k-means clustering
key word in context (KWIC)
L
lemma
linear algebra
lipograms
logarithms
London
The Call of the Wild
M
Mahalanobis distance
main diagonal
main program
matrices
commuting
matrix factorization
matrix multiplication
matrix diagonal
mean word frequency ...