Tabulated datasets sometimes contain missing values, and one of the defining features of statistical languages is that they can handle such situations.
In Julia, the DataFrames package has been developed to treat such cases, and this is the subject of this chapter.
The package extends the Julia base by adding three new types:
NA is introduced in order to represent a missing value. This type has only one particular value.
DataArray is a type that emulates Julia's standard Array type, but is able to store missing values in the array.
DataFrame is a type that is capable of representing tabular datasets such as those found in typical databases or spreadsheets. The concept ...