July 2017
Beginner to intermediate
715 pages
17h 3m
English
DataFrames are a convenient way of representing tabular data in memory. Originally DataFrames came from the R programming language, but they are now common in other languages; for example, in Python the pandas library offers a DataFrame data structure similar to R's.
The usual way of storing data in Java is lists, maps, and other collections of objects. We can think of these collections as tables, but we can assess the data only by row. However, for data science manipulation columns are equally important, and this is where DataFrames are helpful.
For example, they allow you to apply the same function over all values of the same column or look at the distribution of the values.
In Java there are not so many mature implementations, ...