All the processing power in the world is of no use unless you have data to work with. In this chapter, we’ll look at different techniques to get your data into Databricks. We’ll also take a closer look at file types that you are likely to come across in your data work.
To get a better understanding of how data is stored in Databricks, we’ll investigate its own file system, called the Databricks File System, or DBFS for short. With this knowledge, we’ll look at how we can pull data from the Web, from files, and from data lakes.
Getting data is easier if you have continuous ...