Data science tasks come with a wide variety of computational costs of both space and time. Data wrangling jobs may need the support of large storage, while advanced ML algorithms need high intensity computing speed. Some ML algorithms work better with the support of large local memory (RAM) and cannot perform well with data situated far from the CPU on a hard disk, while others are optimized to perform well with distributed data storage.
Furthermore, the nature of the data may change slowly ...