Big Data Tools and Pipelines
Ideas and resources related to data tools.

article
In defense of the pie chart

article
Embeddable data transformation for real-time streams

article
Best practices for data lakes

article
Why your next analytics project should be in procurement

article
BayesDB: Data science is a communication problem

article
Get started with SQL: Plan and design a database

article
Architecting Druid for failure

article
Lego-powered Kafka training

article
Multitenancy on Hadoop: Is your cluster half full?

article
Deploying a hybrid Hadoop architecture

article
Apache Cassandra for analytics: A performance and storage analysis

article
NoSQL technologies are built to solve business problems, not just “wrangle big data”

article
Clustering geolocated data using Spark and DBSCAN

article
Lessons on Hadoop application architectures

article
Best kept machine learning secret in security

article