What is data science?What is data science?Where data comes fromWorking with data at scaleMaking data tell its storyData scientistsThe SMAQ stack for big dataMapReduceHadoop MapReduceOther implementationsStorageHadoop Distributed File SystemHBase, the Hadoop DatabaseHiveCassandra and HypertableNoSQL database implementations of MapReduceIntegration with SQL databasesIntegration with streaming data sourcesCommercial SMAQ solutionsQueryPigHiveCascading, the API ApproachSearch with SolrConclusionScraping, cleaning, and selling big dataData hand toolsHadoop: What it is, how it works, and what it can doFour free data tools for journalists (and snoops)WHOISBlekkobit.lyCompeteThe quiet rise of machine learningWhere the semantic web stumbled, linked data will succeedSocial data is an oracle waiting for a questionThe challenges of streaming real-time data