Bibliography
- Daniel J. Abadi, Peter A. Boncz, and Stavros Harizopoulos. Column‐oriented database systems. Proceedings of the VLDB Endowment, 2 (2): 1664–1665, 2009.
- Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. TensorFlow: A system for large‐scale machine learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pages 265–283, 2016.
- Inside Airbnb. Inside Airbnb, 2020. http://insideairbnb.com/get-the-data.html.
- Airflow. Airflow is a platform created by the community to programmatically author, schedule and monitor workflows, 2020. https://airflow.apache.org/.
- Saleema Amershi, Andrew Begel, Christian Bird, Robert DeLine, Harald Gall, Ece Kamar, Nachiappan Nagappan, Besmira Nushi, and Thomas Zimmermann. Software engineering for machine learning: a case study. In 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE‐SEIP), pages 291–300. IEEE, 2019.
- James Ang and Thompson S.H. Teo. Management issues in data warehousing: insights from the Housing and Development Board. Decision Support Systems, 29 (1): 11–20, July 2000. ISSN 0167‐9236. doi: 10.1016/S0167-9236(99)00085-8. http://dx.doi.org/10.1016/S0167-9236(99)00085-8.
- Apache Flume. Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming event data, 2019. https://flume.apache.org/ ...