From infrastructure to tools to training, Ben Lorica looks at what’s ahead for data.
The O’Reilly Data Show Podcast: Haoyuan Li on accelerating analytic workloads, and innovation in data and AI in China.
The O’Reilly Data Show Podcast: Ben Lorica looks ahead at what we can expect in 2019 in the big data landscape.
We need to do more than automate model building with autoML; we need to automate tasks at every stage of the data pipeline.
The O’Reilly Data Show Podcast: Alex Wong on building human-in-the-loop automation solutions for enterprise machine learning.
The O’Reilly Data Show Podcast: Vitaly Gordon on the rise of automation tools in data science.
Considerations for a world where ML models are becoming mission critical.
The O’Reilly Data Show Podcast: Francesca Lazzeri and Jaya Mathew on digital transformation, culture and organization, and the team data science process.
The O’Reilly Data Show Podcast: Alon Kaufman on the interplay between machine learning, encryption, and security.
Create a coherent BI strategy that aligns data collection and analytics with the general business strategy.
The O’Reilly Data Show Podcast: Jacob Ward on the interplay between psychology, decision-making, and AI systems.
The O’Reilly Data Show Podcast: Sharad Goel and Sam Corbett-Davies on the limitations of popular mathematical formalizations of fairness.
Getting DataOps right is crucial to your late-stage big data projects.
Julia Angwin discusses what she's learned about forgiveness from her series of articles on algorithmic accountability and the lessons we all need to learn for the coming AI future.
Amber Case covers methods product designers and managers can use to improve interactions through an understanding of sound design.
Chad Jennings explains how Geotab's smart city application helps city planners understand traffic and predict locations of unsafe driving.
Ben Sharma shares how the best organizations immunize themselves against the plague of static data and rigid process
Dinesh Nirmal explains how AI is helping supply school lunch and keep ahead of regulations.
Jacob Ward reveals the relationship between the unconscious habits of our minds and the way that AI is poised to amplify them, alter them, maybe even reprogram them.
Ziya Ma discusses how recent innovations from Intel in high-capacity persistent memory and open source software are accelerating production-scale deployments.
Hilary Mason explores the current state of AI and ML and what’s coming next in applied ML.
Brain-based human-machine interfaces: New developments, legal and ethical issues, and potential uses
Amanda Pustilnik highlights potential applications of data from new technologies that capture brain-based processes.
The O’Reilly Data Show Podcast: Alan Nichol on building a suite of open source tools for chatbot developers.
Ted Dunning discusses how new tools can change the way production systems work.
Ben Lorica offers an overview of recent tools for building privacy-preserving and secure machine learning products and services.
Drew Paroski and Aatif Din share how to develop modern database applications without sacrificing cost savings, data familiarity, and flexibility.
Joseph Lubin explains how Ethereum can help with new innovations like cryptocurrencies, automated and self-executing legal agreements, and self-sovereign identity.
Cassie Kozyrkov explores why businesses fail at machine learning despite its tremendous potential and excitement.
Executives from Cloudera and PNC Bank look at the challenges posed by data-hungry organizations.
DD Dasgupta explores the edge-cloud continuum, explaining how the roles of data centers and cloud infrastructure are redefined through the mainstream adoption of AI, ML, and IoT technologies.
Watch highlights from expert talks covering data science, machine learning, algorithmic accountability, and more.
It has become much more feasible to run high-performance data platforms directly inside Kubernetes.
If we’re going to think about the ethics of data and how it’s used, then we have to take into account how data flows.
This collection of data governance resources will get you up to speed on the basics and best practices.
The O’Reilly Data Show Podcast: Eric Jonas on Pywren, scientific computation, and machine learning.
Fernando Perez talks about UC Berkeley's transition into an environment where many undergraduates use Jupyter and the open data ecosystem as naturally as they use email.
David Schaaf explains how data science and data engineering can work together to deliver results to decision makers.
Michelle Gill discusses how data science methods and tools can link information from different scientific fields and accelerate discovery.
Michelle Ufford shares how Netflix leverages notebooks today and describes a brief vision for the future.
Tracy Teal explains how to bring people to data and empower them to address their questions.
Cristian Capdevila explains how Prognos is predicting disease.
Ryan Abernathey makes the case for the large-scale migration of scientific data and research to the cloud.
Watch keynotes covering Jupyter's role in business, data science, higher education, open source, journalism, and other domains, from JupyterCon in New York 2018.
Dan Romuald Mbanga walks through the ecosystem around the machine learning platform and API services at AWS.
Julia Meinwald outlines effective ways to support the unseen labor maintaining a healthy open source ecosystem.
Luciano Resende explores some of the open source initiatives IBM is leading in the Jupyter ecosystem.
Paco Nathan shares a few unexpected things that emerged in Jupyter in 2018.
Mark Hansen explains how computation has forever changed the practice of journalism.
Carol Willing shows how Jupyter's challenges can be addressed by embracing complexity and trusting others.
All the cool kids are doing it, maybe we should too? Jupyter, gravitational waves, and the LIGO and Virgo Scientific Collaborations
Will Farr offers lessons about the many advantages and few disadvantages of using Jupyter for global scientific collaborations.
The O’Reilly Data Show Podcast: Harish Doddi on accelerating the path from prototype to production.
The deployment of big data tools is being held back by the lack of standards in a number of growth areas.
New survey results highlight the ways organizations are handling machine learning's move to the mainstream.
These studies provide a foundation for discussing ethical issues so we can better integrate data ethics in real life.
The O’Reilly Data Show Podcast: Chang Liu on operations research, and the interplay between differential privacy and machine learning.
We can build a future we want to live in, or we can build a nightmare. The choice is up to us.
Five framing guidelines to help you think about building data products.
Recognizing the interest in ML, the Strata Data Conference program is designed to help companies adopt ML across large sections of their existing operations.
The O’Reilly Data Show Podcast: Andrew Feldman on why deep learning is ushering a golden age for compute architecture.
While models and algorithms garner most of the media coverage, this is a great time to be thinking about building tools in data.