Ingest the data you need in an agile manner.
A glimpse into what lies ahead for response automation, model compliance, and repeatable experiments.
Decoding simple regex features to match complex text patterns.
A look at the rise of the deep learning library PyTorch and simultaneous advancements in recommender systems.
Without the proper cataloging, curation, and security that self-service data platforms allow, companies are left vulnerable to cybersecurity threats and misinformation.
O’Reilly Media Podcast: David Hsieh, of Qubole, in conversation with John Slocum, of MediaMath.
A survey of usage, access methods, projects, and skills.
Drawing parallels and distinctions around neural networks, data sets, and hardware.
Analyzing tweets and posts around Trump, Russia, and the NFL using information entropy, network analysis, and community detection algorithms.
Reduce troubleshooting time from days to seconds.
The convergence of big data, artificial intelligence, and business intelligence.
Solving challenges of data analytics to make data accessible to all.
Fast data and virtualization are shifting the way telcos approach the IoT.
The right AI solution is the one that fits the skill set of the users and solves the highest-priority problems for the business.
To become a “machine learning company,” you need tools and processes to overcome challenges in data, engineering, and models.
The O'Reilly Podcast: Han Yang on the importance of investment, innovation, and improvisation.
Applying methods from Agile software development to data science projects.
Untangling data pipelines with a streaming platform.
Become more agile with business intelligence and data analytics.
How human-in-the-loop data analytics is accelerating the discovery of insights.
The O’Reilly Podcast: Achieving greater reliability and security when integrating data.
The O'Reilly Podcast: Gary Orenstein on developing a data infrastructure that enables the latest applications in machine learning and AI.
Utilizing GPU power to improve performance and agility.
A deep dive into Uber's engineering effort to optimize geospatial queries in Presto.
The O'Reilly Podcast: Dave Cassel on building a unified enterprise database to store and query any type of data.
6 lessons learned to get a quick start on productivity.
A look at the Layer API, TFLearn, and Keras.
Applications of CNNs for real-time image classification in the enterprise.
Building a production-grade real-time image classification system.
Why machine learning needs real-time data infrastructure.
Recent trends in practical use and a discussion of key bottlenecks in supervised machine learning.
The toughest part of machine learning with Spark isn't what you think it is.
Human-guided ML pipelines for data unification and cleaning might be the only way to provide complete and trustworthy data sets for effective analytics.
Using a single cloud provider is a thing of the past.
Practical questions to help you make a decision.
Tamr’s Eliot Knudsen on algorithms that work alongside human experts.
A multi-model approach to transforming data from a liability to an asset.
A framework for moving from data to wisdom.
Authors Julia Silge and David Robinson discuss the power of tidy data principles, sentiment lexicons, and what they're up to at Stack Overflow.
Recapping winners of the Strata San Jose Startup Showcase.
Stewart Rogers on building and managing products with embedded analytics.
A new architecture for today’s data-rich modern applications.
Integrate and access any form of data using a multi-model database.
Exploring a reference architecture solution.
Overcome three types of debt to ship quality machine learning code.
A new role focused on creating data products and making data science work in production.
The O'Reilly Podcast: Ken Krupa on the challenge of data integration, and a solution.
Nothing says machine learning can't outperform humans, but it's important to realize perfect machine learning doesn't, and won't, exist.
Bas Geerdink details the technology stack for real-time account forecasting at ING, and outlines how Spark is used for outbound communications.
Access to critical data in real time enables workers to generate insights from large amounts of information.
Metadata is central to a modern data architecture.
A possible solution to the complexities that plague big data projects.
June Andrews talks about simple, cost-effective algorithmic computing at scale.
Kurt Brown discusses services in use, such as Genie, Metacat, Charlotte, and Microbots.
There’s money to be made in exhaust data (not just data exhaust).
Scientific use cases show promise, but challenges remain for complex data analytics.
Andra Keay discusses the five laws of robotics design.
Michael Jordan on developing a new platform to support real-time decision-making.
O'Reilly Podcast: Ian Fyfe of Zoomdata on the importance of “speed-of-thought analysis” in modern data environments.
The present and future of data integration in the cloud.