The O’Reilly Data Show Podcast: Danny Lange on how reinforcement learning can accelerate software development and how it can be democratized.
A comparison of the accuracy and performance of Spark-NLP vs. spaCy, and some use case recommendations.
A step-by-step guide to building and running a natural language processing pipeline.
A step-by-step guide to initialize the libraries, load the data, and train a tokenizer model using Spark-NLP and spaCy.
A look at the new streaming SQL engine for Apache Kafka.
Attend a day-long exploration of Jupyter's best practices and practical use cases in business and industry.
Ingest the data you need in an agile manner.
The O’Reilly Data Show Podcast: Leo Meyerovich on building large-scale, interactive applications that enable visual investigations.
Alysa Hutnik discusses the Fair Credit Reporting Act, the Equal Credit Opportunity Act, the Gramm-Leach Bliley Act, and the FTC’s focus on FinTech.
How companies such as athenahealth can transform legacy data into insights.
Gain agility by loading first and transforming later.
A glimpse into what lies ahead for response automation, model compliance, and repeatable experiments.
The media and ad tech sessions at the Strata Data Conference in San Jose will dig deep into how media businesses are changing.
The O’Reilly Data Show Podcast: Mark Hammond on applications of reinforcement learning to manufacturing and industrial automation.
Regardless of country or culture, any solid data science plan needs to address veracity, storage, analysis, and use.
By packaging and delivering actionable data in applications, product managers can help users achieve their goals.
The ability to appeal may be the most important part of a fair system, and it's one that isn't often discussed in data circles.
Designing application architectures for real-time decisions.
Facilitating data exchange across the enterprise.
The classic “write once, run everywhere” principle comes to life in streaming data.
Exploring many small regions of a graph with low latency using specialized graph and multi-model databases.
The O’Reilly Data Show Podcast: Fabian Yamaguchi on the potential of using large-scale analytics on graph representations of code.
It’s time to think about how the systems we build interact with our world and build systems that make our world a better place.
The sessions and training courses at Strata Data San Jose 2018 will focus on practical use cases of machine learning for data scientists, engineers, managers, and executives.
Top five characteristics to consider when deciding which library to use.
As the use of analytics proliferate, companies will need to be able to identify models that are breaking bad.
The O’Reilly Data Show Podcast: Kris Hammond on business applications of AI technologies and educating future AI specialists.
AI, blockchain, payment regionalization, and other fintech trends to watch.
How new developments in algorithms, machine learning, analytics, infrastructure, data ethics, and culture will shape data in 2018.
The O’Reilly Data Show Podcast: Tim Kraska on why ML will change how we build core algorithms and data structures.
Decoding simple regex features to match complex text patterns.
Every line of business must have access to the digital tools needed to innovate at the edge.
Ajey Gore looks at how the impossible can be made possible with technology and data insights.
Kira Radinsky describes a system that mines medical records and Wikipedia to reduce spurious correlations and provide guidance about drug repurposing.
Watch highlights covering machine learning, smart cities, automation, and more. From Strata Data Conference in Singapore 2017.
Carme Artigas asks: Are innovations like autonomous vehicles and flying drones making our societies more intelligent?
Amr Awadallah explains the historic importance of the next wave in automation.
Pascale Fung explains how emotional interaction is being integrated into machines.
Tony Lee outlines the unique big data and AI challenges JD.com is tackling.
The O’Reilly Data Show Podcast: Christine Hung on using data to drive digital transformation and recommenders that increase user engagement.
A look at the rise of the deep learning library PyTorch and simultaneous advancements in recommender systems.
Without the proper cataloging, curation, and security that self-service data platforms allow, companies are left vulnerable to cybersecurity threats and misinformation.
Melanie Johnston-Hollitt discusses a radio telescope project that will produce data on a scale that dwarfs most big data efforts.
Steve Leonard explores how Singapore is bringing together ambitious and capable people to build technology that can solve the world’s toughest challenges.
Cesar Delgado joins Mick Hollison to discuss how Apple is using its big data stack and expertise to solve non-data problems.
Joshua Bloom explains why the real revolution will happen—in improved and saved lives—when machine learning automation is coupled with industrial data.
Bruno Fernandez-Ruiz discusses the tradeoffs we make to ensure safer transportation.
Ben Lorica explains how to guard against flaws and failures in your machine learning deployments.
Felipe Hoffa says data-based conclusions are possible when stakeholders can easily analyze all relevant data.
The O’Reilly Media Podcast: Gayle Sheppard, Saffron AI Group at Intel, and David Thomas, Bank of New Zealand.
O’Reilly Media Podcast: David Hsieh, of Qubole, in conversation with John Slocum, of MediaMath.
A survey of usage, access methods, projects, and skills.
The O’Reilly Data Show Podcast: Neha Narkhede on data integration, microservices, and Kafka’s roadmap.
Drawing parallels and distinctions around neural networks, data sets, and hardware.
The O’Reilly Data Show Podcast: David Talby on a new NLP library for Spark, and why model development starts after a model gets deployed to production.
Analyzing tweets and posts around Trump, Russia, and the NFL using information entropy, network analysis, and community detection algorithms.
Reduce troubleshooting time from days to seconds.
Multi-model database architectures provide a flexible data governance platform
The convergence of big data, artificial intelligence, and business intelligence
The O’Reilly Data Show Podcast: Rhea Liu on technology trends in China.