O'Reilly Webcasts Webcasts RSS Feed

Learn from experts online, for free

Register for an upcoming free, live webcast or browse our video archive of past events.

Sign up
and never miss a webcast.

View Upcoming Webcasts by Category:

Upcoming: Data

View Archived Webcasts by Category:

Video Archive: Data

Webcasts are made available as a video shortly following each live event.

Up Your R Game

By Bill Franks, James Taylor | July 29, 2014
This webcast discusses requirements for R as it evolves into a big data and enterprise-analytic solution and presents a novel approach to make Open source R massively scalable, reliable, and easy to use.

http://cdn.oreillystatic.com/images/people/weblogs/pete_warden2-50.jpgPete Warden

How to Get Started with Deep Learning in Computer Vision

By Pete Warden | July 24, 2014
In this webcast Pete Warden will walk through some popular open-source tools from the academic world, and show you step-by-step how to process images with them.

http://cdn.oreillystatic.com/images/people/weblogs/sarah_guido-50.jpgSarah Guido

Analyzing Data with Python

By Sarah Guido | July 09, 2014
In this webcast led by Sarah Guido, you'll get a bird's eye overview of some of the best tools for data analysis and how you can apply them to your workflow.

http://cdn.oreillystatic.com/images/people/weblogs/jodok_batlogg-50.jpgJodok Batlogg

Super Simple Real-Time Big Data Backend: Crate Data

By Jodok Batlogg | July 08, 2014
In this webcast we will demonstrate, step-by-step example how a web service can be deployed with the full service stack (data and application) on a single node and then add nodes as needed just by starting them.

Big Data, Fast Data: The Need for In-Memory Database Technology

By Michael Stonebraker, Scott Jarr | June 25, 2014
In this webcast, Scott Jarr, co-founder and chief strategy officer at VoltDB, will discuss the new corporate data architecture — and the necessary technology components for facing this data management challenge.

http://cdn.oreillystatic.com/images/people/weblogs/yves_hilpisch-50.jpgYves Hilpisch

Derivatives Analytics with Python

By Yves Hilpisch | June 24, 2014
In this webcast you will learn how Python can be used for Derivatives Analytics and Financial Engineering.

http://cdn.oreillystatic.com/images/people/weblogs/alice_zheng-50.jpgAlice Zheng

Scalable Data Science on a Laptop

By Alice Zheng | June 24, 2014
In this webcast, we'll demonstrate doing scalable data science using GraphLab Create, an end-to-end platform for prototyping and deploying data products.

http://cdn.oreillystatic.com/images/people/weblogs/elliot_williams-50.jpgElliot Williams

Building your Own USB Devices for AVR with the V-USB Library

By Elliot Williams | June 20, 2014
This webcast will walk you through two example projects: a custom scrollwheel mouse and a USB temperature controller.

http://cdn.oreillystatic.com/images/people/weblogs/mikio_braun-50.jpgMikio Braun

Data Analysis on Streams

By Mikio Braun | June 12, 2014
In this webcast, Mikio Braun will discuss building reliable and efficient solutions for real-time data analysis, including approaches that rely on scaling--both batch-oriented (such as MapReduce), and stream-oriented (such as Apache Storm and Apache ...

http://cdn.oreillystatic.com/images/people/weblogs/paco_nathan.jpgPaco Nathan

Computational Thinking: Just Enough Math

By Paco Nathan | June 04, 2014
In the webcast, we'll review some of the historical context that led to machine learning techniques.

http://cdn.oreillystatic.com/images/people/weblogs/florian_haas-50.jpgFlorian Haas

Hands On Trove: Database as a Service in OpenStack

By Florian Haas | May 21, 2014
In this webcast led by Florian Haas, he'll cover the architecture of Trove, and demonstrates the deployment of OpenStack Trove on an OpenStack private cloud, in order to provide MySQL DBaaS to OpenStack users.

http://cdn.oreillystatic.com/images/people/weblogs/jay_kreps-50.jpgJay Kreps

I ♥ Logs: Apache Kafka and Real-time Data Integration

By Jay Kreps | May 21, 2014
This webcast talk will discuss how logs and stream-processing can form a backbone for data flow, ETL, and real-time data processing.

http://cdn.oreillystatic.com/images/people/weblogs/michael_armbrust-50.jpgMichael Armbrust

Performing Advanced Analytics on Relational Data with Spark SQL

By Michael Armbrust | April 29, 2014
In this webcast, we'll examine Spark SQL, a new Alpha component that is part of the Apache Spark 1.0 release.

http://cdn.oreillystatic.com/images/people/weblogs/michael_collins-50.jpgMichael Collins

Before the Math: Detecting Security Issues Using Exploratory Data Analysis

By Michael Collins | April 24, 2014
In this webcast, we'll discuss how to apply the art of exploratory data analysis to security questions.

http://cdn.oreillystatic.com/images/people/weblogs/lutz_finger.jpgLutz Finger

Ask-Measure-Learn to Gain Actionable Insights from Your Big Data

By Lutz Finger | April 03, 2014
This webcast shows how to extract significant business value from big data with Ask-Measure-Learn, a system that helps you ask the right questions, measure the right data, and then learn from the results.

http://cdn.oreillystatic.com/images/people/weblogs/mark_larosa-50.jpgMark LaRosa

Better, Faster Business Analytics with In-memory Databases

By Mark LaRosa | April 02, 2014
In this webcast, we will look at the benefits of in-memory technology and the business value it brings. Attendees will also have a chance to see the speed, scale, and simplicity of MemSQL’s in-memory solution, and why it is evolving the database landscape...

http://cdn.oreillystatic.com/images/people/weblogs/tricia_wang_50.jpgTricia Wang

Why Big Data Needs Thick Data

By Tricia Wang | March 28, 2014
Big Data can help predict the future, but can too much Big Data be dangerous for your organization? Yes, says global tech ethnographer, Tricia Wang. This webcast examines the risks of over-reliance on big data and the need to bring in Thick Data&mdash...

http://cdn.oreillystatic.com/images/people/weblogs/max_shron-50.jpgMax Shron

Thinking with Data

By Max Shron | March 13, 2014
This webcast examines a framework for incorporating ideas from other fields (like design, argument studies, and consulting) into Data Science.

http://cdn.oreillystatic.com/images/people/weblogs/karen_hsu-50.jpgKaren Hsu

Instant Visualization in Every Step of Analysis

By Karen Hsu | February 27, 2014
In this webcast, we'll discuss how IT and business users can leverage self-service visualizations to quickly spot and correct data anomalies throughout the analytic process.

http://cdn.oreillystatic.com/images/people/weblogs/adam_kawa-50.jpgAdam Kawa

Hadoop Adventures at Spotify

By Adam Kawa | February 27, 2014
In this webcast talk led by Adam Kawa, we will talk about our real-world Hadoop issues that either broke our cluster or made it very unstable, especially when we were growing very fast from a 60 to 690-node Hadoop cluster.

http://cdn.oreillystatic.com/images/people/weblogs/ken_gleason-50.jpgKen Gleason

Data Quality Demystified: Knowing When Your Data is Good Enough

By Ken Gleason | February 13, 2014
This webcast introduces a simple conceptual framework for thinking about data quality and strategies for evaluating quality proactively to improve results and reduce unnecessary repetition.

How to Get Statistics Right in AB Testing: The Short Answer

By Zack Exley, Sahar Massachi | February 05, 2014
In this webcast talk we'll present simple methods that we believe accurately predict future performance from AB test results, and that allow us to determine the smallest acceptable sample size. Using four years of AB testing data, we'll show that these...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Data Everywhere: Data Anthropology, Quantified Self, Machine Data, Human Centered Design, and more

By Alistair Croll | February 04, 2014
In this free online conference, we'll be showcasing some of the hot topics and thought-provoking speakers who will be joining us for the event. It's your chance to see what we're covering and to find those can't-miss tracks and sessions.

Predictive Analytics, Machine Learning, and Recommendation Systems on Hadoop

By Wayne Thompson, Georgia Mariani | January 30, 2014
Join us to learn more about how to reveal insights in your Big data and redefine how your organization solves complex problems.

http://cdn.oreillystatic.com/images/people/weblogs/andrew_collette-50.jpgAndrew Collette

Managing Large Datasets with Python and HDF5

By Andrew Collette | January 28, 2014
This webcast provides a practical, Python-based introduction to the world of HDF5.

http://cdn.oreillystatic.com/images/people/weblogs/paco_nathan.jpgPaco Nathan

Getting Started Running Apache Spark on Apache Mesos

By Paco Nathan | January 24, 2014
This tutorial shows a simple way to launch a Mesos cluster in the cloud, how to configure run Spark on Mesos, then how to run jobs in Spark.

The End of the Analytics Black Box

By Steven Hillion, Joel Horwitz | January 23, 2014
Please join Steven Hillion, Alpine Chief Product Officer, and Joel Horwitz, Alpine Head of Product Marketing, as they share the story of how to and take advanced analytics out of the black box and into the hands of every decision maker in your organization...

http://cdn.oreillystatic.com/images/people/weblogs/joy_beatty.jpgJoy Beatty

Forward Thinking for Tomorrow's Projects: Requirements for Business Analytics

By Joy Beatty | January 23, 2014
In this webcast presentation, Joy Beatty, VP of R&D at Seilevel, offers advice on tackling requirements for business analytics projects. Drawing from the book she co-authored with Karl Wiegers, Software Requirements 3rd Ed., Joy will outline how ...

http://cdn.oreillystatic.com/images/people/weblogs/trent_hauck-50.jpgTrent Hauck

A Detailed Look at Pandas' Indexes

By Trent Hauck | January 22, 2014
Join Trent Hauck author of Instant Data Intensive Apps with Pandas How-to, for a hands-on webcast where he will discuss motivations for using indexed data structures over non-indexed data structures in pandas.

http://cdn.oreillystatic.com/images/people/weblogs/scott_murray.jpgScott Murray

From Scattered to Scatterplot in 2 Hours: An Introduction to d3.js

By Scott Murray | January 08, 2014
Confused by D3? Interested in coding data visualizations on the web, but don't know where to start? This online tutorial will have you transforming data into visual images in no time at all, starting from scratch and building an interactive scatterplot...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Data, Crime, and Conflict

By Alistair Croll | January 07, 2014
Join a lineup of thinkers and technologists for this free online event as we look at the ways data is shaping how we police ourselves, from technological innovations to ethical dilemmas.

http://cdn.oreillystatic.com/images/people/weblogs/matthew_a_russell.jpgMatthew Russell

Data Science Experiments with Twitter and IPython Notebook

By Matthew Russell | December 13, 2013
After attending this mini-workshop, you'll be able to run your own data science experiments with Twitter's API and IPython Notebook! Besides learning the fundamentals of how to use IPython Notebook, you'll learn how to do the following kinds of things...

http://cdn.oreillystatic.com/images/people/weblogs/scott_murray.jpgScott Murray

Whatever Happened to "Augmenting Human Intellect"?

By Scott Murray | November 20, 2013
Join us for an interactive webcast presented by Scott Murray where we explore the fundamental role of data visualization is to express information in a form more palatable to human perception than rows and columns of raw values.

http://cdn.oreillystatic.com/images/people/weblogs/lynwood_bishop-50.jpgLynwood Bishop

Using Every Pixel to Visualize Big Data

By Lynwood Bishop | November 08, 2013
Visualizing patterns, relationships and anomalies in multi-sourced data is challenging when the number of records continues to grow exponentially. Many traditional methods of visualization for business intelligence and reporting aggregate the results...

http://cdn.oreillystatic.com/images/people/weblogs/michael_shoffner-50.jpgMichael Shoffner

Turning Bigger Data Into Better Healthcare

By Michael Shoffner | November 07, 2013
This webcast presentation paints a picture of the direction clinical medicine is heading in the age of Big Data, highlighting ongoing data cyberinfrastructure development by University of North Carolina at Chapel Hill based RENCI and key partners to ...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Data and Ethics: Etiquette and Law for an Always-On World

By Alistair Croll | November 05, 2013
In this online conference, we'll look at where ethics and the law are headed in an always-on, data-driven society.

http://cdn.oreillystatic.com/images/people/weblogs/susan_etlinger-50.jpgSusan Etlinger

Canary in the Coalmine: How Social Data Can Prepare Us for Big Data

By Susan Etlinger | October 22, 2013
Modern data analysis requires that you have two jobs: being a statistician and being a programmer. This is especially true with R, where pointing and clicking to analyze data is mostly not an option. Fortunately, the jump from writing code like a statistician...

http://cdn.oreillystatic.com/images/people/weblogs/marc_garrett-50.jpgMarc Garrett

Fitter, Happier: Improve Your Health and Productivity with R

By Marc Garrett | October 15, 2013
Intridea is famous for our distributed team. We believe that letting people work from home leads to happier employees and better client outcomes. But there's one drawback: the freshman fifteen! Working from home means working close to your refrigerator...

http://cdn.oreillystatic.com/images/people/weblogs/matthew_a_russell.jpgMatthew Russell

Why Twitter Is All the Rage: A Data Miner's Perspective

By Matthew Russell | October 15, 2013
In order to be successful, technology must amplify a meaningful aspect of our human experience, and Twitter’s success largely has been dependent on its ability to do this quite well. Although you could describe Twitter as just a “free, high-speed...

Real-time Stream Processing and Visualization Using Kafka, Storm, and d3.js

By Byron Ellis, Justin Langseth | October 10, 2013
In this hands-on webcast you'll learn how LivePerson and Zoomdata perform stream processing and visualization on mobile devices of structured site traffic and unstructured chat data in real-time for business decision making.

http://cdn.oreillystatic.com/images/people/weblogs/kim_rees-50.jpgKim Rees

Best of Strata + Hadoop World 2012: How to See Data

By Kim Rees | October 09, 2013
Join us for an exclusive presentation by Kim Reese recorded live at Strata + Hadoop World 2012

http://cdn.oreillystatic.com/images/people/weblogs/carl_steinbach-50.jpgCarl Steinbach

The Best of Strata Santa Clara 2013: SQL on Hadoop

By Carl Steinbach | October 02, 2013
In this talk we will discuss the unavoidable cost and performance limitations of the connector-based approach employed by many established vendors and explain the long-term significance of Apache Hive's data model along with its influence on next generation...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Big Data and the Ethics and Challenges of Living in a Connected Society

By Alistair Croll | September 27, 2013
In late October, Strata+Hadoop World returns to the Big Apple. This year, we have a wide range of topics, from real-world case studies to hard-core data science to the ethics and challenges of a connected society.

http://cdn.oreillystatic.com/images/people/weblogs/richard_cotton.jpgRichie Cotton

Writing Great R Code

By Richie Cotton | September 25, 2013
Modern data analysis requires that you have two jobs: being a statistician and being a programmer. This is especially true with R, where pointing and clicking to analyze data is mostly not an option. Fortunately, the jump from writing code like a statistician...

http://cdn.oreillystatic.com/images/people/weblogs/paco_nathan.jpgPaco Nathan

Enterprise Data Workflows with Cascading

By Paco Nathan | September 17, 2013
In this hands-on webcast presented by Paco Nathan author of Enterprise Data Workflows with Cascading, he will discuss what defines a workflow , in contrast to notions of dataflow and the impact that has on the tools required.

Anonymizing Health Data

By Luk Arbuckle, Khaled El Emam | September 13, 2013
In this webcast we'll start with a discussion of the relatively simple de-identification of a cross-sectional disease registry, and then we'll jump in to more complex situations like the de-identification of longitudinal data, free-form text, and geospatial...

http://cdn.oreillystatic.com/images/people/weblogs/sreedhar_potarazu-50.jpgSreedhar Potarazu

Why Facebook and Google Missed the Boat on Healthcare

By Sreedhar Potarazu | September 04, 2013
In this webcast talk Dr. Sreedhar Potarazu (Dr P), Fox News Contributor, Acclaimed Author and Nationally Recognized Expert on Big Data and Healthcare, tells the story never told on the next Big THING in healthcare and the lessons learned from Silicon...

http://cdn.oreillystatic.com/images/people/weblogs/robert_grossman-50.jpgRobert Grossman

Community Clouds for Cancer Genomics: Lessons Learned from Bionimbus

By Robert Grossman | August 20, 2013
Join us for a webcast talk by Robert Grossman where he shares how his organization recently expanded Bionimbus so that researchers can analyze data from controlled datasets, such as The Cancer Genome Atlas (TCGA) in a secure and compliant fashion.

http://cdn.oreillystatic.com/images/people/weblogs/john_kansky-50.jpgJohn Kansky

Best of Strata Rx 2012: HIE 2.0 - The Future of Health Information Exchange

By John Kansky | August 09, 2013
This is an exclusive session presented by John Kansky recorded live at Strata Rx 2012.

http://cdn.oreillystatic.com/images/people/weblogs/igri_gorik.jpgIlya Grigorik

Best of Strata + Hadoop World 2012: Analyzing Millions of GitHub Commits

By Ilya Grigorik | August 08, 2013
In this session, we will discuss our experience in using BigQuery, how we modeled the GitHub event data, and the lessons learned in importing and making the data available.

Best of Strata Rx 2012: Doing Big Data All By Yourself

By Lauren Chaparro, Ari Gesher | July 30, 2013
In this presentation, we will show a working system that bridges the gap between data analysis and decision making using a carefully composed set of big-data technologies mated with an interactive, high-level interface.

http://cdn.oreillystatic.com/images/people/weblogs/shahid_shah-50.jpgShahid Shah

Best of Strata Rx 2012: Reasons why health data is poorly integrated today and what we can do about it

By Shahid Shah | July 03, 2013
In this talk Shahid N. Shah will look at the specific things that are holding us back when it comes to poor integration in healthcare and what future EHRs can do about it.

http://cdn.oreillystatic.com/images/people/weblogs/jennifer_van_der_meer-50.jpgJen van der Meer

The Best of Strata Santa Clara 2013: Data is Not a Business Model

By Jen van der Meer | July 02, 2013
This talk will help anyone who is tasked with determining how to get more business action out of data.

Best of Strata Rx 2012: Disruptors: What Healthcare Will Look Like In 2020

By John Mattison, Tim O'Reilly, DJ Patil, Benjamin West | June 28, 2013
This is an exclusive panel discussion with Tim O'Reilly, DJ Patill, John Mattison, and Benjamin West recorded live from Strata Rx 2012.

http://cdn.oreillystatic.com/images/people/weblogs/jim_blomo-50.jpgJim Blomo

How We Build Data Mining Teams at Yelp

By Jim Blomo | June 18, 2013
Starting and growing a data science team doesn't have to be a risky proposition. By balancing long term strategy and technology goals with immediate business demands, your data science team can quickly become productive and enjoy sustained growth.

http://cdn.oreillystatic.com/images/people/weblogs/sheridan_hitchens-50.jpgSheridan Hitchens

Best of Strata + Hadoop World: Moving to Big Data

By Sheridan Hitchens | June 11, 2013
Join us for an exclusive presentation by Sheriden Hitchens recorded live from Strata + Hadoop World 2012.

http://cdn.oreillystatic.com/images/people/weblogs/jon_bruner-50.jpgJon Bruner

Strata Online Conference: Mobility, Data, and Analytics

By Jon Bruner | June 05, 2013
In this Strata Online event, we'll look at some of the ways the rise of the always-on world is feeding the hungry engines of Big Data.

http://cdn.oreillystatic.com/images/people/weblogs/scott_murray.jpgScott Murray

Data Visualization - The Value of Process

By Scott Murray | March 20, 2013
This webcast talk presented by Scott Murray author of Interactive Data Visualization for the Web, will introduce ideas from conceptual art, connecting them to the daily challenges faced by data visualizers working with code.

http://cdn.oreillystatic.com/images/people/weblogs/winston_chang2-50.jpgWinston Chang

Introduction to Data Visualization with R and ggplot2

By Winston Chang | March 06, 2013
In this webcast presented by Winston Chang, author of R graphics Cookbook, you'll learn the basics of how to create data graphics using R and the popular ggplot2 package.

http://cdn.oreillystatic.com/images/people/weblogs/jeremy_howard-50.jpgJeremy Howard

Deep Learning - The Biggest Data Science Breakthrough of the Decade

By Jeremy Howard | March 05, 2013
In this webcast talk Jeremy Howard, Kaggle's president and chief scientist, will explain exactly what occurred, why it was front-page newsworthy for the New York Times, how it will impact business, and what you need to know to make these new algorithms...

http://cdn.oreillystatic.com/images/people/weblogs/maksim_tsvetovat.jpgMaksim Tsvetovat

Community Detection in Social Media Data

By Maksim Tsvetovat | March 05, 2013
In this webcast talk Maksim Tsvetovat author of Social Network Analysis for Startups will introduce a number of ways to address these issues and present an open-source Python-based toolkit for detecting and visualizing communities in Twitter networks...

http://cdn.oreillystatic.com/images/people/weblogs/wes_mckinney-1.jpgWes McKinney

Building Rich, High Performance Tools for Practical Data Analysis

By Wes McKinney | February 20, 2013
This live webcast is presented by Wes McKinney author of Python for Data Analysis and will be a somewhat advanced, technical talk connecting computer science concepts like data structure design and algorithms with the details of building intuitive, high...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Strata Online Conference: Strata Santa Clara 2013 Preview

By Alistair Croll | February 15, 2013
In this free online conference, we'll be showcasing some of the hot topics and thought-provoking speakers who will be joining us for the event.

http://cdn.oreillystatic.com/images/people/weblogs/bitsy_bentley-50.jpgBitsy Bentley

Designing for Data-driven Organizations

By Bitsy Bentley | February 14, 2013
Businesses have access to more data than ever before, but the question of how the data can be leveraged to drive action is at times a daunting task, especially for larger organizations.

http://cdn.oreillystatic.com/images/people/weblogs/scott_murray.jpgScott Murray

Engaging Audiences with Data Visualization

By Scott Murray | February 13, 2013
Join us for a hands-on webcast presented by Scott Murray author of Interactive Data Visualization for the Web, as he guides you through the framework of three avenues of engagement: aesthetic, narrative, and interactive.

http://cdn.oreillystatic.com/images/people/weblogs/cj_date.jpgC.J. Date

View Updating: How to Make it Work

By C.J. Date | January 30, 2013
In this webcast presentation, the overall message is: Views in general are just as updatable as base tables are! Attend this webcast and see why this isn't as extravagant a claim as it might seem.

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Strata Online Conference: Data Warfare

By Alistair Croll | January 22, 2013
From public policy to elections, from healthcare to the battlefield, our lives rely on the analysis of abundant, connected data. But if data is infrastructure, then that infrastructure's vulnerable. Enemies can confound, confuse, distort, and mislead...

http://cdn.oreillystatic.com/images/people/weblogs/casey_micheline-50.jpgMicheline Casey

What Business People Need to Know About Data Governance

By Micheline Casey | January 15, 2013
In this webcast, Micheline Casey provides an overview of data governance and data management principles that should be applied to big data projects.

http://cdn.oreillystatic.com/images/people/weblogs/david_boyle-50.jpgDavid Boyle

How EMI is Changing the Culture of the Music Industry

By David Boyle | January 08, 2013
In this exclusive webcast, David Boyle will look at how EMI changed itself, and the music industry, by moving from gut instinct and opinions to a data-informed business.

http://cdn.oreillystatic.com/images/people/weblogs/khaled_el_emam-50.jpgKhaled El Emam

Responsibly Sharing Data Under HIPAA

By Khaled El Emam | October 31, 2012
In this webcast presentation we will first provide an overview of how data can be re-identified, with reference to a number of recent real world examples. This will be followed by a description of how to de-identify health data in a defensible way according...

http://cdn.oreillystatic.com/images/people/weblogs/allen_downey.jpgAllen B. Downey

Bayesian Statistics Made Simple

By Allen B. Downey | October 26, 2012
Join Allen Downey, author of Think Stats: Probability and Statistics for Programmers for an introduction to Bayesian statistics using Python. Bayesian statistical methods are becoming more common and more important, but there are not many resources to...

http://cdn.oreillystatic.com/images/people/weblogs/benjamin_yoskovitz.jpgBenjamin Yoskovitz

Understanding the Value of Lean Analytics: Using Data to Build a Better Startup Faster

By Benjamin Yoskovitz | October 25, 2012
The Lean movement has revolutionized how we create products and companies today. It focuses on customer development and tackling the risky parts first. At the core of this is iteration—a cycle of learning and adapting that's driven by data. Lean...

How to Develop Language Annotations for Machine Learning Algorithms

By James Pustejovsky, Amber Stubbs | October 16, 2012
Text-based data mining and information extraction systems that make use of machine learning techniques require annotated datasets for training the algorithms. In this webcast we will discuss the steps involved in creating your own training corpus for...

http://cdn.oreillystatic.com/images/people/weblogs/tod_fetherling-50.jpgJ. Tod Fetherling

Healthcare 101: Cradle to Grave

By J. Tod Fetherling | October 12, 2012
J. Tod Fetherling presents this 90 minute white board session walking the user through every aspect of the healthcare system from wellness to death.

http://cdn.oreillystatic.com/images/people/weblogs/wes_mckinney-1.jpgWes McKinney

Python for Data Analysis

By Wes McKinney | October 10, 2012
In this hands-on webcast presented by Wes McKinney, author of Python for Data Analysis , he will showcase a number of examples and you will receive an introduction to some of the most important tools in the Python language for data preparation, data ...

http://cdn.oreillystatic.com/images/people/weblogs/julie_steele.jpgJulie Steele

Strata Rx Online Conference: Personalized Medicine

By Julie Steele | October 05, 2012
In this free online conference we will discuss how Microsoft Research has developed a new version of the Linear Mixed Model algorithm that is not only computationally inexpensive, but also is better at finding the true signals that account statistically...

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

Strata + Hadoop World: Join the Data Revolution

By Alistair Croll | October 03, 2012
In this free online conference, we preview some of the hot topics, provocative speakers, and game-changing innovations that are fueling the growth of a data-driven society.

An Introduction to Machine Learning for Hackers

By John Myles White, Drew Conway | September 18, 2012
We'll introduce programmers to two of the most common tools in the machine learning toolkit: linear regression and logistic regression.

http://cdn.oreillystatic.com/images/people/weblogs/lars_george.jpgLars George

Choosing Hardware for Hadoop

By Lars George | August 15, 2012
In this webcast we will look at popular reference architectures used by companies across several business verticals, discuss their pros and cons, and their applicability to different use-cases, and conclude with best-practice advise on hardware selection...

Data in Motion

By Kaitlin Thaney, Alistair Croll, Jacomo Corbo, Simon Williams, Neal Lathia, John Graham-Cumming | July 24, 2012
In this Strata Online Conference, we'll look at data and movement across a variety of sports and industries.

http://cdn.oreillystatic.com/images/people/weblogs/edd_dumbill.jpgEdd Dumbill

Get the (Data) Vote Out

By Edd Dumbill | June 20, 2012
In this Strata Online Event, we'll look at the way data science is shaping elections, from visualizations to game theory, from understanding issues to targeting voters.

http://cdn.oreillystatic.com/images/people/weblogs/steve_francia.jpgSteve Francia

MongoDB and PHP

By Steve Francia | May 18, 2012
In this webcast presentation by Steve Francia, author of MongoDB and PHP, you will learn how to build elegant database applications with MongoDB and PHP.

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

O'Reilly Strata Online Conference

By Alistair Croll | May 16, 2012
Join us for our seventh Strata online conference, as we look at Data That Matters.

Tim O'Reilly and Dave Campbell Explore How to Accelerate Insights from Data

By Tim O'Reilly, David Campbell | May 14, 2012
Tim O'Reilly, founder and CEO of O'Reilly Media, talks with Microsoft Technical Fellow Dave Campbell about new tools for data.

http://cdn.oreillystatic.com/images/people/weblogs/alan_gates.jpgAlan Gates

Current and Upcoming Work in Pig

By Alan Gates | May 10, 2012
In this webcast, we will cover how Pig can take advantage of changes in Hadoop 0.23.

Under the iceberg: Using APIs to transform your business

By Gregory Brail, Daniel Jacobson, Dan Woods | March 22, 2012
In this webcast presentation join Dan Jacobson , Greg Brail, and Dan Woods as they discuss how business leaders can use APIs to transform as a strategy to transform business through private and public APIs.

http://cdn.oreillystatic.com/images/people/weblogs/jared_rosoff-50.jpgJared Rosoff

MongoDB Schema Design: How to Think Non-Relational

By Jared Rosoff | February 17, 2012
In this webcast we'll provide a number of data modeling rules of thumb, and discuss the tradeoffs of various data modeling strategies.

http://cdn.oreillystatic.com/images/people/weblogs/kord_davis-50.jpgKord Davis

An Introduction to Ethics of Big Data

By Kord Davis | February 16, 2012
The material will address the intersection of ethics and Big Data; what it is and what it isn't. Specifically, how to approach and generate dialog about an abstract subject with direct, real-world implications.

http://cdn.oreillystatic.com/images/people/50/joe_kissell-50.jpgJoe Kissell

Take Control of iCloud

By Joe Kissell | February 03, 2012
In this webcast, veteran Mac author Joe Kissell explains what iCloud can do for you, how to deal with configuration puzzles and compatibility issues, and how best to manage the transition from MobileMe.

http://cdn.oreillystatic.com/images/people/weblogs/john-zablocki-50.jpgJohn Zablocki

Developing with .NET and Couchbase Server

By John Zablocki | January 27, 2012
In this webcast John Zablocki, Developer Advocate at Couchbase, will introduce the .NET client library for Couchbase Server.

http://cdn.oreillystatic.com/images/people/weblogs/alistair_croll-50.jpgAlistair Croll

O'Reilly Strata Online Conference

By Alistair Croll | December 07, 2011
In this online event, we'll look at how Big Data stacks and analytical approaches are gradually finding their way into organizations, as well as the roadblocks that can thwart efforts to become more data-driven.

http://cdn.oreillystatic.com/images/people/weblogs/maksim_tsvetovat.jpgMaksim Tsvetovat

Social Network Analysis -- Finding communities and influencers

By Maksim Tsvetovat | December 06, 2011
A follow-on to Analyzing Social Networks on Twitter, this webcast will concentrate on the social component of Twitter data rather then the questions of data gathering and decomposition.

http://cdn.oreillystatic.com/images/people/weblogs/lars_george.jpgLars George

HBase Coprocessors - Deploy shared functionality directly on the cluster

By Lars George | November 04, 2011
This session explains the concepts behind coprocessors and uses examples to show how they can be used to implement data side extensions to the application code.

The Evolution from Private to Public: Is There Privacy in the Digital Age?

By Jim Adler, danah boyd, Terence Craig, Natalie Fonseca, Heather West | October 28, 2011
Join the panelists as they consider the evolution from private to public: how are our worlds colliding in the digital age?

http://cdn.oreillystatic.com/images/people/weblogs/lars_george.jpgLars George

HBase Schema Design - Things you need to know

By Lars George | October 14, 2011
This session discusses the basic underlying concepts of the storage layer in HBase and how an application should be combined with the appropriate schema to achieve the best possible performance.

http://cdn.oreillystatic.com/images/people/weblogs/allen_downey.jpgAllen B. Downey

There's Only One Test

By Allen B. Downey | October 04, 2011
People working with real data are often confused about hypothesis testing and paralyzed by the number of tests and their requirements. In this webcast, Allen B. Downey, author of Think Stats, presents a framework for using simple simulations to estimate...

Privacy and Big Data: Is there room for privacy in the age of big data?

By Terence Craig, Mary Ludloff | September 14, 2011
In this webcast, Terence Craig and Mary Ludloff, authors of Privacy and Big Data, ask and answer this question: What level of privacy do you really have in the age of big data?

Designing Data Visualizations

By Julie Steele, Noah Iliinsky | September 06, 2011
This webcast will discuss data visualization. Learn linear processes and best practices so that your message may be transmitted without interference.

People, Data and Dollars — A Preview of Strata NYC

By Edd Dumbill, Kathryn Dekas, Michael Hugos, Michael Nelson, Hjalmar Gislason, Bill Schmarzo | August 31, 2011
In this special online event, you'll get an inside look at some of the world's leading thinkers and innovators in the fields of business, data, and disruption.

Building Access Applications with SQL Server Databases

| August 09, 2011
In this session we will be demonstrating the construction of DSN's, linking tables, views, and using stored procedures and views in pass-through queries. This will include a discussion of the benefits in using SQL Server Schemas and Synonyms.

http://cdn.oreillystatic.com/images/people/weblogs/mike_halsey3-50.jpgMike Halsey

Securing Your Files and Data in Windows

By Mike Halsey | August 04, 2011
In this webcast, Mike Halsey MVP, the author of Troubleshooting Windows 7 Inside Out will talk you though how to keep your files and data safe from even the worst disaster.

Couchbase: Find Out What the Merger of CouchOne and Membase Means for Users

By J. Chris Anderson, Dustin Sallings | April 19, 2011
In this webinar we'll introduce you to the Membase caching and clustering architecture, and show how CouchDB is a drop-in fit as the storage and query engine.

http://cdn.oreillystatic.com/images/people/weblogs/kristina_chodorow.jpgKristina Chodorow

How Sharding Works

By Kristina Chodorow | February 04, 2011
This talk is a combination of whitepaper and Magic School Bus tour of how MongoDB scales across multiple machines. For applications that outgrow the resources of a single database server, MongoDB can convert to a sharded cluster, automatically managing...

http://cdn.oreillystatic.com/images/people/50/bradford_stephens-50.jpgBradford Stephens

How to Decrease the Pain in Building Distributed Systems

By Bradford Stephens | January 12, 2011
Building distributed systems is painful. Many organizations are approaching the point where their data and application infrastructures are being run on many servers (in the cloud or datacenter). Our software practices don't reflect that, often with disastrous...

http://cdn.oreillystatic.com/images/people/50/hadi_hariri-50.jpgHadi Hariri

CouchDB for .NET Developers

By Hadi Hariri | December 21, 2010
What does that mean to a .NET Developer? How do we store and retrieve data? How do we query it? If you've been interested in document databases but do not know where to start, then this is definitely the webcast for you. We'll see what CouchDB is about...

http://cdn.oreillystatic.com/images/people/50/ken_goodhope-50.jpgKen Goodhope

Hadoop - Tips, Tricks, Optimizations, and Pitfalls

By Ken Goodhope | November 23, 2010
We'll use real world examples in this webcast that demonstrate how to best utilize MapReduce with Hadoop. We'll also examine the appropriate uses of special partitioners, combiners, and configuration optimizations. We'll expose some common mistakes and...

http://cdn.oreillystatic.com/images/people/50/benjamin_young-50.jpgBenjamin Young

PHP and CouchDB

By Benjamin Young | November 17, 2010
This talk will cover the basics of the CouchDB HTTP API and how to use it from PHP with and without helper libraries. We'll discuss some architecture approaches and briefly look at things to avoid when moving from an RDBMS to a Document Database such...

http://cdn.oreillystatic.com/images/people/50/c_brown-50.jpgC. Titus Brown

Probabilistic Data Structures and Breaking Down Big Sequence Data

By C. Titus Brown | November 10, 2010
Many data analysis problems are not easily parallelizable, often because the relevant analyses require an all-by-all analysis step. Applying heuristics often requires approximation, which introduces errors, noise, and bias. Recently, in confronting the...

http://cdn.oreillystatic.com/images/people/50/kyle_banker-50.jpgKyle Banker

Indexing Matters: A MongoDB Optimization Primer

By Kyle Banker | October 29, 2010
We all know that MongoDB is one of the most flexible and feature-rich databases available. In this session we'll discuss how you can leverage this feature set and maintain high performance with your project's massive data sets and high loads. We'll cover...

http://cdn.oreillystatic.com/images/people/50/kocoloski_adam-50.jpgAdam Kocoloski

Scaling Out CouchDB with BigCouch

By Adam Kocoloski | October 22, 2010
This talk will cover the basics of BigCouch, including deploying and managing your first CouchDB cluster, as well as some advanced features like quorum reads/writes and design patterns for distributed couchdb. Finally, for the erlang hackers out there...

http://cdn.oreillystatic.com/images/people/50/aaron_miller-50.jpgAaron Miller

Using CouchDB on Android

By Aaron Miller | September 22, 2010
Why CouchDB on a phone is awesome, and what you can do with it Deploying existing CouchApps to Android CouchDB Using CouchDB in native Android apps

http://cdn.oreillystatic.com/images/people/weblogs/kristina_chodorow.jpgKristina Chodorow

Scaling with MongoDB

By Kristina Chodorow | September 17, 2010
MongoDB's architecture features built-in support for horizontal scalability, and high availability through replica sets. Auto-sharding allows users to easily distribute data across many nodes. Replica sets enable automatic failover and recovery of database...

http://cdn.oreillystatic.com/images/people/weblogs/tom_white.jpgTom White

The State of Hadoop

By Tom White | September 15, 2010
Apache Hadoop is a part of a growing ecosystem of projects for large-scale data analysis which is being used to solve problems for organizations in a wide range of disciplines. This talk will touch on what's new in the second edition of Hadoop: The Definitive...

http://cdn.oreillystatic.com/images/people/weblogs/jan_jehnardt.jpgJan Lehnardt

Asynchronous architectures with the CouchDB _changes feed

By Jan Lehnardt | August 25, 2010
Learn how to build robust web services using CouchDB's built-in facility for near-realtime updates. We'll explore a few patterns _changes can be used for: Building custom external indexers like CouchDB-Lucene, Powering CouchDB's replication, Real-time...

http://cdn.oreillystatic.com/images/people/weblogs/sean_hull.jpgSean Hull

MySQL Upgrades With No Downtime

By Sean Hull | July 27, 2010
In this webcast we'll discuss a two-node MySQL multi-master replication setup. We'll take the audience step-by-step through the process, and then uses MMM (MySQL Multi-master Manager) to manage & automate the process exposing a virtual IP address...

http://cdn.oreillystatic.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson

Flexible Scaling with CouchDB Replication / Or how I learned to stop worrying and love Eventual Consistency

By J. Chris Anderson | July 14, 2010
CouchDB is known for having a flexible schemaless JSON storage API. But that is just the tip of the iceberg when it comes to flexibility. In this webcast we'll learn how replication can be used to share data securely, build offline-capable applications...

http://cdn.oreillystatic.com/images/people/weblogs/jan_jehnardt.jpgJan Lehnardt

What's new in CouchDB 0.11 & 1.0

By Jan Lehnardt | June 22, 2010
This webcasts highlights new features and refines in the latest and upcoming release of CouchDB. It rehashes old solutions to problems that are now way easier to solve. We look at how the new features help you make your life and development work easier...

http://cdn.oreillystatic.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson

CouchApp Evently Guided Hack w/ CouchDB

By J. Chris Anderson | May 20, 2010
Learn to hack jQuery CouchApps -- p2p web applications that can be deployed anywhere there's a CouchDB. Apache CouchDB can host HTML5 apps natively, serving them over HTTP. Learn how to write JavaScript CouchApps which run on both the client and ...

http://cdn.oreillystatic.com/images/people/weblogs/chris_anderson_2.jpgJ. Chris Anderson

Introduction to Apache CouchDB

By J. Chris Anderson | April 21, 2010
CouchDB is a distributed document database accessed via HTTP and JSON and queried using JavaScript Map Reduce. CouchDB focuses on simplicity and reliability, with a data replication model that makes it well suited for mobile and offline applications...

http://cdn.oreillystatic.com/images/people/weblogs/sean_hull.jpgSean Hull

DRBD and MySQL - An HA Match Made In Heaven

By Sean Hull | January 19, 2010
DRBD has grown in popularity as an excellent low-cost high availability solution for MySQL. It provides synchronous replication of your data without MySQL having to worry too much about the details. Combined with Linux Heartbeat, and you have automatic...

http://cdn.oreillystatic.com/images/people/weblogs/michael_milton.gifMichael Milton

Two Big Data Analysis Tricks for Everyone

By Michael Milton | October 28, 2009
Data analysis skills are critical to staying competitive in the 21st century economy. In this webcast the author of Head First Data Analysis, Michael Milton, provides some useful tips for common data problems that everyone faces.

http://cdn.oreillystatic.com/images/people/weblogs/sean_hull.jpgSean Hull

Hands-on: Step-by-step MySQL Clustering Setup

By Sean Hull | August 04, 2009
MySQL's Clustering solution provides some pretty sophisticated functionality. In this webcast we'll take you through getting it up and running on your laptop or single node server, building a sandbox where you can play with the dials and levers and get...

http://cdn.oreillystatic.com/images/people/weblogs/sean_hull.jpgSean Hull

MySQL Replication: Audit, Test, & Verify

By Sean Hull | January 22, 2009
In this live online event, Sean Hull (Oracle and Open Source) will talk about why MySQL slaves get out of sync with the master, both in terms of things that happen in the application and in MySQL's implementation of statement-based replication. He'll...