Lightning Talks & Using Python at Scale for Data ScienceDate: This event took place live on September 29 2015 Location: While Python is a de-facto language for modern data engineering and data science, Python development has been confined to local data processingâ€â€thereby limiting its users to smaller data sets. Historically, to address bigger data workloads, Python developers have had to extract samples or aggregates, forcing compromises in data fidelity, adding ETL costs, and ultimately leading to a loss of productivity and addressable use cases.
Ibis, a new open source data analytics framework for Python developers, has the goal of enabling the Python data ecosystem (NumPy, pandas, etc.) to operate efficiently at Hadoop scale. To enable high performance Python at scale without the age-old JVM interoperability problems, Ibis take advantage of unique synergies between Python and Impala, the leading open source MPP analytical query engine. In this talk, Ibis creator Wes McKinney, who was also the creator of pandas, will demo the current capabilities of Ibis as well as explain its roadmap.
More information about this event is available at: http://www.meetup.com/nycpython/events/224225046/ Return to O'Reilly Events |
|
|