Thomas Pfaff: Advanced Data Storage

In this tutorial we will give an introduction to two advanced data storage formats, HDF5 and NetCDF. Both were designed to efficiently store the results of supercomputing applications such as climate model outputs, or the data streams received from NASA's fleet of Earth-observing satellites. They provide many optimizations, including transparent file compression, fast access, and the ability to work with multiple files as if they were one large data set. Several Python libraries exist that allow fast and Pythonic access to these formats. We will show you how to create and access these types of files from Python, and how to use their advanced features to tune them for maximum efficiency.
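As a rough illustration of the workflow described above, the following sketch creates and reads an HDF5 file with h5py and NumPy. The abstract does not name the libraries used in the tutorial, so the choice of h5py is an assumption, and the dataset name, chunk shape, and compression level are hypothetical values picked for illustration only.

```python
import os
import tempfile

import h5py
import numpy as np


def write_and_read(path):
    """Write a chunked, gzip-compressed dataset, then read part of it back."""
    # Hypothetical example data: a 100 x 100 grid of float32 values.
    data = np.arange(10000, dtype="float32").reshape(100, 100)

    with h5py.File(path, "w") as f:
        # Chunking and compression are set per dataset; compression is
        # transparent, so readers get plain arrays back.
        dset = f.create_dataset(
            "temperature",          # hypothetical dataset name
            data=data,
            chunks=(10, 100),       # assumed chunk shape: 10 rows per chunk
            compression="gzip",
            compression_opts=4,     # assumed gzip level
        )
        dset.attrs["units"] = "K"   # metadata is stored alongside the data

    with h5py.File(path, "r") as f:
        dset = f["temperature"]
        # Slicing reads only the requested part of the dataset from disk.
        block = dset[0:10, :]
        return block, dset.attrs["units"], dset.compression, dset.chunks


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        block, units, compression, chunks = write_and_read(
            os.path.join(tmp, "demo.h5")
        )
        print(block.shape, units, compression, chunks)
```

Matching the chunk shape to the expected read pattern (here, whole rows) is one of the tuning choices such formats expose; a different access pattern would likely favor a different chunk shape.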

Related Talks

Microservices at Spotify

This presentation was recorded at GOTO Berlin 2015 (http://gotober.com). Kevin Goldsmith - Vice President, Engineering at Spotify. ABSTRACT: The software industry used to be all about building monoliths: monolithic applications and services, with big-bang product releases. All that has now changed [...] Download slides and read the full abstract here: http://gotocon.com/berlin-2015/presentation/Microservices%20@%20Spotify ...

PyPy.js: What? How? Why? by Ryan Kelly

PyPy.js is an experiment in building a fast, compliant, in-browser Python interpreter. By compiling the PyPy interpreter into JavaScript, and retargeting its JIT compiler to emit asm.js code at runtime, it is possible to run Python code in the browser at speeds competitive with a native Python environment. ...

Facts and Myths about Python names and values

Ned Batchelder
25 minutes
The behavior of names and values in Python can be confusing. Like many parts of Python, it has an underlying simplicity that can be hard to discern, especially if you are used to other programming languages. Here I'll explain how it all works, and present some facts and myths along ...

Haskell Amuse-Bouche

Google Tech Talk, October 14, 2011. Presented by Mark Lentczner. ABSTRACT: Want to know a little more about programming Haskell than just the buzzwords? This talk will show you some of the joys of coding in Haskell through lots and lots of code examples. No prior experience with Haskell or functional programming required. ...

Rob Story - Up and Down the Python Data and Web Visualization Stack

PyData SV 2014. In the past two years, there has been incredible progress in Python data visualization libraries, particularly those built on client-side JavaScript tools such as D3 and Leaflet. This talk will give a brief demonstration of many of the newest charting libs: mpld3 (using Seaborn/ggplot), nvd3-python, ggplot, Vincent, ...

Andrew Odlyzko: Turing and the Riemann zeta function

Andrew Odlyzko, Professor of Mathematics at the University of Minnesota, discusses "Turing and the Riemann zeta function" in a lecture given on the occasion of Princeton University's centennial celebration of Alan Turing. Learn more at www.princeton.edu/turing #turingprinceton ...

Bugra Akyildiz - Outlier Detection in Time Series Signals

PyData SV 2014. Many real-world datasets have missing observations, noise and outliers, usually due to logistical problems, component failures and erroneous procedures during the data collection process. Although it is easy to avoid missing points and noise to some level, it is not easy to detect wrong measurements and outliers ...