Dublin is a charming city and a burgeoning technology hub, but it also has special significance for anyone whose work involves making sense of data, since William Sealy Gosset was working as the head brewer at Guinness when he developed the t-statistic. Last week, Dublin had extra special significance for anyone whose work involves using Apache Spark for data processing. Our group at Red Hat gave three talks at Spark Summit EU this year, and videos of these are now online. You should check them out!
- One Pass Data Science In Apache Spark With Generative T-Digests , by Erik Erlandson,
- Fire in the Sky: An Introduction to Monitoring Apache Spark in the Cloud, by Mike McCune, and
- Building Machine Learning Algorithms on Apache Spark, by Will Benton.
A lot of the work we discussed is available from radanalytics.io or from the Isarn project; if you’d like to see other talks about data science, distributed computing, and best practices for contemporary intelligent applications, you should see our team’s list of presentations.