GoDataFest 2019 Schedule
GoDataFest takes place from Monday, October 28 to Friday, November 1.
Monday, October 28 - Amazon Web Services
Join Amazon Web Services, Binx.io and GoDataDriven for an exciting day jam-packed with the latest and greatest AWS has to offer around data and machine learning.
09:00 Opening Notes
09:15 Machine Learning in the Real World - Guy Kfir
10:00 Revving up with Reinforcement Learning – Ricardo Suerias (AWS) & Diederik Greveling (GoDataDriven)
An introduction to AWS DeepRacer and how it will enable you to get started with Reinforcement Learning.
including coffee break at 10:45
11:45 Setting up a Data Hub on AWS - Martijn van Dongen
How to set up a high-volume and scalable data platform on AWS.
13:15 Machine Learning Industrialization - Zhe Sun (AWS Professional Services)
To overcome the challenges of productionizing models from a PoC, AWS Professional Service team introduces a framework to deploy models faster and more manageable. Services such as AWS CodePipeline, AWS CodeCommit, AWS CodeBuild, AWS CloudFormation, Amazon Sagemaker, AWS Step Functions, AWS Lambda, AWS Glue, Amazon DynamoDB are used. We will bring this to life with one or two case studies (by AWS).
14:15 Elastic Kubernetes Service - Thijs Elferink
Learn about Amazon Elastic Kubernetes Service (Amazon EKS, a services that makes it easy to deploy, manage, and scale containerized applications using Kubernetes on AWS.
15:15 Formula One Race Insights in Real-Time with Serverless Machine Learning - Luuk Figdor (AWS Professional Services)
F1 pushes limits of both humans and technology. Long gone are the times when people could analyze race data without the use of technology, but now being competitive requires moving beyond past event analysis into live insights and predictions. To satisfy this demand, F1 decided to employ cloud-native technologies using AWS. Machine learning models created in AWS SageMaker and hosted on AWS Lambda allows F1 to pinpoint how a driver is performing, if they are pushing the car over the limit, and how their battle against other drivers will end.
These insights are immediately shared to fans all over the world through television and digital platforms. In this talk we will dive deep into the serverless machine learning architecture used for the application.
Attendees will learn about common pitfalls in serverless machine learning applications and how to overcome them. Lastly, we will walk through various tips and tricks for deploying machine learning models in AWS Lambda that will allow you to rapidly develop and deploy your machine learning application in truly serverless manner.
16:00 Racing with the DeepRacer
Doing laps with the DeepRacer with pre-trained models or a model that you trained yourself on the DeepRacer console.
19:00 AWS Meetup
In the evening a Amazon Web Services Meetup takes place, more info and registration for this: https://www.awsug.nl
Tuesday, October 29 - Microsoft Azure
09:00 Opening Notes by Rudy Doornewaard
09:15 Introduction to Microsoft Azure - Henk Boelman, senior cloud advocate, Microsoft
09:45 AI and IoT - Tony Krijnen, IoT Technology Strategist
10:30 Coffee break
11:00 DevOps for AI - Marcel de Vries and Niels Zeilemaker
11:30 Customer Stories
- Vattenfall (Rens Weijers, manager Data & Strategy): Developing smart applications on Azure
13:30 Workshops - Marian Dragt, AI MVP
- Custom Vision AI
- Visual Interface AMLS
Wednesday, October 30 - Databricks
09:00 Opening Notes by Susie Dobing
09:15 What's new at Databricks?
10:00 Quby - Making Homes Efficient and Comfortable using AI, IoT data and the full Databricks stack - Erni Durdevic
Quby is a leading company offering data driven home services technology across European markets, known for creating the in-home display and smart thermostat Toon. We enable our partners to take on a leading role in the home services domain, by offering data driven home services. Our services enable users to control and monitor their homes using both an in-home display and app.
As a data driven company, we use AI and machine learning, backed by Apache Spark, to generate actionable insights for all our end users. Via our IoT devices we have access to Europe’s largest energy dataset, petabytes in scale and growing. This unique dataset enables us to introduce new data driven services, with a particular focus on homes with smart meter installations.
In this talk Erni will take you on a tour of how Quby leverages the full Databricks stack to quickly prototype, validate, scale and launch data science products. We will explore the technical workflow of a Data Science project from end to end. Starting from developing a notebook prototype and tracking the Machine Learning Model performance with ML Flow, we move towards production-grade Databricks jobs with a CI/CD pipeline, debugging production code with Databricks Connect, and finally setting up a monitoring system for the jobs.
10:30 Coffee break
11:00 Customer Stories
13:00 Data pipelines with ML Flow
14:00 Technical Breakout Sessions
ML: Scaling Deep Learning using HorovodRunner - Sanne de Roever
DE: Delta Lakes and Streaming
19:00 Data Council Meetup - with Tim Hunter
Koalas: pandas APIs on Apache Spark
In this talk, Tim will present Koalas, a new open source project that was announced at the Spark + AI Summit in April. Koalas is a Python package that implements the pandas API on top of Apache Spark, to make the pandas API scalable to big data. Using Koalas, data scientists can make the transition from a single machine to a distributed environment without needing to learn a new framework.
Tim will demonstrate Koalas' new functionalities since its initial release, discuss its roadmaps, and how he envisions Koalas could become the standard API for large scale data science.
Thursday, October 31 - Google
09:30 Opening Notes
09:45 Cloud Data Fusion: Data Integration at Google Cloud- Rokesh Jankie, Customer Engineer Google Cloud
10:45 Coffee break
11:15 Customer Stories:
- Mollie - From cloudy to the cloud: Mollie's data transformation
- WEBB traders - Increasing profitability with back testing in the cloud
- Nico Lab - How StrokeView, an AI-powered clinical decision support system, offers a complete assessment of relevant imaging biomarkers for faster and more accurate treatment decisions in stroke, where every second counts.
13:30 Democratizing Machine Learning with BigQuery ML - Abishay Rao
A demo-driven session with BigQuery ML, showing how BQML democratizes ML and can be incredibly useful in real-world scenarios.
14:30 The Future of BI is a Data Platform - Sebastien Fabri
An introduction to Looker, part of Google Cloud, by Sebastien Fabri. Looker delivers insights to user workflows, allowing organizations to extract value from their data
15:30 The What and Why of Serverless – a talk by O'Reilly author Wietse Venema about serverless on GCP.
19:00 Google Developers Group Meetup - More info
Friday, November 1 - Open-Source
The final day of GoDataFest is all about the latest open-source technology.
09:30 Taking Machine Learning Models into Production - Julian de Ruiter
In this interactive session we will look at the technical challenges associated with getting data science products into production. So, our target audience for this breakfast are people with a technical role, like data scientist, data engineer or machine learning engineer.
In detail, we will look into the steps needed to move a simple data science model from a Jupyter Notebook into a more production-ready Python package. Besides this, we will also explore how to expose this model in a simple web API and how to deploy this API in a containerized environment using Docker.
11:45 Implementing JPEG in Python - Cor Zuurmond
Hundreds of millions of pictures are taken everyday. These pictures are stored on mobile devices, computers and servers. How are these pictures stored efficiently? To answer this question I will explain how the JPEG algorithm efficiently compresses images with minimum information loss.
In this talk, Cor will explain the different compression steps, and more importantly why these steps are needed. After this talk, you have an understanding of the JPEG algorithm. An understanding which will be useful to have whenever you work with images!
15:00 Screening of the Dataiku Data Science Pioneers Documentary
With humor and humanity, DATA SCIENCE PIONEERS presents a documentary about passionate data scientists driving us towards technological revolution.
16:00 GoDataFest Party - Drinks, fun and all around joy