From (Big) Data Mess to Data as an Innovation Enabler
GOTO Berlin 2018

Thursday Nov 1, 13:50 – 14:40, C 01

It's no secret that collecting and processing data is a double-edged sword. On the one hand, data is the enabler of the AI and ML applications that drive the modern organisation forward. On the other, it takes constant effort to keep data accurate and useful, and extreme diligence to make sure that it doesn't fall into the wrong hands. This talk looks at the data journey of one of the world's largest internet companies, OLX Group: from data collection through data democratisation to data products and data innovation, on a platform with as many monthly active users as Twitter.

We will cover:

  • How to collect and store billions of events and records per day
  • How to aggregate data from multiple platforms
  • How to design a data lake/reservoir architecture in the AWS cloud
  • How to give everyone access to the data they need
  • How to distribute data in a secure and compliant manner
  • How to build a scalable, easy-to-use reporting infrastructure
  • How to drive data innovation and data products with the help of AWS SageMaker, TensorFlow and other ML tools