Go Data Driven BLOG!

How to Write CodeĀ Using The SparkĀ Dataframe API: A Focus on Composability And Testing

27 Jan

I was recently thinking about how we should write Spark code using the Dataframe API. In this post I'll guide you through the different options

Read more...


Join Us on February 23rd for Google Hashcode

23 Jan

Fun algorithms/optimization competition where you can solve real Google problems

Read more...


Use a SSH-key to access your cloud resources with socks-proxy

08 Jan

Securely access your cloud resources with a socks-proxy. Example how to create a SSH-key and use the public key to create a new Linux machine. Configure a sock proxy to access the remote websites.

Read more...


Solving hard data problems with causal data science

29 Dec

It is tempting for organizations to find biased answers in their data and draw faulty conclusions, like mixing causation with correlation. Adam Kelleher, lead data scientist at Buzzfeed, emphasizes that this is not without risk.

Read more...


Bringing models into production

03 Dec

In Data Science, software quality often is an issue that prevents models to hit production. How can you successfully bring data science models into production?

Read more...


Devoxx 2016

26 Nov

Devoxx goes beyond Java with machine learning, streaming apps and data, and cloud.

Read more...


Data Innovation in a Pressure Cooker: Schiphol Data Science Hackathon

06 Nov

On October 28 Schiphol Airport, Microsoft, and GoDataDriven, organized the Schiphol Data Science Hackathon. Five teams of data scientists worked together on various unique data sets provided by the airport.

Read more...


Retrospect on Spark Summit 2016

04 Nov

At the Spark Summit this year, GoDataDriven was asked to deliver training and to do a key note presentation. Needless to say, we were honored and took on this opportunity with two hands.

Read more...


Big Data Survey 2016: Does a budget increase lead to successful data projects?

01 Nov

Insight in the main results of Big Data Survey 2016, the international research on the use of data.

Read more...