Go Data Driven BLOG!

Spark surprises for the uninitiated

28 Jan

Recently I was delivering a Spark course. One of the exercises asked the students to split a Spark DataFrame in two, non-overlapping, parts.

Read more...


Highlights from the new Apache Airflow 1.10.2 release

23 Jan

Apache Airflow 1.10.2 is released and we highlight some of the most interesting features.

Read more...


Turning off our Ethereum miner

23 Jan

1,5 years ago we build our very own Ethereum miner, today we turned it off. What did we learn?

Read more...


GoDataDriven open source contribution: December 2018 edition

14 Jan

Welcome to the Open Source at GoDataDriven, December 2018 edition

Read more...


Using the Airflow Experimental Rest API to trigger a DAG

12 Jan

Using Airflow and Azure Functions to trigger a DAG remotely.

Read more...


Apache Airflow graduation as Apache Top-Level

08 Jan

Apache Airflow graduated from incubation to an Apache Top-Level project

Read more...


Data Survey 2018/2019 - Data 50

07 Jan

What is the most popular data technology when it comes to data, cloud platforms, and data visualization tools? In this article, we share the Data-50, the 50 most popular data technologies of 2019.

Read more...


Use a SSH-key to access your cloud resources with socks-proxy

31 Dec

Securely access your cloud resources with a socks-proxy. Example how to create a SSH-key and use the public key to create a new Linux machine. Configure a sock proxy to access the remote websites.

Read more...


Looking Back at our Deep Learning Frenzy

28 Dec

We tried to tackle 140 hours of fast.ai's Deep Learning MOOC in 7 days, how did it go?

Read more...


GDD tackles Deep Learning at warp speed

14 Dec

A group of plucky engineers and scientists challenge themselves to tackle fast.ai's deep learning MOOCs in 7 days.

Read more...