Go Data Driven BLOG!

Welcome to the Go Data Driven BLOG.

This is the place where we share our knowledge and opinions. We will try to post new content regularly.
Enjoy, the GoDataDriven team.

Migrating your Hadoop workloads to the Cloud

22 Apr

What to think about when you are migrating your Hadoop workloads to the Cloud

Read more...


GoDataDriven announces Data Council NL community

19 Apr

GoDataDriven announces Data Council NL community

Read more...


GoDataDriven Open Source Contribution for February 2019, the first Open Source Initiatives edition

05 Apr

It has been quite some time we have been publishing what we give back to the community. But is what we do enough?

Read more...


Open Sourcing Airflow Local Development

05 Apr

Fast iterative local development and testing of Apache Airflow workflows

Read more...


A Practical Guide to Using Setup.py

26 Mar

Making your python project installable benefits everyone who uses it. This blog shows you by example how to make your project installable using a setup.py file.

Read more...


Docker Hub Tips and Tricks

19 Mar

How we use Docker Hub to build some our Docker images

Read more...


How to style transfer your own images

15 Mar

The term "style transfer" is used to describe the operation of recomposing one image in the style of another. In this blog, we demonstrate two approaches on how to do this yourself: neural style and cycle-consistent adverserial networks.

Read more...


It's time to trust your predictions

05 Mar

When you find yourself building a prediction machine where you are both looking for the best model and a fair estimate of its performance this blog is for you. Especially so when you are working with time series data.

Read more...


Testing and debugging Apache Airflow

22 Feb

One of the questions I get asked the most about Apache Airflow is how to shorten the development cycle of pushing code, deploying, and manually triggering a DAG for verification to something that is locally testable without running on a live system. In this blog post I provide several pointers to testing and debugging Apache Airflow on your local machine.

Read more...


The Zen of Python and Apache Airflow

18 Feb

Apache Airflow is a Python framework for programmatically creating workflows in DAGs. This allows for concise and flexible scripts but can also be the downside of Airflow; since it's Python code there are infinite ways to define your pipelines. The Zen of Python is a list of 19 Python design principles and in this blog post I point out some of these principles on four Airflow examples.

Read more...