Go Data Driven BLOG!

EuroPython 2018

15 Aug

I don't have to tell you Edinburgh is all about history. In contrast, between the 23rd and 29th of July more than 1300 Python enthusiasts gathered there to talk about the future.



Write less terrible code with Jupyter Notebook

05 Aug

How can you quickly go from prototype to production code using Jupyter Notebooks?



GoDataDriven open source contribution: July 2018 edition

01 Aug

Welcome to the July 2018 edition of Open Source at GoDataDriven.



Dynniq presentation video at AI Expo Europe 2018

31 Jul

Dynniq was invited to present its AI use cases at AI Expo Europe 2018 in RAI Amsterdam. Watch the recording of the presentation here.



Working with multiple partition formats within a Hive table with Spark

31 Jul

Having different file formats (Avro and Parquet) for the same data source is a problem we often encounter. We can create a partitioned table on top of this data: Hive lets you alter the file format of a given partition, so all the data can be accessed through one table. We discovered that Spark doesn't support this functionality yet, so we started investigating how we could add it.



Handling encoding issues with Unicode normalisation in Python

28 Jul

When reading from and writing to various systems, it is not uncommon to encounter encoding issues when the systems have different locales. In this post I show several options for handling such issues.
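
As a rough illustration of the kind of issue meant here (a minimal sketch, not taken from the post itself): the same visible string can arrive as different code point sequences depending on the system that produced it. The helper names `normalise` and `strip_accents` are hypothetical; only the standard library's `unicodedata` module is used.

```python
import unicodedata


def normalise(text: str, form: str = "NFC") -> str:
    """Return text in the given Unicode normalisation form (hypothetical helper)."""
    return unicodedata.normalize(form, text)


def strip_accents(text: str) -> str:
    """Decompose characters and drop combining marks (hypothetical helper)."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# "é" as one precomposed code point versus "e" followed by a combining accent.
composed = "caf\u00e9"
decomposed = "cafe\u0301"

# The two strings look identical but compare unequal until normalised.
print(composed == decomposed)
print(normalise(composed) == normalise(decomposed))
print(strip_accents(composed))
```

Normalising both sides to the same form before comparing or storing is one option; stripping accents is lossy and only suits cases such as building search keys.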



Lint your Dockerfile with Hadolint

19 Jul

As part of good engineering practice, integrate a Dockerfile linter into your CI pipeline.



Following or leading in data? Participate in Data Survey 2018!

09 Jul

Interested in a benchmark of your Data Science and Data Engineering efforts against those of your peers? Participate in Data Survey 2018 to find out.



GoDataDriven presentations at PyData Amsterdam 2018

05 Jul

GoDataDriven headed the organizing committee of PyData Amsterdam 2018. Watch the recordings of our presentations here.



GoDataDriven open source contribution: June 2018 edition

20 Jun

Welcome to the June 2018 edition of Open Source at GoDataDriven.
