Recently I was delivering a Spark course. One of the exercises asked the students to split a Spark DataFrame in two, non-overlapping, parts.
Apache Airflow 1.10.2 is released and we highlight some of the most interesting features.
1,5 years ago we build our very own Ethereum miner, today we turned it off. What did we learn?
Welcome to the Open Source at GoDataDriven, December 2018 edition
Using Airflow and Azure Functions to trigger a DAG remotely.
Apache Airflow graduated from incubation to an Apache Top-Level project
What is the most popular data technology when it comes to data, cloud platforms, and data visualization tools? In this article, we share the Data-50, the 50 most popular data technologies of 2019.
Securely access your cloud resources with a socks-proxy. Example how to create a SSH-key and use the public key to create a new Linux machine. Configure a sock proxy to access the remote websites.
We tried to tackle 140 hours of fast.ai's Deep Learning MOOC in 7 days, how did it go?
A group of plucky engineers and scientists challenge themselves to tackle fast.ai's deep learning MOOCs in 7 days.