GoDataDriven Open Source Contribution for February 2019, the first Open Source Initiatives edition
It has been quite some time we have been publishing what we give back to the community. But is what we do enough?
Last year we proposed to allocate 12 weeks a year (across the whole company) to work on open source projects. Data engineers and data scientists would have — twice a year, hence 6 weeks at a time — the possibility to pitch their own favourite open source project and get time — on GoDataDriven — to work on it.
We had numerous pitches, and many winners — we didn't want to allocate 6 weeks in a single project.
The winning projects were
- Improving DAG parsing in Apache Airflow;
- Improve documentation in evol;
- A new project to set up Airflow locally to ease development;
- Coordinating the release of Apache Avro 1.9.
Everyone is very excited about these projects and we will write in this channel what comes out of it!
While on boarding a new colleague, we forgot to tell him that he should let us know how much he contributes back to the community. But we didn't forget to tell him that we give back to the community, so he did lots of work in the meantime.
Fokko was prolific as always with 3 PRs to Airflow, 9 PRs to Apache Avro, PR 7681 to Apache Flink, 2 PRs to Apache parquet-mr, four PRs to Apache Sqoop, and PR 152 to the bigdata-interop repository of Google Cloud.
That's it for this edition! If you, like many others, are searching for you next challenge for 2019, don't forget we're hiring! (Especially software and data engineers!).
For more goodness follow me on Twitter: I'm gglanzani there!
Follow us for more of this
Are sklearn defaults wrong?
September 03, 2019
Improved wireless coverage using an old router
August 28, 2019
Data Driven Board Game Design
August 23, 2019
Real time analytics: Divolte + Kafka + Druid + Superset
August 22, 2019
DeepCS - Berlin Buzzwords 2019
July 26, 2019
Fairness in AI - Dutch Data Science Week 2019
July 23, 2019