Machine Learning Application Skeleton
In this blog post I provide an overview of a Python skeleton application I made. This skeleton can help you bridge the gap between your model and a machine learning application.
For example, you can use your existing Flask application, import it in
app, and this will add the production ready features of Gunicorn.
The times when business saw machine learning models as black boxes with no hope of understanding are long gone.
It use to be that the data analytics or data science department of a company produced results in a silo kind of environment. Little or no interaction took place between these departments and the business side making the decisions (marketing, sales, client support, etc.). Advice coming from machine learning models consisted of reports, which were nice to have if they supported ideas from the business.
As data driven decisions demonstrated their value, the business side started peeking behind the curtain.
Paper/files reports have been substituted by static reporting dashboards, which themselves are being replaced by interactive ones. The business end users want to interact with the models, understand why certain predictions are made and evenmore, they want to be capable of performing predictions on the fly (imagine simultaneously having a customer on the phone and updating the probabilities of him/her buying certain products, or a marketing department tuning campaigns themselves depending on regional features).
In short, I had some time during a rainy weekend and a GDD Friday1, already did something similar for a client and I think it is important to bring machine learning models to the business side.
Also, as a bonus they will stop bothering you every time they need insights or a slightly different prediction.
What's in the goody bag?
This allows the application to be run in a more production ready environment (multiple workers and threads for example). In here you can find a complete list of all the possible Gunicorn settings. I added the possibility to use some of them as command line arguments. Some relevant ones are:
workers- define number of workers.
threads- number of threads on each worker.
daemon- run application in the background.
access-logfile- save access logs to a file.
forwarded-allow-ips- list allowed IP addresses.
Dummy application which demonstrates how to ingest several types of user inputs into your Python application.
Debug mode which (similar to Flask) will
- run a single process
- logging to debug level
- restart process on code change
- reload html and jinja templates on change
Dockerfile template to containerize the application.
Interactive application which runs a classifier model, outputs predictions and information about the machine learning model.
The model can be run by using the UI or by directly making a post request to the endpoint.
A more complete description, a set of instructions and the code can be found in this repository.
Note: I also include a
setup.py file that you should use to install your package used in the application.
I hope this work can help you bring your models into a machine learning application, it certainly helped and will help me in the future. You can find the code here.
If you have any other questions just ping me in twitter @rragundez.
One Friday a month when we get to do whatever we want, it is awesome. ↩
Follow us for more of this
Testing and debugging Apache Airflow
February 22, 2019
The Zen of Python and Apache Airflow
February 18, 2019
AWS Machine Learning Competency Status for GoDataDriven
February 14, 2019
GoDataDriven Open Source Contribution for January 2019, the Apache Edition
February 13, 2019
Our social responsibility as a company
February 08, 2019
Keras: multi-label classification with ImageDataGenerator
January 31, 2019