Machine Learning Pipeline using Luigi and Scikit Learn - PyConSG 2016

Published on: Wednesday, 6 July 2016

Speaker: Atreya Biswas

Using Luigi and Scikit-Learn to create a Machine Learning Pipeline which trains a model and predict through a Rest API

A Machine Learning Pipeline can be broadly thought of as many tasks which includes - Data Ingestion - Data Cleaning - Feature Extraction - Training Models - Hyper Parameter Optimization - Model Evaluation - Model Deployment. Luigi is Spotify's open sourced Python framework for batch data processing including dependency resolution, workflow resolution, visualisation, handling failures and monitoring. Scikit-Learn is the most popular and widely used Machine Learning Library in Python. We will demonstrate how Luigi and Scikit-Learn can be used to orchestrate the Machine Learning Tasks, hence creating a cohesive Machine Learning Pipeline.

Event Page:

Produced by Engineers.SG

Help us caption & translate this video!