Machine Learning Pipeline using Luigi and Scikit Learn - PyConSG 2016

Published on: Wednesday, 6 July 2016

Speaker: Atreya Biswas

Description
Using Luigi and Scikit-Learn to create a Machine Learning Pipeline which trains a model and serves predictions through a REST API

Abstract
A Machine Learning Pipeline can be broadly thought of as a series of tasks, which include:

- Data Ingestion
- Data Cleaning
- Feature Extraction
- Training Models
- Hyper Parameter Optimization
- Model Evaluation
- Model Deployment

Luigi is Spotify's open-source Python framework for batch data processing; it handles dependency resolution, workflow management, visualisation, failure handling and monitoring. Scikit-Learn is one of the most popular and widely used Machine Learning libraries in Python. We will demonstrate how Luigi and Scikit-Learn can be used to orchestrate these Machine Learning tasks into a cohesive Machine Learning Pipeline.

Event Page: https://pycon.sg

Produced by Engineers.SG
