How to train a wacky language model - PyCon SG 2019

Published on: Monday, 11 November 2019

Speaker: Jonathan Heng, Software Developer

I will talk about GPT-2, a language model developed by OpenAI, and how we can fine-tune it on various training sources to get interesting results. These will be demonstrated in TensorFlow. The ethical implications of such models will be discussed as well. If time permits, I will also demonstrate a simple deployment as a Flask app on a cloud platform (GCP).

Jonathan is a software developer at ThoughtWorks. He has three years of experience building machine learning models for both research and business. While waiting for his models to train, Jonathan listens to music and contemplates when the bots will rule the world.

