Uber’s Plug and Play Language Model (PPLM) Allows Steering Topic and Attributes of GPT-2 Models

It’s impressive that Generative models like Open AI’s GPT-2 automatically create texts using limited input. But controlling the attributes (topics, context, sentiment) of these texts, and paragraphs need an extra layer of work that includes architectural modifications/specific data understanding, etc. This work is done by a team of professionals from Uber, Caltech, and the Hong Kong University of Science and Technology. They worked on the model and created the Plug and Play Language Model (PPLM), which takes one or two attributes classifier and combines it with a pre-trained language model.

Paper with Initial Results: https://arxiv.org/pdf/1912.02164.pdf

Github: https://github.com/uber-research/PPLM

Colab Notebook: https://colab.research.google.com/drive/1Ux0Z4-ruiVtJ6jUk98uk6FqfvGHCOYL3

Demo: https://transformer.huggingface.co/model/pplm



Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts of over 2 million monthly views, illustrating its popularity among audiences.

🚀 LLMWare Launches SLIMs: Small Specialized Function-Calling Models for Multi-Step Automation [Check out all the models]