Home // GPT-3, BERT & Co.: When to use which language model?

Supervisor

Prof. Dr.-Ing. Michael Färber

Chair of Scalable Software Architectures for Data Analytics

TUD Dresden University of Technology

michael.faerber@tu-dresden.de

GPT-3, BERT & Co.: When to use which language model?

Status: open / Type of Theses: Bachelor Theses, Master theses / Location: Dresden

Objectives

In the past, a variety of language models has been proposed, such as GPT-3 and BERT [1], for natural language tasks (e.g., question answering, named entity recognition, text summarization). However, data scientists and AI researchers increasingly loose an overview when to use which language model.

In this thesis, the task is to obtain an overview of state-of-the-art language models and to create a conceptual framework (e.g., criteria), which can be used by any researcher or practitioner to quickly know when to use which model.

Note that this thesis is mainly on a conceptual level. No language models needs to be executed or trained. However, the student could implement a small online demonstration system as implementation of the above mentioned framework (e.g., input: problem description; output: recommended language model).

Prerequisites

Basic data processing skills (e.g., in Python).
Ability to work independently on the topic (based on inputs from the supervisor).
Interest in publishing an own research paper based on the written thesis.

[1] https://analyticsindiamag.com/top-ten-bert-alternatives-for-nlu-projects/

funded by:

Gefördert vom Bundesministerium für Bildung und Forschung.

ScaDS.AI Dresden/Leipzig (Center for Scalable Data Analytics and Artificial Intelligence) is a center for Data Science, Artificial Intelligence and Big Data with locations in Dresden and Leipzig.