JavaScript is required to use this site. Please enable JavaScript in your browser settings.

Supervisor

GPT-3, BERT & Co.: When to use which language model?

Status: open / Type of Theses: Bachelor Theses, Master theses / Location: Dresden

Objectives

In the past, a variety of language models has been proposed, such as GPT-3 and BERT [1], for natural language tasks (e.g., question answering, named entity recognition, text summarization). However, data scientists and AI researchers increasingly loose an overview when to use which language model.

In this thesis, the task is to obtain an overview of state-of-the-art language models and to create a conceptual framework (e.g., criteria), which can be used by any researcher or practitioner to quickly know when to use which model.

Note that this thesis is mainly on a conceptual level. No language models needs to be executed or trained. However, the student could implement a small online demonstration system as implementation of the above mentioned framework (e.g., input: problem description; output: recommended language model).

 

Prerequisites

  •  Basic data processing skills (e.g., in Python).
  •  Ability to work independently on the topic (based on inputs from the supervisor).
  •  Interest in publishing an own research paper based on the written thesis.

 

 

[1] https://analyticsindiamag.com/top-ten-bert-alternatives-for-nlu-projects/

funded by:
Gefördert vom Bundesministerium für Bildung und Forschung.
Gefördert vom Freistaat Sachsen.