Home // Design and Development of a Speech Collection Software for Spoken Natural Languages

Supervisor

Dr. Kossi Amouzouvi

Center for Interdisciplinary Digital Sciences (CIDS)

TUD Dresden University of Technology

kossi.amouzouvi@tu-dresden.de

Design and Development of a Speech Collection Software for Spoken Natural Languages

Status: open / Type of Theses: Bachelor Theses, Seminar Theses / Location: Dresden

Pretrained Large Language Models (LLMs) excels in natural language processing/understanding tasks by leveraging vast amounts of data collected from different sources. One particular example is text translation. Despite this prowess, LMs are biassed by the unbalanced nature of the data. In the particular case of sentence translation, they completely fail to translate popular languages such as English, French, and Deutsch to/from minority languages such as indigenous languages spoken in Africa. It is therefore crucial to reduce this balance by collecting high quality majority-minority paired data.

The aim of this project is to design and develop an innovative mobile application specifically focused on collecting spoken sentences corresponding to the translation of a sentence in French to an indigenous language spoken in Africa. The app will provide users with a series of French sentences displayed on the screen, which they will translate to their mother language, and record the pronunciation with the app. The collected data will contribute to the enhancement of speech recognition systems and language processing technologies.

Student’s Responsibilities

Conduct thorough research on existing speech collection approaches and technologies.
Design the user interface (UI) and user experience (UX) for the software, ensuring it is intuitive and easy to navigate.
Develop the app using appropriate programming languages or frameworks.
Implement speech recognition algorithms to accurately capture users’
pronunciations.
Integrate features for data storage, retrieval, and analysis in line with privacy
regulations.
Test and debug the app to ensure its functionality across different devices and
operating systems.

Qualifications Required

Proficient knowledge in software development (iOS or Android).
Strong background in programming languages such as Swift, Java, or similar.
Understanding of speech recognition technologies and algorithms.
Experience with UI/UX design principles.
Passion for language processing technologies would be an asset.

By undertaking this project, not only will you gain valuable experience in designing and developing mobile applications but also contribute towards advancing speech recognition systems using real-world data collected from native speakers. If you are interested in pursuing this exciting opportunity as your bachelor thesis project or a research project, please contact Dr. Kossi Amouzouvi at kossi.amouzouvi@tu-dresden.de to discuss further details.

funded by:

Gefördert vom Bundesministerium für Bildung und Forschung.

ScaDS.AI Dresden/Leipzig (Center for Scalable Data Analytics and Artificial Intelligence) is a center for Data Science, Artificial Intelligence and Big Data with locations in Dresden and Leipzig.

Dresden

Visitor address Technische Universität Dresden
ScaDS.AI Dresden/Leipzig
Bürogebäude Strehlener Straße
Strehlener Straße 12, 14
01069 Dresden

Postal address Technische Universität Dresden
Zentrum für Informationsdienste und Hochleistungsrechnen
ScaDS.AI Dresden/Leipzig
01062 Dresden

Leipzig

Visitor address ScaDS.AI Dresden/Leipzig
Löhrs Carré
Humboldtstraße 25,
3. Obergeschoss
04105 Leipzig

Postal address Universität Leipzig
Data Science Zentrum
Internes Postfach: 212104
04081 Leipzig

Quicklinks:

Accessibility

Imprint

Privacy

About us

Research

Education

Transfer and Service

Living Lab