
Code Generation with Large Language Models: Analysis of Generated Code Performance and Language Model Tuning Strategies

Status: finished / Type of Thesis: Master's thesis / Location: Dresden

High-performance computing faces a significant productivity gap between high-level programming languages used for rapid prototyping and low-level languages required for optimal performance. This thesis investigates whether large language models can bridge this gap by automating code translation from sequential Python to parallel implementations in NumPy, JAX, C++ with OpenMP, and CUDA. A novel benchmark covering core HPC computational patterns was constructed and used to evaluate DeepSeek-Coder models ranging from 1.3B to 33B parameters. Base models achieved moderate success on simpler translations but struggled significantly with CUDA and tensor frameworks. Supervised fine-tuning on a manually curated dataset substantially improved both functional correctness and performance optimization, outperforming models trained on synthetic data. The findings indicate that while current-generation coding LLMs cannot yet guarantee sufficient reliability for production HPC workflows, fine-tuning on high-quality datasets shows promise for improving their code translation capabilities.
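To illustrate the kind of translation studied here, the following sketch shows a sequential Python loop and an equivalent vectorized NumPy formulation. The example is illustrative only and not taken from the thesis benchmark; the AXPY-style kernel and the function names are assumptions.

import numpy as np

def axpy_sequential(a, x, y):
    # Sequential Python: element-by-element loop, the starting point of the translation.
    result = [0.0] * len(x)
    for i in range(len(x)):
        result[i] = a * x[i] + y[i]
    return result

def axpy_numpy(a, x, y):
    # Vectorized NumPy version: the whole operation runs in optimized native code.
    return a * np.asarray(x) + np.asarray(y)

# Both versions should agree on the same inputs.
x = [1.0, 2.0, 3.0]
y = [4.0, 5.0, 6.0]
print(axpy_sequential(2.0, x, y))  # [6.0, 9.0, 12.0]
print(axpy_numpy(2.0, x, y))       # [ 6.  9. 12.]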

Funded by:
Funded by the Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung).
Funded by the Free State of Saxony (Freistaat Sachsen).