Status: open / Type of Theses: Bachelor Theses, Master theses, PhD Theses / Location: Dresden
Most anonymization methods assume that once identifiers are removed or replaced, privacy is preserved. However, modern LLMs can infer hidden identities from context, background knowledge, and subtle cues. This creates a new threat: even anonymized text may be re-identifiable by intelligent models.
This project investigates the robustness of current anonymization methods against such LLM-based attacks. In particular, it aims to systematically evaluate whether different anonymization strategies (e.g., masking, generalization, and context-aware rewriting) effectively prevent identity inference, and under which conditions they fail. By treating LLMs as adversaries, the project seeks to better understand the limits of existing anonymization techniques and to inform the design of more resilient, privacy-preserving approaches.