Web-Scale Domain-Specific Information Extraction
Information Extraction (IE) from unstructured texts is a technology with growing importance in many applications. Important challenges to IE are the achievement of high quality results, scalability of methods to very large corpora, and integration of IE results with other data for downstream analysis. In this talk, we will highlight recent advances and open questions in these areas by drawing from extensive experiences in developing and applying IE for biomedical research.
Ulf Leser studied computer science at the Technische Universität München and holds a PhD in Data Integration from Technische Universität Berlin. After positions at the Max-Planck-Institute for Molecular Genetics and in the private sector, be became a professor for Knowledge Management in Bioinformatics at Humboldt-Universität zu Berlin. His research focuses on scientific data management, statistical Bioinformatics, biomedical text mining and infrastructures for large-scale data analysis. He approaches these topics in interdisciplinary projects with colleagues from biology and medicine. He is speaker of the DFG-funded graduate school SOAMED (Service-oriented architectures for medical applications), the BMBF-funded coordinated project PREDICT (Comprehensive Data Integration for Cancer Treatment) and a board member of the DFG-excellence funded Berlin School for Integrative Oncology (BSIO).