NewSQL, SQL on Hadoop
A comparison between popular NoSQL databases and relational database management systems shows two significant differences. First, NoSQL databases typically provide a simple data model with limited query capabilities and do not support powerful query languages such as SQL. Second, they do not support ACID transactions; instead, they offer weaker consistency guarantees in exchange for scalability and performance.
In the first part of the talk, I will examine how declarative queries can be automatically translated for execution on a Hadoop infrastructure. For example, I will introduce Hive, which is especially suited to data-warehouse-style SQL queries on HDFS files. In the second part, I will give an overview of NewSQL systems, a new class of database management systems that maintain ACID guarantees. NewSQL systems (e.g., VoltDB, Google Spanner) support the relational data model and SQL while still achieving scalability in a distributed cloud environment.
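To make the idea of query translation concrete, the following is a minimal sketch, in plain Python rather than Hadoop, of how a query such as `SELECT dept, COUNT(*) FROM employees GROUP BY dept` could be broken into map, shuffle, and reduce steps. The table, the column names, and all function names are hypothetical and chosen only for illustration; this is not how Hive's planner is implemented, just the general shape of the translation.

```python
from collections import defaultdict

# Hypothetical rows of an "employees" table (names and columns are assumptions).
ROWS = [
    {"name": "ann", "dept": "sales"},
    {"name": "bob", "dept": "sales"},
    {"name": "cid", "dept": "ops"},
]


def map_phase(rows):
    """Emit (key, 1) per row; the GROUP BY column becomes the key."""
    for row in rows:
        yield row["dept"], 1


def shuffle_phase(pairs):
    """Collect all values that share a key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Apply the aggregate (here COUNT(*)) to each group of values."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_dept(rows):
    """Run the three phases in sequence for the example query."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_dept(ROWS))
```

In a real cluster the map and reduce functions would run in parallel on different nodes and the shuffle would move data over the network; the sketch only shows how the grouping column and the aggregate map onto the three phases.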
Andreas is a full professor at the Leipzig University of Telecommunications (HfTL), a private university owned by Deutsche Telekom. Before joining HfTL in 2013, he was an acting professor at the University of Passau and a visiting research scientist at the University of Maryland. In 2008, he received a PhD in Computer Science from the University of Leipzig, where he worked as a researcher in the database group for several years.
Andreas's research interests are in the area of scalable data management. More specifically, he has worked on approaches for entity resolution, ontology alignment, and the optimization of cloud technologies. He regularly applies advanced data management techniques to research problems in the life sciences and bibliometrics.