FUnctionality Sharing In Open eNvironments
Heinz Nixdorf Chair for Distributed Information Systems
Title: (Semantische) Daten- und Prozessintegration
Lecturers: Alsayed Algergawy
Birgitta König-Ries
Starts on: 2016-10-18
Ends on: 2017-02-03
Time and Location: CZ 3, SR 125 Tuesday 8:00 bis 10:00


Data integration is the process that combines data from several disparate data sources. It becomes one of the key challenges within most of IT projects. Since these data sources are independently engineered and developed by different people, so, they contain a large number of heterogeneities. In order to provide a unified view over these data sources, we should deal with different kinds of heterogeneities.

In this course, students will learn techniques and methodologies for integrating data from large sets of heterogeneous data sources. The course will cover the following topics:
– Importance of data integration
– Physical and virtual data integration
– data and semantic heterogeneities
– String, schema and data matching
– Web data and web data integration


– AnHai Doan, Alon Halevy, Zachary Ives: Principles of Data Integration. Morgan Kaufmann, 2012.
– Ulf Leser, Felix Naumann: Informationsintegration. Dpunkt Verlag, 2007.
– Luna Dong, Divesh Srivastava: Big Data Integration. Morgan & Claypool, 2015.
– Serge Abiteboul, et al: Web Data Management. Cambridge University Press, 2012.
– Jérôme Euzenat, Pavel Shvaiko: Ontology Matching. Springer, 2007.
– Felix Naumann: An Introduction to Duplicate Detection. Morgan & Claypool, 2012.