Title: (Semantische) Daten- und Prozessintegration
Lecturers: Alsayed Algergawy
Birgitta König-Ries
Starts on: 2018-10-15
Ends on: 2019-02-08
Time and Location: Carl-Zeiß-Straße 3 - SR 130, Tuesday 8:00 bis 10:00
Website: Link

Description

Data integration is the process that combines data from several disparate data sources. It becomes one of the key challenges within most of IT projects. Since these data sources are independently engineered and developed by different people, so, they contain a large number of heterogeneities. In order to provide a unified view of these data sources, we should deal with different kinds of heterogeneities.

In this course, students will learn techniques and methodologies for integrating data from large sets of heterogeneous data sources. The course will cover the following topics:
– Importance of data integration
– Physical and virtual data integration
– data and semantic heterogeneities
– String, schema, and data matching
– Web data and web data integration

Literature

– AnHai Doan, Alon Halevy, Zachary Ives: Principles of Data Integration. Morgan Kaufmann, 2012.
– Ulf Leser, Felix Naumann: Informationsintegration. Dpunkt Verlag, 2007.
– Luna Dong, Divesh Srivastava: Big Data Integration. Morgan & Claypool, 2015.
– Serge Abiteboul, et al: Web Data Management. Cambridge University Press, 2012.
– Jérôme Euzenat, Pavel Shvaiko: Ontology Matching. Springer, 2007.
– Felix Naumann: An Introduction to Duplicate Detection. Morgan & Claypool, 2012.