FUnctionality Sharing In Open eNvironments
Heinz Nixdorf Chair for Distributed Information Systems

QUIS: in-situ heterogeneous data source querying

Title: QUIS: in-situ heterogeneous data source querying
Authors: Javad Chamanara, Birgitta König-Ries, H. V. Jagadish
Source: Journal of Proceedings of the VLDB Endowment
Place: Munich, Germany
Date: 2017-08-28
Type: Journal Paper

Existing data integration frameworks are poorly suited for the special requirements of scientists. To answer a specific research question, often, excerpts of data from different sources need to be integrated. The relevant parts and the set of underlying sources may differ from query to query. The analyses also oftentimes involve frequently changing data and exploratory querying. Additionally, The data sources not only store data in different formats, but also provide inconsistent data access functionality. The classic Extract-Transform-Load (ETL) approach seems too complex and time-consuming and does not fit well with interest and expertise of the scientists.

With QUIS (QUery In-Situ), we provide a solution for this problem. QUIS is an open source heterogeneous in-situ data querying system. It utilizes a federated query virtualization approach that is built upon plugged-in adapters. QUIS takes a user query and transforms appropriate portions of it into the corresponding computation model on individual data sources and executes it. It complements the segments of the query that the target data sources can not execute. Hence, it guarantees full syntax and semantic support for its language on all data sources. QUIS’s in-situ querying facility almost eliminates the time to prepare the data while maintaining a competitive performance and steady scalability.

The present demonstration illustrates interesting features of the system: virtual Schemas, heterogeneous joins, and visual query results. We provide a realistic data processing scenario to examine the system’s features. Users can interact with QUIS using its desktop workbench, command line interface, or from any R client including RStudio Server.

File: p976-chamanara
URL: https://dl.acm.org/citation.cfm?id=3137798
 author = {Chamanara, Javad and K\"{o}nig-Ries, Birgitta and Jagadish, H. V.},
 title = {QUIS: In-situ Heterogeneous Data Source Querying},
 journal = {Proc. VLDB Endow.},
 issue_date = {August 2017},
 volume = {10},
 number = {12},
 month = aug,
 year = {2017},
 issn = {2150-8097},
 pages = {1877--1880},
 numpages = {4},
 url = {https://doi.org/10.14778/3137765.3137798},
 doi = {10.14778/3137765.3137798},
 acmid = {3137798},
 publisher = {VLDB Endowment},