A virtual “Werkstatt” for digitization in the sciences
ABECTO: An ABox Evaluation and Comparison Tool for Ontologies
Combining Image and Caption Analysis for Classifying Charts in Biodiversity Texts
CoMerger: A Customizable Online Tool for Building a Consistent Quality-Assured Merged Ontology
How good is this merged ontology?
JenTab: Matching Tabular Data to Knowledge Graphs
Machine Learning Pipelines: Provenance, Reproducibility and FAIR Data Principles
Matching Biodiversity and Ecology Ontologies: Challenges & Evaluation Results
Ontology Based Natural Language Queries Transformation into SPARQL Queries
Ontology Modularization with OAPT
Participatory Visualization Design as an Approach to Minimize the Gap between Research and Application
ReproduceMeGit: A Visualization Tool for Analyzing Reproducibility of Jupyter Notebooks
Results of the Ontology Alignment Evaluation Initiative 2020
ScholarLensViz: A Visualization Framework for Transparency in Semantic User Profiles
Semantics-driven Keyword Search over Knowledge Graphs
Tag Me If You Can! Semantic Annotation of Biodiversity Metadata with the QEMP Corpus and the BiodivTagger
Toward OWL Restriction Reconciliation in Merging Knowledge
Towards a Core Ontology for Hierarchies of Hypotheses in Invasion Biology
Towards Multiple Ontology Merging with CoMerger
Towards Transforming Tabular Datasets into Knowledge Graphs
What to Do When the Users of an Ontology Merging System Want the Impossible? Towards Determining Compatibility of Generic Merge Requirements
Combining Image and Caption Analysis for Classifying Charts in Biodiversity Texts
Title: | Combining Image and Caption Analysis for Classifying Charts in Biodiversity Texts |
---|---|
Authors: | Pawandeep Kaur and Dora Kiesal |
Source: | IVAPP 2020 |
Place: | Malta |
Date: | 2020-02-27 |
Type: | Publication |
Abstract: |
Chart type classification through caption analysis is a new area of study. Distinct keywords in the captions that relate to the visualization vocabulary (e.g., for scatterplot: dot, y-axis, x-axis, bubble) and keywords from the specific domain (e.g., species richness, species abundance, phylogenetic associations in the case of biodiversity research), serve as parameters to train a text classifier. For better chart comprehensibility, along with the visual characteristics of the chart, a classifier should also understand these parameters well. Such conceptual/semantic chart classifiers then will not only be useful for chart classification purposes but also for other visualization studies. One of the application of such classifier is in domain knowledgebase visualization recommendation system. Where these text classifiers can provide the recommendation of visualization types or schema based on the classification of the text provided along with the dataset. Motivated by this use case, in this paper, we have explored our idea of semantic chart classifiers. We have taken the assistance of state-of-the-art natural language processing (NLP) and computer vision algorithms to create a biodiversity domain-based visualization classifier. With an average test accuracy (F1-score) of 92.2\% over all 15 classes, we can prove that our classifiers can differentiate between different chart types conceptually and visually. |