Navigation
A Deep Learning-based Approach for Banana Leaf Diseases Classification
A Particle Swarm-based Approach for Semantic Similarity Computation
A Review on Visualization Recommendation Strategies
Accelerating relational database operations using both CPU and GPU co-processor
An Ontology-based Scientific Data Integration Workflow
CNN based Chart Image Classification
Context-Based Web Service Discovery Framework with QoS Considerations
Essential Annotation Schema for Ecology (EASE)—A framework supporting the efficient data annotation and faceted navigation in ecology
Exploratorien zur funktionellen Biodiversitätsforschung – ein DFG Schwerpunktprogramm
Honey bee versus Apis mellifera: A Semantic Search for Biological Data
Integrated Semantic Search on Structured and Unstructured Data in the ADOnIS System
Integrative Data Management for Reproducibility of Microscopy Experiments
Ontology Visualization: An Overview
QUIS: in-situ heterogeneous data source querying
REPRODUCE-ME: Ontology-based Data Access for Reproducibility of Microscopy Experiments
ScholarLens: extracting competences from research publications for the automatic generation of semantic user profiles
Towards Personalized Support for Ontology Selection
Towards Using Semantic Data Mining for Predicting Community Assembly
Visualization Taxonomy based on the Specification of User’s Goal and Data Dimensions
What do Biodiversity Scholars Search for? Identifying High-Level Entities for Biological Metadata
Why the mapping process in ontology integration deserves attention
ScholarLens: extracting competences from research publications for the automatic generation of semantic user profiles
Title: | ScholarLens: extracting competences from research publications for the automatic generation of semantic user profiles |
---|---|
Authors: | Bahar Sateli, Felicitas Löffler, Birgitta König-Ries, René Witte |
Source: | PeerJ |
Date: | 2017-07-03 |
Type: | Journal Paper |
URL: | https://doi.org/10.7717/peerj-cs.121 |
BibTex: |
@article{10.7717/peerj-cs.121, title = {ScholarLens: extracting competences from research publications for the automatic generation of semantic user profiles}, author = {Sateli, Bahar and Löffler, Felicitas and König-Ries, Birgitta and Witte, René}, year = 2017, month = jul, keywords = {Natural language processing, Semantic user profile, Semantic publishing, Scholarly user modeling, Linked open data}, abstract = { Motivation Scientists increasingly rely on intelligent information systems to help them in their daily tasks, in particular for managing research objects, like publications or datasets. The relatively young research field of \textit{Semantic Publishing} has been addressing the question how scientific applications can be improved through semantically rich representations of research objects, in order to facilitate their discovery and re-use. To complement the efforts in this area, we propose an automatic workflow to construct \textit{semantic user profiles} of scholars, so that scholarly applications, like digital libraries or data repositories, can better understand their users’ interests, tasks, and competences, by incorporating these user profiles in their design. To make the user profiles sharable across applications, we propose to build them based on standard semantic web technologies, in particular the Resource Description Framework (RDF) for representing user profiles and Linked Open Data (LOD) sources for representing competence topics. To avoid the \textit{cold start} problem, we suggest to automatically populate these profiles by analyzing the publications (co-)authored by users, which we hypothesize reflect their research competences. Results We developed a novel approach, \textit{ScholarLens}, which can automatically generate semantic user profiles for authors of scholarly literature. For modeling the competences of scholarly users and groups, we surveyed a number of existing linked open data vocabularies. In accordance with the LOD best practices, we propose an RDF Schema (RDFS) based model for competence records that reuses existing vocabularies where appropriate. To automate the creation of semantic user profiles, we developed a complete, automated workflow that can generate semantic user profiles by analyzing full-text research articles through various natural language processing (NLP) techniques. In our method, we start by processing a set of research articles for a given user. Competences are derived by text mining the articles, including syntactic, semantic, and LOD entity linking steps. We then populate a knowledge base in RDF format with user profiles containing the extracted competences.We implemented our approach as an open source library and evaluated our system through two user studies, resulting in mean average precision (MAP) of up to 95%. As part of the evaluation, we also analyze the impact of semantic zoning of research articles on the accuracy of the resulting profiles. Finally, we demonstrate how these semantic user profiles can be applied in a number of use cases, including article ranking for personalized search and finding scientists competent in a topic —e.g., to find reviewers for a paper. Availability All software and datasets presented in this paper are available under open source licenses in the supplements and documented at http://www.semanticsoftware.info/semantic-user-profiling-peerj-2016-supplements. Additionally, development releases of ScholarLens are available on our GitHub page: https://github.com/SemanticSoftwareLab/ScholarLens. }, volume = 3, pages = {e121}, journal = {PeerJ Computer Science}, issn = {2376-5992}, url = {https://doi.org/10.7717/peerj-cs.121}, doi = {10.7717/peerj-cs.121} } |