Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
  • H HiWi Tasks
  • Project information
    • Project information
    • Activity
    • Labels
    • Planning hierarchy
    • Members
  • Issues 5
    • Issues 5
    • List
    • Boards
    • Service Desk
    • Milestones
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
  • Activity
  • Create a new issue
  • Issue Boards
Collapse sidebar

This GitLab instance will be decommissioned in the next days. Please do not create new projects here and backup your existing project or move them to the URZ GitLab on http://git.rz.uni-jena.de.

  • FUSION
  • HiWi Tasks
  • Issues
  • #6

Closed
Open
Created Mar 15, 2021 by Jan Martin Keil@jmkeilReporter

Survey the Use of Datatypes in RDF on the Web

RDF is a framework for the distributed representation of knowledge on the web. The Web Data Commons is a RDF representation of structured data extracted from the Common Crawl, a snapshot of the web. In RDF, values (e.g. names, prices, coordinates, …) are represented with literals, that have specific data types. We would like to survey the usage of different datatypes on the web, based on Web Data Commons.

Your task
  • develop a tool to stream the Web Data Commons dataset and collect statistics on the usage of datatypes in RDF on the web
Your qualifications
  • good programming skills (e.g. Java, Python, NodeJS, …)
  • basic SQL skills
  • preferable basic knowledge about RDF
Your benefits
  • contribute to current research
  • work on exciting technologies
  • work with the structured data of the whole web
Your application

Please contact jan-martin.keil@uni-jena.de for further information and send your informal application via e-mail to jan-martin.keil@uni-jena.de.

Edited Mar 16, 2021 by Birgitta
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
Time tracking