DI4R 2017 associated event: OpenAIRE datathon

The OpenAIRE datathon aims to encourage developers and data scientists to analyse and improve the OpenAIRE Information Space, which currently consists of a scholarly communication graph interlinking publications, datasets, software, research organisations, funders, and projects. The graph is the result of harvesting metadata from about 3000 data providers.

The data challenges

The datathon invites teams of computer scientists, data scientists, and experts from other fields to study the OpenAIRE graph and enhance its discovery and statistical capabilities. Four main OpenAIRE datasets will be made available: Linked Open Data, the Scholix exchange format (a JSON representation of the scholarly graph), XML collections, and full-text collections.


The main topics of the challenge are:

  • Enabling multi-disciplinary or discipline-specific discovery and statistics functionality
  • Novel techniques to enable measurement of scientific impact, e.g. counters, links, provenance
  • Innovative techniques to measure scientific impact, e.g. measures of quality
  • Enabling reproducibility, e.g. re-use oriented metadata, meaningful interlinking of objects
  • De-duplication of the information space, e.g. disambiguation of authors, disambiguation of organisations.

The three teams proposing the most outstanding solutions will be awarded an Apple iPad Pro and will be featured on the OpenAIRE website.

Important dates

The datathon will last for 3 months, from 30 November 2017 to 28 February 2018. Registration is open until 15 January 2018.

See more information.

Twitter: @oaire_datathon