Dataset Open Access

WikiEvents Dataset from January 2020 to December 2022

Michaelis, Lars


Citation Style Language JSON Export

{"DOI":"10.25592/uhhfdm.11447","abstract":"<p>WikiEvents is a knowledge-graph-based dataset for NLP and event-related machine learning tasks.</p>\n\n<p>This dataset includes RDF data in JSON-LD about events between January 2020 and December 2022. It was extracted from the Wikipedia Current events portal, Wikidata, OpenStreetMap Nominatim and Falcon 2.0. The extractor is available on GitHub under <a href=\"https://github.com/semantic-systems/current-events-to-kg\">semantic-systems/current-events-to-kg</a>.</p>\n\n<p>The RDF data for each month is split into four graph modules:</p>\n\n<ul>\n\t<li>The <strong>base</strong> graph module contains events and event summaries, with references from named entities to Wikipedia articles.</li>\n\t<li>The <strong>ohg</strong> graph module contains all one-hop graphs (ohg) around the referenced Wikidata entities.</li>\n\t<li>The <strong>osm</strong> graph module contains spatial data from OpenStreetMap (OSM).</li>\n\t<li>The <strong>raw</strong> graph module contains the raw HTML objects of events and article infoboxes.</li>\n</ul>\n\n<p>This repository additionally includes two JSON files with training samples used for entity linking and event-related location extraction. They were created using queries to the WikiEvents dataset uploaded into this repository.</p>","author":[{"family":"Michaelis, Lars"}],"id":"11447","issued":{"date-parts":[[2023,2,7]]},"language":"eng","title":"WikiEvents Dataset from January 2020 to December 2022","type":"dataset"}
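
The record above can be read with any JSON parser. The following is a minimal sketch, using only the Python standard library, of how the DOI, title, issue date and a plain-text abstract might be pulled out of such a CSL-JSON export. The helper names `strip_html` and `summarize` are hypothetical and not part of the dataset or its extractor, and the embedded record is an abbreviated copy whose abstract is cut to its first paragraph.

```python
import json
from datetime import date
from html.parser import HTMLParser

# Abbreviated copy of the CSL-JSON record (abstract cut to its first paragraph).
CSL_RECORD = '''{"DOI":"10.25592/uhhfdm.11447","abstract":"<p>WikiEvents is a knowledge graph based dataset for NLP and event-related machine learning tasks.</p>","author":[{"family":"Michaelis, Lars"}],"id":"11447","issued":{"date-parts":[[2023,2,7]]},"language":"eng","title":"WikiEvents Dataset from January 2020 to December 2022","type":"dataset"}'''


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_html(markup):
    """Hypothetical helper: drop tags and collapse runs of whitespace."""
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    return " ".join("".join(parser.parts).split())


def summarize(record_json):
    """Hypothetical helper: return a few fields of a CSL-JSON record."""
    record = json.loads(record_json)
    # CSL-JSON stores dates as nested lists: [[year, month, day]].
    year, month, day = record["issued"]["date-parts"][0]
    return {
        "doi": record["DOI"],
        "title": record["title"],
        "issued": date(year, month, day).isoformat(),
        "abstract": strip_html(record["abstract"]),
    }


if __name__ == "__main__":
    for key, value in summarize(CSL_RECORD).items():
        print(f"{key}: {value}")
```

The sketch assumes a complete year, month and day in `date-parts`; records that give only a year or a year and month would need extra handling.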
