Dataset Open Access
Michaelis, Lars
{"conceptdoi":"10.25592/uhhfdm.11446","conceptrecid":"11446","created":"2023-02-07T14:22:46.994823+00:00","doi":"10.25592/uhhfdm.11447","id":11447,"links":{"badge":"https://www.fdr.uni-hamburg.de/badge/doi/10.25592/uhhfdm.11447.svg","conceptbadge":"https://www.fdr.uni-hamburg.de/badge/doi/10.25592/uhhfdm.11446.svg","conceptdoi":"http://doi.org/10.25592/uhhfdm.11446","doi":"http://doi.org/10.25592/uhhfdm.11447"},"metadata":{"access_right":"open","access_right_category":"success","communities":[{"id":"uhh"}],"creators":[{"name":"Michaelis, Lars"}],"description":"<p>WikiEvents is a knowledge graph based dataset for NLP and event-related machine learning tasks.</p>\n\n<p>This dataset includes RDF data in JSON-LD about events between January 2020 and December 2022. It was extracted from the Wikipedia Current events portal, Wikidata, OpenStreetMaps Nominatim and Falcon 2.0. The extractor is available on GitHub under <a href=\"https://github.com/semantic-systems/current-events-to-kg\">semantic-systems/current-events-to-kg</a>.</p>\n\n<p>The RDF data for each month is split onto four graph modules each:</p>\n\n<ul>\n\t<li>The <strong>base</strong> graph module contains events, event summaries with references from named entities to Wikipedia articles.</li>\n\t<li>The <strong>ohg</strong> graph module with all one-hop graphs (ohg) around the referencend Wikidata entities.</li>\n\t<li>The <strong>osm</strong> graph module which contains spartial data from OpenStreetMap (OSM).</li>\n\t<li>The <strong>raw</strong> graph module containing the raw HTML objects of events and article infoboxes.</li>\n</ul>\n\n<p>This repository additionally includes two JSON files with training samples used for entity linking and event-related location extraction. They were created using queries to the WikiEvents dataset uploaded into this repository.</p>","doi":"10.25592/uhhfdm.11447","keywords":["Knowledge Graph","Events","Location Extraction","Entity Linking","NLP"],"language":"eng","license":{"id":"CC-BY-SA-4.0"},"publication_date":"2023-02-07","related_identifiers":[{"identifier":"10.25592/uhhfdm.11446","relation":"isVersionOf","scheme":"doi"}],"relations":{"version":[{"count":1,"index":0,"is_last":true,"last_child":{"pid_type":"recid","pid_value":"11447"},"parent":{"pid_type":"recid","pid_value":"11446"}}]},"resource_type":{"title":"Dataset","type":"dataset"},"title":"WikiEvents Dataset from January 2020 to December 2022"},"owners":[371],"revision":4,"updated":"2023-12-29T15:14:40.704089+00:00"}