Dataset Open Access

GLips - German Lipreading Dataset

Schwiebert, Gerald; Weber, Cornelius; Qu, Leyuan; Siqueira, Henrique; Wermter, Stefan


JSON-LD (schema.org) Export

{"@context":"https://schema.org/","@id":"http://doi.org/10.25592/uhhfdm.10048","@type":"Dataset","creator":[{"@type":"Person","affiliation":"University of Hamburg","name":"Schwiebert, Gerald"},{"@type":"Person","affiliation":"University of Hamburg","name":"Weber, Cornelius"},{"@type":"Person","affiliation":"University of Hamburg","name":"Qu, Leyuan"},{"@type":"Person","affiliation":"University of Hamburg","name":"Siqueira, Henrique"},{"@type":"Person","affiliation":"University of Hamburg","name":"Wermter, Stefan"}],"datePublished":"2022-03-01","description":"<p>The German Lipreading dataset consists of 250,000 publicly available videos of the faces of speakers of the Hessian Parliament, which was processed for word-level lip reading using an automatic pipeline. The format is similar to that of the English language Lip Reading in the Wild (LRW) dataset, with each H264-compressed MPEG-4 video encoding one word of interest in a context of 1.16 seconds duration, which yields compatibility for studying transfer learning between both datasets. Choosing video material based on naturally spoken language in a natural environment ensures more robust results for real-world applications than artificially generated datasets with as little noise as possible. The 500 different spoken words ranging between 4-18 characters in length each have 500 instances and separate MPEG-4 audio- and text metadata-files, originating from 1018 parliamentary sessions. Additionally, the complete TextGrid files containing the segmentation information of those sessions are also included. The size of the uncompressed dataset is 16GB.</p>","distribution":[{"@type":"DataDownload","contentUrl":"https://www.fdr.uni-hamburg.de/api/files/249b00a0-2f2c-4a7b-85db-4a420d94abe3/GLips.zip","encodingFormat":"zip"}],"identifier":"http://doi.org/10.25592/uhhfdm.10048","inLanguage":{"@type":"Language","alternateName":"deu","name":"German"},"keywords":["Computer Vision","Pattern Recognition","Machine Learning","Deep Learning","Language","Dataset","Automatic Speech Recognition","Transfer Learning","Lip Reading","Corpus"],"license":"https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode","name":"GLips - German Lipreading Dataset","url":"https://www.fdr.uni-hamburg.de/record/10048","version":"1.0"}

Cite record as