Software Open Access
<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
<leader>00000nmm##2200000uu#4500</leader>
<datafield tag="909" ind1="C" ind2="O">
<subfield code="o">oai:fdr.uni-hamburg.de:8965</subfield>
<subfield code="p">user-uhh</subfield>
</datafield>
<controlfield tag="001">8965</controlfield>
<datafield tag="260" ind1=" " ind2=" ">
<subfield code="c">2021-03-20</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">software</subfield>
</datafield>
<datafield tag="024" ind1=" " ind2=" ">
<subfield code="a">10.25592/uhhfdm.8965</subfield>
<subfield code="2">doi</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">user-uhh</subfield>
</datafield>
<datafield tag="856" ind1="4" ind2=" ">
<subfield code="s">43945</subfield>
<subfield code="u">https://www.fdr.uni-hamburg.de/record/8965/files/xnlpipe-v1.0.zip</subfield>
<subfield code="z">md5:767d2a6e7b2cedd0a5e068778a829aa2</subfield>
</datafield>
<datafield tag="540" ind1=" " ind2=" ">
<subfield code="u">https://opensource.org/licenses/GPL-3.0</subfield>
<subfield code="a">GNU General Public License v3.0 or later</subfield>
</datafield>
<datafield tag="773" ind1=" " ind2=" ">
<subfield code="a">10.1075/da.2020.verwer.plain-text-processing</subfield>
<subfield code="i">cites</subfield>
<subfield code="n">doi</subfield>
</datafield>
<datafield tag="773" ind1=" " ind2=" ">
<subfield code="a">10.3115/v1/P14-5010</subfield>
<subfield code="i">cites</subfield>
<subfield code="n">doi</subfield>
</datafield>
<datafield tag="773" ind1=" " ind2=" ">
<subfield code="a">10.25592/uhhfdm.8964</subfield>
<subfield code="i">isVersionOf</subfield>
<subfield code="n">doi</subfield>
</datafield>
<datafield tag="542" ind1=" " ind2=" ">
<subfield code="l">open</subfield>
</datafield>
<datafield tag="999" ind1="C" ind2="5">
<subfield code="x">Imsieke, Gerrit. 2018. "Tokenized-to-Tree: An XProc/XSLT Library For Patching Back Tokenization/Analysis Results Into Marked-up Text." In XML Prague 2018 Conference Proceedings, 229–45. Prague, Czech Republic.</subfield>
</datafield>
<datafield tag="999" ind1="C" ind2="5">
<subfield code="x">Manning, Christopher D., Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. 2014. "The Stanford CoreNLP Natural Language Processing Toolkit." In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 55–60.</subfield>
</datafield>
<datafield tag="999" ind1="C" ind2="5">
<subfield code="x">Verwer, Nico. "Plain Text Processingin Structured Documents." In Proceedings of Declarative Amsterdam 2020. CWI, Amsterdam: John Benjamins, 2020</subfield>
</datafield>
<datafield tag="520" ind1=" " ind2=" ">
<subfield code="a"><p>The XML NLP Pipeline is a Java command line application that integrates the Stanford CoreNLP pipeline (Manning et al. 2014) in an XML-based processing pipeline. It uses a simplified version of the Separated Markup API for XML (SMAX) by Nico Verwer (Verwer 2020) to patch the annotated tokens back to the XML document, preserving all previous annotations.</p>
<p><strong>Bibliography</strong></p>
<p>Imsieke, Gerrit. 2018. &ldquo;Tokenized-to-Tree: An XProc/XSLT Library For Patching Back Tokenization/Analysis Results Into Marked-up Text.&rdquo; In XML Prague 2018 Conference Proceedings, 229&ndash;45. Prague, Czech Republic.</p>
<p>Manning, Christopher D., Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. 2014. &ldquo;The Stanford CoreNLP Natural Language Processing Toolkit.&rdquo; In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 55&ndash;60. <a href="http://www.aclweb.org/anthology/P/P14/P14-5010">http://www.aclweb.org/anthology/P/P14/P14-5010</a>.</p>
<p>Verwer, Nico. &ldquo;Plain Text Processingin Structured Documents.&rdquo; In Proceedings of Declarative Amsterdam 2020. CWI, Amsterdam: John Benjamins, 2020. <a href="https://doi.org/10.1075/da.2020.verwer.plain-text-processing">https://doi.org/10.1075/da.2020.verwer.plain-text-processing</a>.</p></subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Java</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Dehmel Digital</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">XML</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">NLP</subfield>
</datafield>
<controlfield tag="005">20210723083410.0</controlfield>
<datafield tag="245" ind1=" " ind2=" ">
<subfield code="a">XML NLP Pipeline</subfield>
</datafield>
<datafield tag="041" ind1=" " ind2=" ">
<subfield code="a">eng</subfield>
</datafield>
<datafield tag="100" ind1=" " ind2=" ">
<subfield code="a">Maus, David</subfield>
<subfield code="u">State and University Library Hamburg</subfield>
<subfield code="0">(orcid)0000-0001-9292-5673</subfield>
</datafield>
<datafield tag="650" ind1="1" ind2="7">
<subfield code="a">cc-by</subfield>
<subfield code="2">opendefinition.org</subfield>
</datafield>
</record>