<?xml version="1.0" encoding="UTF-8"?>
<!--DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"-->
<?xml-stylesheet href="http://www.sfb632.uni-potsdam.de/~chiarcos/nlg.xml" type="text/xsl" ?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:bibtex="http://bibtexml.sf.net/"
    xmlns:my="bibfunc:bar"
    xmlns:xs="http://www.w3.org/2001/XMLSchema">
    
    <xsl:param name="pdfDirURI">./</xsl:param>
    
    
    <xsl:output 
        method="html"
        indent="yes"
        version="1.0"/>
    
    <xsl:include href="http://www.sfb632.uni-potsdam.de/~chiarcos/bibtex/xslt/bibxml2html_xslt1_funcs.xsl"/>
    
    <xsl:template match="/">
        
        
        <HTML>
            <HEAD>
                <META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html"/>
                <TITLE></TITLE>
                <link href="http://www.sfb632.uni-potsdam.de/~chiarcos/bibtex/default.css" type="text/css" rel="stylesheet"/>
                <STYLE>
                    <!--
                        H4 { color: #000000 }
                    -->
                </STYLE>
                <script type="text/javascript" src="http://www.sfb632.uni-potsdam.de/~chiarcos/bibtex/toggle.js"/>
            </HEAD>
            <body  style="background-color:rgb(240,241,225);">
                <H4><FONT FACE="Sans-serif"><FONT STYLE="font-size: 16pt">Natural Language Processing</FONT></FONT></H4>
<div style="text-align:justify">

<p>
As a computational scientist, I have a strong commitment to the practical implementation of theoretical results and findings in concrete NLP applications.
</p>

<p>
As such, the <a href="phd.html">Mental Salience Framework</a> (MSF), that I've developed in my PhD thesis, is a framework for the development and application of salience metrics in <b>Natural Language Generation</b>: If the grammar provides alternative realization candidates for a given piece of information, the MSF defines how salience metrics can be applied to predict the contextually contextually most adequate realization.
I've practially implemented such salience-based preferences for the choice of referring expressions, the assignment of grammatical roles and word order preferences in the sentence planning module for the PoliBox system. Together with Manfred Stede, I've also built a text planning module for PoliBox where the discourse structure (and hence, the hierarchical organization and the choice of discourse connectives) were derived from primitive pieces of information.

Subsequent research on <a href="discourse.xml">discourse phenomena</a> was chiefly concerned with the evaluation of design decisions made in these systems. 
</p>

<p>
In the context of <b>Natural Language Understanding</b>, the MSF can be employed to the spotting of `relevant' or `important' expressions:
It specifies a distinction between two dimensions of salience and formalizes their interaction. If one dimension, say, backward-looking salience, is known, and deviations from packaging preferences according to this dimension of salience are encountered, then the other dimension (i.e., forward-looking salience, or importance) must be involved.
The MSF also has a potential application in <b>Machine Translation</b>: Information packaging preferences differ across the languages of the world, and the salience-based approach can be applied in the pre- and postediting steps of MT, e.g., in order to derive contextual appropriate word order.
Both applications, however, have not yet been systematically explored.
</p>

<p>
In a number of papers, the <a href="ontologies.xml">OLiA ontologies</a>, originally designed to allow users to explore corpora with multiple layers of heterogeneous, unknown or complex annotations, have also been shown to be practically applicable for various NLP tasks.
The ontologies allow to abstract from string-based representations of linguistic annotations, and to operate on conceptual representations of linguistic annotations:
In <b>ensemble combination</b> architectures, the OLiA ontologies make it possible to combine linguistic analyses produced by tools with different tagsets;
in <b>NLP pipeline systems</b>, e.g., UiMA, the OLiA ontologies can be employed for the tool-independent specification of interfaces between different modules;
further, the OLiA ontologies provide a natural integration of NLP analyses with <b>Semantic Web</b> techniques.
They contribute to the portability and scalability of linguistic annotations and the NLP tools that they produce.
</p>
</div>
                

<H2>Selected Publications</H2>


                <table>
                    <xsl:for-each select="document('http://www.sfb632.uni-potsdam.de/~chiarcos/bibtex/publications.xml')//bibtex:entry
                        [count(.//*[contains(text(),'agging')])&gt;0 or
                         count(.//*[contains(text(),'pipeline')])&gt;0 or
                         count(.//*[contains(text(),'Data Category Registry')])&gt;0 or
                         count(.//*[contains(text(),'lassifier')])&gt;0 or
                         count(.//*[contains(text(),'Querying')])&gt;0 or
                         count(.//*[contains(text(),'enerierung')])&gt;0 or
                         count(.//*[contains(text(),'NLG')])&gt;0 or
                         count(.//*[contains(text(),'generation')])&gt;0 or
                         count(.//*[contains(text(),'Genera')])&gt;0]
                         [count(.//*[contains(text(),'Rehm')])=0]">
                        <xsl:call-template name="entry"/>
                    </xsl:for-each>
                </table>
                


<!--P LANG="en-GB"><FONT FACE="Serif">Chiarcos, Ch., Claus, B., and M.
Grabski, eds. (in preparation),</FONT></P>
<UL>
	<P><FONT FACE="Serif"><SPAN LANG="en-GB">Salience. Multidisciplinary
	perspectives on its function in discourse. </SPAN>Trends in
	Linguistics. Studies and Monographs' [TiLSM], de Gruyter, Berlin, in
	preparation.</FONT></P>
</UL>
<P LANG="en-GB"><FONT FACE="Serif">Krasavina, O., Chiarcos, Ch., and
Zalmanov, D. (2007)</FONT></P>
<UL>
	<P LANG="en-GB"><FONT FACE="Serif">Aspects of topicality in the use
	of demonstrative expressions in German, English and Russian, Ant&oacute;nio
	Branco, Tony McEnery, Ruslan Mitkov and F&aacute;tima Silva (Eds.),
	Proc. 6<SUP>th</SUP> Discourse Anaphora and Anaphor Resolution
	Colloquium (DAARC-2007), Lagos (Algarve)/Portugal, March 29-30,
	2007, p.53-58.</FONT></P>
</UL>
<P><FONT FACE="Serif">Chiarcos, Ch. (2006a)</FONT></P>
<UL>
	<P LANG="en-GB"><FONT FACE="Serif"><I>Semimanuelle Generierung und
	Auswertung von Alternativentexten.</I> In Hardarik Bl&uuml;hdorn,
	Eva Breindl, Ulrich Wa&szlig;ner (Eds.), <I>Text &ndash; Verstehen.
	Grammatik und dar&uuml;ber hinaus</I>. Institut f&uuml;r Deutsche
	Sprache. Jahrbuch 2005. De Gruyter, Berlin, New York, 2006,
	p.406-410.</FONT></P>
</UL>
<P LANG="en-GB"><FONT FACE="Serif">Stede, M., Chiarcos, Ch., Grabski,
M., and L. Lagerwerf, eds. (2005),</FONT></P>
<UL>
	<P LANG="en-GB"><FONT FACE="Serif">Salience in Discourse:
	Proceedings of the 6<SUP>th</SUP> International Workshop on
	Multidisciplinary Approaches to Discourse 2005 (MAD 2005), Stichting
	Neerlandistiek VU Amsterdam &amp; Nodus Publikationen, M&uuml;nster</FONT></P>
</UL>
<P><FONT FACE="Serif"><SPAN LANG="en-GB">Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">(2005)</SPAN></FONT></P>
<P LANG="en-GB" STYLE="margin-left: 1.25cm"><FONT FACE="Serif">Mental
salience and grammatical form: Generating referring expressions. In
Salience in discourse. Proceedings of the 6th Workshop on
Multidisciplinary Approaches to Discourse (MAD-05), M. Stede,
Ch.Chiarcos, M.Grabski and L.Lagerwerf (eds), 17-26. Amsterdam:
Stichting/M&uuml;nster: Nodus.</FONT></P>
<P><FONT FACE="Serif"><SPAN LANG="en-GB">Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">and
O. Krasavina (2005a)</SPAN></FONT></P>
<P LANG="en-GB" STYLE="text-indent: 1.25cm"><FONT FACE="Serif">Rhetorical
distance revisited. A parameterized approach. In Proceedings of
Constraints in Discourse, Dortmund/Germany, June 3-5, 2005.</FONT></P>
<P LANG="en-GB"><FONT FACE="Serif">Chiarcos, Ch. and O. Krasavina
(2005b)</FONT></P>
<UL>
	<P><a target="_new" href="http://www.corpus.bham.ac.uk/PCLCI"><SPAN STYLE="text-decoration: none"><SPAN LANG="en-GB"><FONT SIZE=2><FONT FACE="Serif"><FONT COLOR="#000000">Rhetorical
	Distance Revisited: A pilot study. In: Proceedings of Corpus
	Linguistics 2005, 14-17 July, Birmingham, UK, published as The
	Corpus Linguistics Conference Series, Vol. 1, no. 1.,
	www.corpus.bham.ac.uk/PCLC</FONT></FONT></FONT></SPAN></SPAN></A></P>
</UL>
<P LANG="en-GB"><FONT FACE="Serif">Chiarcos, Ch. and O. Krasavina
(2005c)</FONT></P>
<P STYLE="margin-left: 2cm"><FONT FACE="Serif"><SPAN LANG="en-GB">Rhetorical
distance in cross-language evaluation. A parameterized approach of
referential accessibility. Proc. 4<SUP>th</SUP> International
Contrastive Linguistics Conference (ICLC 4), Santiago de Compostela,
September 19 - 23, 2005.Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">and
O. Krasavina (2005d)</SPAN></FONT></P>
<P><FONT FACE="Serif"><SPAN LANG="en-GB">Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">and
O. Krasavina (2005d)</SPAN></FONT></P>
<P STYLE="margin-left: 1.25cm"><SPAN LANG="en-GB"><a target="_new" href="http://amor.cms.hu-berlin.de/~krasavio/annorichtlinien.pdf"><FONT FACE="Serif">Annotation
Guidelines</FONT></A><FONT FACE="Serif">, Draft. PoCoS - Potsdam
Coreference Scheme.
http://amor.cms.hu-berlin.de/~krasavio/annorichtlinien.pdf
(25/12/2005)</FONT></SPAN></P>
<P><FONT FACE="Serif"><SPAN LANG="en-GB">Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">and
M. Stede (2004)</SPAN></FONT></P>
<P STYLE="margin-left: 1.27cm"><SPAN LANG="en-GB"><a target="_new" href="http://springerlink.metapress.com/openurl.asp?genre=article&amp;issn=0302-9743&amp;volume=3123&amp;spage=21"><I><FONT FACE="Serif">Salience-Driven
Text-Planning</FONT></I></A><FONT FACE="Serif">, in Anja Belz, Roger
Evans, Paul Piwek (Eds.): Natural Language Generation, Third
International Conference, INLG 2004, Brockenhurst, UK, July 14-16,
2004, Proceedings. Lecutre Notes in Computer Science 3123 Springer,
p. 21-30.</FONT></SPAN></P>
<P><FONT FACE="Serif">Chiarcos, Ch. (2003a)</FONT></P>
<P STYLE="margin-left: 1.27cm"><FONT FACE="Serif"><I>Eine
Satzplanungskomponente f&uuml;r die Textgenerierung</I>, LDV Forum
(GLDV-Journal for Computational Linguistics and Language Technology)
18 (1):53-67.</FONT></P>
<P><FONT FACE="Serif"><SPAN LANG="en-GB">Chiarcos, Ch.</SPAN> <SPAN LANG="en-GB">(2003b)</SPAN></FONT></P>
<P STYLE="margin-left: 1.27cm"><FONT FACE="Serif"><I><SPAN LANG="en-GB">Reference-Tracking.
</SPAN>Sprachliche Mittel der Salienzindikation im Deutschen</I>,
Technical University of Berlin, unpublished master's thesis
(Magisterarbeit) in General Linguistics.</FONT></P>
<P><FONT FACE="Serif">Chiarcos, Ch. (2002)</FONT></P>
<P STYLE="margin-left: 1.27cm"><FONT FACE="Serif"><I>Eine
Satzplanungskomponente f&uuml;r die Textgenerierung</I>, Technical
University of Berlin, unpublished diploma thesis (Diplomarbeit) in
Computational Science.</FONT></P-->
            </body>
        </HTML>
    </xsl:template>
</xsl:stylesheet>
