TY - JOUR
T1 - Production of referring expressions in Arabic
AU - Khan, Imtiaz Hussain
N1 - Publisher Copyright:
© 2015, Springer Science+Business Media New York.
PY - 2016/6/12
Y1 - 2016/6/12
N2 - Most existing studies on evaluation of the generation of referring expressions (GRE) algorithms intend to find how close GRE output is to human output, when they generate expressions in a similar situation. This article explores how native Arabic speakers produce referring expressions. Participants were presented with objects in visual domains and they were asked to describe the (marked) target object by typing a description (Arabic expressions) which can uniquely identify the target to their addressee. The data revealed that a large proportion (above 35 %) of overspecifying descriptions were produced by participants, and that this overspecification is not only because of certain preference for some attributes over the other attributes, the overspecification (and also sometime underspecification) may be because of the complexity (in terms of length) of the description itself. These data were also compared against the TUNA corpus data, which were elicited by native English speakers in identical conditions as ours. A comparative analysis of Arabic and English descriptions reveals that overall both Arabic and English speakers produce similar linguistic descriptions under the identical conditions, implying that reference generation phenomena are not language specific.
AB - Most existing studies on evaluation of the generation of referring expressions (GRE) algorithms intend to find how close GRE output is to human output, when they generate expressions in a similar situation. This article explores how native Arabic speakers produce referring expressions. Participants were presented with objects in visual domains and they were asked to describe the (marked) target object by typing a description (Arabic expressions) which can uniquely identify the target to their addressee. The data revealed that a large proportion (above 35 %) of overspecifying descriptions were produced by participants, and that this overspecification is not only because of certain preference for some attributes over the other attributes, the overspecification (and also sometime underspecification) may be because of the complexity (in terms of length) of the description itself. These data were also compared against the TUNA corpus data, which were elicited by native English speakers in identical conditions as ours. A comparative analysis of Arabic and English descriptions reveals that overall both Arabic and English speakers produce similar linguistic descriptions under the identical conditions, implying that reference generation phenomena are not language specific.
KW - Arabic referring expressions
KW - Generation of referring expressions
KW - Minimal descriptions
KW - Overspecifying descriptions
KW - Underspecifying descriptions
UR - https://www.scopus.com/pages/publications/84930910752
U2 - 10.1007/s10772-015-9282-8
DO - 10.1007/s10772-015-9282-8
M3 - Article
AN - SCOPUS:84930910752
SN - 1381-2416
VL - 19
SP - 385
EP - 392
JO - International Journal of Speech Technology
JF - International Journal of Speech Technology
IS - 2
ER -