This paper presents a salience-based technique for the annotation of directly quoted speech from fiction text. In particular, this paper determines to what extent a naïve (without the use of complex machine learning or knowledge-based techniques) scoring technique can be used for the identification of the speaker of speech quotes. The presented technique makes use of a scoring technique, similar to that commonly found in knowledge-poor anaphora resolution research, as well as a set of hand-coded rules for the final identification of the speaker of each quote in the text. Speaker identification is shown to be achieved using three tasks: the identification of a speech-verb associated with a quote with a recall of 94.41%; the identification of the actor associated with a quote with a recall of 88.22%; and the selection of a speaker with an accuracy of 79.40%.
History
Event
International Symposium of the Pattern Recognition Association of South Africa (18th : 2007 : Pietermaritzburg, South Africa)
Pagination
1 - 6
Publisher
PRASA
Location
Pietermaritzburg, South Africa
Place of publication
Durban, South Africa
Start date
2007-11-28
End date
2007-11-30
ISBN-13
9781868406562
Language
eng
Publication classification
E1.1 Full written paper - refereed
Copyright notice
2007, PRASA
Editor/Contributor(s)
J Tapamo, F Nicolls
Title of proceedings
PRASA 2007 : Proceedings of the 18th International Symposium of the Pattern Recognition Association of South Africa