File(s) under permanent embargo
A naïve, salience-based method for speaker identification in fiction books
This paper presents a salience-based technique for the annotation of directly quoted speech from fiction text. In particular, this paper determines to what extent a naïve (without the use of complex machine learning or knowledge-based techniques) scoring technique can be used for the identification of the speaker of speech quotes. The presented technique makes use of a scoring technique, similar to that commonly found in knowledge-poor anaphora resolution research, as well as a set of hand-coded rules for the final identification of the speaker of each quote in the text. Speaker identification is shown to be achieved using three tasks: the identification of a speech-verb associated with a quote with a recall of 94.41%; the identification of the actor associated with a quote with a recall of 88.22%; and the selection of a speaker with an accuracy of 79.40%.
History
Event
International Symposium of the Pattern Recognition Association of South Africa (18th : 2007 : Pietermaritzburg, South Africa)Pagination
1 - 6Publisher
PRASALocation
Pietermaritzburg, South AfricaPlace of publication
Durban, South AfricaStart date
2007-11-28End date
2007-11-30ISBN-13
9781868406562Language
engPublication classification
E1.1 Full written paper - refereedCopyright notice
2007, PRASAEditor/Contributor(s)
J Tapamo, F NicollsTitle of proceedings
PRASA 2007 : Proceedings of the 18th International Symposium of the Pattern Recognition Association of South AfricaUsage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC