You are not logged in.

Evaluating parts-of-speech taggers for use in a text-to-scene conversion system

Glass, Kevin and Bangay, Shaun 2005, Evaluating parts-of-speech taggers for use in a text-to-scene conversion system, in SAICSIT '05 : Research for a Changing World – Proceedings of SAICSIT 2005, South African Institute for Computer Scientists and Information Technologists, Pretoria, South Africa, pp. 20-28.

Attached Files
Name Description MIMEType Size Downloads

Title Evaluating parts-of-speech taggers for use in a text-to-scene conversion system
Author(s) Glass, Kevin
Bangay, Shaun
Conference name South African institute of computer scientists and information technologists (2005: White River, South Africa)
Conference location White River, South Africa
Conference dates 20-22 Sept. 2005
Title of proceedings SAICSIT '05 : Research for a Changing World – Proceedings of SAICSIT 2005
Editor(s) Bishop, Judith
Kourie, Derrick
Publication date 2005
Series ACM International Conference Proceeding Series
Conference series South African Institute for Computer Scientists and Information Technologists Conference
Start page 20
End page 28
Total pages 285 p.
Publisher South African Institute for Computer Scientists and Information Technologists
Place of publication Pretoria, South Africa
Keyword(s) corpora
parts-of-speech tagging
Summary This paper presents parts-of-speech tagging as a first step towards an autonomous text-to-scene conversion system. It categorizes some freely available taggers, according to the techniques used by each in order to automatically identify word-classes. In addition, the performance of each identified tagger is verified experimentally. The SUSANNE corpus is used for testing and reveals the complexity of working with different tagsets, resulting in substantially lower accuracies in our tests than in those reported by the developers of each tagger. The taggers are then grouped to form a voting system to attempt to raise accuracies, but in no cases do the combined results improve upon the individual accuracies. Additionally a new metric, agreement, is tentatively proposed as an indication of confidence in the output of a group of taggers where such output cannot be validated.
ISBN 1595932585
Language eng
Field of Research 080107 Natural Language Processing
Socio Economic Objective 890299 Computer Software and Services not elsewhere classified
HERDC Research category E1.1 Full written paper - refereed
Copyright notice ©2005, SAICSIT
Persistent URL http://hdl.handle.net/10536/DRO/DU:30039203

Document type: Conference Paper
Collection: School of Information Technology
Connect to link resolver
 
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in TR Web of Science
Scopus Citation Count Cited 0 times in Scopus
Google Scholar Search Google Scholar
Access Statistics: 205 Abstract Views, 2 File Downloads  -  Detailed Statistics
Created: Mon, 24 Oct 2011, 11:43:06 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.