Deakin University
Browse

File(s) under permanent embargo

Opinion search in web logs

Version 2 2024-06-04, 04:12
Version 1 2017-08-03, 12:22
conference contribution
posted on 2024-06-04, 04:12 authored by DJ Osman, John YearwoodJohn Yearwood
Web logs (blogs) are a fast growing forum for people of all ages to express their feelings and opinions on topics of interest. The entries are often written in informal language without the structure found in newswire or published articles. One blog entry may contain many topics, these topics may express an opinion or a fact on a particular topic. This research is in contrast to work on opinion detection which has been carried out on more formally authored texts and on segments that are either whole documents or sentences. Whole web logs are divided into topics using a simple text segmentation approach. Similarity scores are used to distinguish where topic changes occur. The results are compared to human-evaluated topic changes and the most accurate algorithm is used in the remainder of the research.Words within each topicblock are allocated weightings depending on their opinion-bearing strength. Two approaches of using these weights, the sum and the maximum, are used to determine whether the topic-block is opinion-bearing or non-opinion-bearing. The opinionbearing topic-blocks are rated by human evaluators as either opinion-bearing or non-opinionbearing with precision of 67% for approach A and 70% for approach B. These results are compared with two approaches on published text to identify the difference between web logs and published articles. © 2007, Australian Computer Society, Inc.

History

Volume

63

Pagination

133-139

Location

Ballarat, Vic.

Start date

2007-01-30

End date

2007-02-02

ISSN

1445-1336

Language

eng

Publication classification

EN.1 Other conference paper

Title of proceedings

ADC 2007 : Proceedings of the 18th Australasian Database Conference

Event

Australian Computer Society. Conference (18th : 2007 : Ballarat, Vic.)

Publisher

Australian Computer Society

Series

Australian Computer Society Conference

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC