Deakin University
Browse

Contextual topic discovery using unsupervised keyphrase extraction and hierarchical semantic graph model

journal contribution
posted on 2023-11-06, 00:30 authored by H Du, Srikanth ThudumuSrikanth Thudumu, A Giardina, Rajesh VasaRajesh Vasa, Kon MouzakisKon Mouzakis, L Jiang, J Chisholm, S Bista
AbstractRecent technological advancements have led to a significant increase in digital documents. A document’s key information is generally represented by the keyphrases that provide the abstract description contained therein. With traditional keyphrase techniques, however, it is difficult to identify relevant information based on context. Several studies in the literature have explored graph-based unsupervised keyphrase extraction techniques for automatic keyphrase extraction. However, there is only limited existing work that embeds contextual information for keyphrase extraction. To understand keyphrases, it is essential to grasp both the concept and the context of the document. Hence, a hybrid unsupervised keyphrase extraction technique is presented in this paper called ContextualRank, which embeds contextual information such as sentences and paragraphs that are relevant to keyphrases in the keyphrase extraction process. We propose a hierarchical topic modeling approach for topic discovery based on aggregating the extracted keyphrases from ContextualRank. Based on the evaluation on two short-text datasets and one long-text dataset, ContextualRank obtains remarkable improvements in performance over other baselines in the short-text datasets.

History

Journal

Journal of Big Data

Volume

10

Article number

156

Pagination

1-19

Location

Berlin, Germany

ISSN

2196-1115

eISSN

2196-1115

Language

en

Publication classification

C1 Refereed article in a scholarly journal

Issue

1

Publisher

Springer Science and Business Media LLC

Usage metrics

    Research Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC