科研成果

您所在的位置: 首页 >> 科研成果 >> 正文

chenjing,wangtiantian,luquan :THC-DAT: a document analysis tool based on topic hierarchy and context information

2019年07月02日 14:47  点击:[]

【摘要】Purpose – The purpose of this paper is to propose a novel within-document analysis tool (DAT) topic hierarchy and context-based document analysis tool (THC-DAT) which enables users to interactively analyze any multi-topic document based on fine-grained and hierarchical topics automatically extracted from it. THC-DAT used hierarchical latent Dirichlet allocation method and took the context information into account so that it can reveal the relationships between latent topics and related texts in a document. Design/methodology/approach– The methodology is a case study. The authors reviewed the related literature first, then utilized a general “build and test” research model. After explaining the model, interface and functions of THC-DAT, a case study was presented using a scholarly paper that was analyzed with the tool. Findings– THC-DAT can organize and serve document topics and texts hierarchically and context based, which overcomes the drawbacks of traditional DATs. The navigation, browse, search and comparison functions of THC-DAT enable users to read,search and analyze multi-topic document efficiently and effectively. Practical implications– It can improve the document organization and services in digital libraries or e-readers, by helping users to interactively read, search and analyze documents efficiently and effectively, exploringly learn about unfamiliar topics with little cognitive burden, or deepen their understanding of a document. Originality/value– This paper designs a tool THC-DAT to analyze document in a THC way. It contributes to overcoming the coarse-analysis drawbacks of existing within-DATs.


【关键词】Digital libraries;E-readers;Document analysis;Context information;hLDA;Multi-topic documents


该文章发表于《Library Hi Tech》2016年版

 Copyright 2018-2019   版权所有:华中师范大学信息管理学院
地址:湖北省武汉市洪山区珞喻路152号 邮编:430079