中文
Profile
No content
张翔
Associate Professor
Paper Publications
A Combined Method for Chinese Micro-blogging Topic Tracking
Release time:2024-08-09 Hits:
Affiliation of Author(s):
信息与控制工程学院
Journal:
Machine Tool Technology,Mechatronics and Information Engineering
Key Words:
中文关键字:中文微博;话题追踪;LDA;Bagging,英文关键字:Chinese micro-blogging;,topic tracking;LDA;Bagging
Abstract:
To the problem of Chinese micro-blogging topic tracking, a method combined LDA model and Bagging of ensemble learning was proposed. The method firstly used the LDA hidden topic modeling, effectively solved the issue that the dataset’s sparsity of the short text, then made the C4.5 decision tree as a weak classifier, through examples resampling to obtain multiple training set, compounding the training sets according to the voting rule, and ultimately getting the similarity of the micro-blogging topic. Experiments show that, compared with the model based on single vector model, classical TF-IDF and the tracking method of C.45Bagging similarity computing, this method have a better performance on precision, recall ratio and F1 value
Note:
张翔
Co-author:
尚勃,朱雨洁
First Author:
zhangxiang,董丽丽
Indexed by:
Journal paper
Volume:
卷:644
Issue:
期:
Page Number:
页:2816-2821
Translation or Not:
no
Date of Publication:
2014-01-01

Pre One:Application of Spark Parallelization Technology in Architectural Text Classification

Next One:面向中文文本分类的c4.5Bagging算法研究