Search results: 1-6 of 6 records found for “Chinese Text Categorization”. Query time: 0.101 seconds
Experimental Study on Representing Units in Chinese Text Categorization
byte 3-gram Experimental Study Chinese Text Categorization
2009/1/22
This paper is a comparative study on representing units in Chinese text categorization. Several kinds of representing units, including byte 3-gram, Chinese character, Chinese word, and Chinese word wi...
A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization
Semi-Quantitative Analysis Character-Bigrams
2015/1/24
Raising High-Degree Overlapped Character Bigrams into Trigrams for Dimensionality Reduction in Chinese Text Categorization (with figures)
Trigrams Dimensionality Reduction
2015/1/26
High dimensionality of feature space is a crucial obstacle for Automated Text Categorization. According to the characteristics of Chinese character N-grams, this paper reveals that there exists a kind...
Eliminating High-degree Biased Character Bigrams for Dimensionality Reduction in Chinese Text Categorization (with figures)
Character Bigrams Dimensionality Reduction
2015/1/26
High dimensionality of feature space is a main obstacle for Text Categorization (TC). In a candidate feature set consisting of Chinese character bigrams, there exist a number of bigrams which are high...
A Study on Feature Weighting in Chinese Text Categorization (with figures)
Feature Weighting Chinese Text Categorization
2015/1/26
In Text Categorization (TC) based on Vector Space Model, feature weighting and feature selection are major problems and difficulties. This paper proposes two methods of weighting features by combining...
Chinese Text Categorization Based on the Binary Weighting Model with Non-Binary Smoothing (with figures)
Binary Weighting Non-binary Smoothing
2015/1/26
In Text Categorization (TC) based on the vector space model, feature weighting is vital for the categorization effectiveness. Various non-binary weighting schemes are widely used for this purpose. By ...