Mining pinyin-to-character conversion rules from large-scale corpus : a rough set approach
This paper introduces a rough set technique for solving the problem of mining Pinyin-to-character (PTC) conversion rules. It first presents a text-structuring method by constructing a language information table from a corpus for each pinyin, which it will then apply to a free-form textual corpus. Da...
Veröffentlicht in: | IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society. - 1997. - 34(2004), 2 vom: 24. Apr., Seite 834-44 |
---|---|
1. Verfasser: | |
Weitere Verfasser: | , |
Format: | Aufsatz |
Sprache: | English |
Veröffentlicht: |
2004
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society |
Schlagworte: | Journal Article |