After couples of days hard working, I can now introduce CONCISE: A Simple and Clear Concordance Software. Definitions from dictionary states that Concise means "giving a lot of information clearly and in a few words; brief but comprehensive." That is the way the software Concise works--giving only necessary and important information, such as concordance, collocates, word clusters, and words list.
Concise is a simple and clear concordance software on Mac OS X, works well with Chinese (中文) tokens. However, the Chinese Tokenizer was **NOT** included in this version. There are still too many problems to tokenize Chinese sentences. Concise does only concordance analysis, collocation analysis, word clusters, word list, and generate collocational network data so fat.
Features:
- Simple and Clear
- Working well with Chinese
- Supporting concordance analysis, collocation analysis, word clusters, and words list
- Generating Collocational Network data
Further works:
- output query results to text and Microsoft Excel format.
- keyword analysis (comparing with another corpus)
- Chinese Tokenizer (perhaps)
Concise is developed entirely by JAVA and SWT. Hopefully, Concise will be able to run on multiple platforms. Nonetheless, I am not going to release Concise right now. So, let's wait and see.
There is also a sample visualized collocational network (node word is 強震[strong earthquake]). This work is a description of Japan Earthquake on March 11th, 2011. Data were collected from news service at Yahoo! Taiwan among March 11th to April 11th, 2011. See Descriptions of Japan Earthquake and Tsunami News Reports (日本地震海嘯新聞事件的時間描述, in Chinese) and News Analysis of Japan Earthquake and Nuclear Crisis (日本地震與核輻射新聞報導, in Chinese) for details.
留言
張貼留言