- Posted 2017-11-27, Jiangsu
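Article ID: 1003-0077 (2017)00-0000-00
Chinese Morphemic Concept Extraction and Semantic Word-Formation Analysis*
Liu Yang 1,2, Lin Zi 1,3, Kang Sichen 1,3
(1. Key Laboratory of Computational Linguistics (Ministry of Education), Peking University, Beijing 100871;
2. Institute of Computational Linguistics, Peking University, Beijing 100871;
3. Department of Chinese Language and Literature, Peking University, Beijing 100871)
Abstract: As the basic meaning-bearing units, morphemes and the word-formation analysis built
on them are both the starting point for semantic analysis of Chinese as a paratactic language
and the key to recognizing and understanding word meaning. This paper proposes a new method
and perspective for identifying Chinese semantic primitives and analyzing semantic
word-formation: "synonymous morpheme sets" are formed by computing the similarity of morpheme
senses and are used to represent "morphemic concepts", and a "morphemic concept hierarchy" is
built by drawing on Generative Lexicon Theory and object-oriented ideas. The Chinese semantic
word-formation analysis built on these foundations also makes new progress in global semantic
analysis, data mining, and related areas. These ideas, practices, and the construction of
language resources are expected to advance related work in the humanities and in computational
applications.
Keywords: morpheme; morpheme sense; morphemic concept; semantic primitive; semantic word-formation
CLC number: TP391 Document code: A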
Towards a Description of
Chinese Morphemic Concepts and Semantic Word-Formation
Liu Yang 1,2, Lin Zi 1,3, Kang Sichen 1,3
(1. Key Laboratory of Computational Linguistics (Ministry of Education), Peking University, Beijing 100871, China;
2. Institute of Computational Linguistics, Peking University, Beijing 100871, China;
3. Department of Chinese Language and Literature, Peking University, Beijing 100871, China)
Abstract: Morphemes, and the word-formation analysis built on them, are both the starting point
for the semantic analysis of Chinese as a paratactic language and the key to understanding the
meaning of words. This paper presents a novel approach to exploring Chinese Semantic
Primitives and using them for Semantic Word-Formation Analysis. First, Synonymous
Morpheme Sets, which denote Morphemic Concepts, are formed by computing the similarity of
Chinese morpheme glosses. Then a Morphemic Concept Hierarchy, serving as a systematic
description of the Chinese Semantic Primitives, is built on principles of Generative Lexicon
Theory and object-oriented ideas. Built on these, Chinese Semantic Word-Formation Analysis
makes new progress in global semantic analysis and data mining. Th