Method of discovering similar patents based on vector space model and characteristics of patent documents
Cite this article: CHEN Ji-xi, GU Xin-jian, CHEN Guo-hai, WEI Jiang. Method of discovering similar patents based on vector space model and characteristics of patent documents[J]. Journal of Zhejiang University (Engineering Science), 2009, 43(10): 1848-1852. DOI: 10.3785/j.issn.1008-973X.2009.10.018
Authors: CHEN Ji-xi, GU Xin-jian, CHEN Guo-hai, WEI Jiang
Affiliation: (1. Institute of Manufacturing Engineering, Zhejiang Province Key Laboratory of Advanced Manufacturing Technology, Zhejiang University, Hangzhou 310027, China; 2. School of Management, Zhejiang University, Hangzhou 310058, China)
Funding: Supported by the National Science and Technology Support Program of China during the "Eleventh Five-Year" Plan (2006BAF01A02) and the National High Technology Research and Development Program of China ("863" Program) (2007AA04Z101).
Abstract: To determine the similarity of patent documents and help enterprises apply for, protect, and exploit patents, a method for discovering similar patents based on the vector space model (VSM) and the characteristics of patent documents is proposed. A patent model tree is built from the information characteristics of patent documents, and the tree and its nodes are defined. By analyzing the attribute values of the tree's nodes, patent documents are categorized with VSM-based text categorization, using the weighted similarity of the patent title and the patent abstract as the categorization criterion. Within each category, similar patents are then identified by the weighted similarity of patent document characteristics. Several methods for determining the weights of patent document elements are analyzed according to the practical needs of enterprises. An application example shows that the method can effectively perform patent categorization and similar patent retrieval.

Keywords: patent documents; patent retrieval; text categorization; vector space model (VSM)
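
The abstract gives no formulas, so the following is only a minimal sketch of how a weighted title/abstract similarity over a vector space model might look. It assumes raw term-frequency vectors, cosine similarity, and illustrative weights of 0.4 (title) and 0.6 (abstract). The function names, the tokenizer, and the weights are hypothetical and are not taken from the paper, which also discusses several ways of choosing such weights.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Map a text to a sparse bag-of-words term-frequency vector.

    Assumption: a simple lowercase alphanumeric tokenizer; the paper's
    actual term extraction and term weighting are not described here.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(u, v):
    """Cosine similarity of two sparse term vectors (0.0 if either is empty)."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def patent_similarity(p, q, w_title=0.4, w_abstract=0.6):
    """Weighted sum of title similarity and abstract similarity.

    Assumption: the weights are illustrative placeholders, not values
    reported by the authors.
    """
    s_title = cosine(term_vector(p["title"]), term_vector(q["title"]))
    s_abstract = cosine(term_vector(p["abstract"]), term_vector(q["abstract"]))
    return w_title * s_title + w_abstract * s_abstract
```

Each patent is passed as a dictionary with `title` and `abstract` keys. With weights that sum to 1, the score stays in the range 0 to 1, so a threshold on it could serve as a categorization criterion before the finer within-category comparison the abstract describes.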