期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

张晨光李玉鑑《计算机工程与应用》2009,45(21):139-141

眉毛图像的分割,由于受到毛发、姿势及个体差异的影响,是一个非常困难的问题。提出了一种利用半监督学习技术进行彩色眉毛图像分割的方法,首先通过手工在眉毛图像上简单画上几条线标注部分眉毛点和非眉毛点,然后利用半监督学习技术完成眉毛图像分割并从中提取纯眉毛图像,最后通过实验说明该方法具有非常好的分割效果,可用于眉毛识别的前期预处理。相似文献

2.

基于双模型交互学习的半监督医学图像分割

下载免费PDF全文

方超伟李雪李钟毓焦李成张鼎文《自动化学报》2023,49(4):805-819

在医学图像中, 器官或病变区域的精准分割对疾病诊断等临床应用有着至关重要的作用, 然而分割模型的训练依赖于大量标注数据. 为减少对标注数据的需求, 本文主要研究针对医学图像分割的半监督学习任务. 现有半监督学习方法广泛采用平均教师模型, 其缺点在于, 基于指数移动平均(Exponential moving average, EMA)的参数更新方式使得老师模型累积学生模型的错误知识. 为避免上述问题, 提出一种双模型交互学习方法, 引入像素稳定性判断机制, 利用一个模型中预测结果更稳定的像素监督另一个模型的学习, 从而缓解了单个模型的错误经验的累积和传播. 提出的方法在心脏结构分割、肝脏肿瘤分割和脑肿瘤分割三个数据集中取得优于前沿半监督方法的结果. 在仅采用30%的标注比例时, 该方法在三个数据集上的戴斯相似指标(Dice similarity coefficient, DSC)分别达到89.13%, 94.15%, 87.02%. 相似文献

3.

哈希图半监督学习方法及其在图像分割中的应用 总被引：2，自引：0，他引：2

下载免费PDF全文

张晨光李玉鑑《自动化学报》2010,36(11):1527-1533

图半监督学习(Graph based semi-supervised learning, GSL)方法需要花费大量时间构造一个近邻图, 速度比较慢. 本文提出了一种哈希图半监督学习(Hash graph based semi-supervised learning, HGSL)方法, 该方法通过局部敏感的哈希函数进行近邻搜索, 可以有效降低图半监督学习方法所需的构图时间. 图像分割实验表明, 该方法一方面可以达到更好的分割效果, 使分割准确率提高0.47%左右; 另一方面可以大幅度减小分割时间, 以一幅大小为300像素×800像素的图像为例, 分割时间可减少为图半监督学习所需时间的28.5%左右. 相似文献

4.

基于自监督学习的病理图像层次分割

吴崇数林霖薛蕴菁时鹏《计算机应用》2020,40(6):1856-1862

在苏木精-伊红(HE)染色病理图像中,细胞染色分布的不均匀和各类组织形态的多样性给病理图像的自动分割带来极大挑战。为解决该问题,提出了一种基于自监督学习的病理图像三步层次分割方法,对病理图像中各类组织进行由粗略到精细的全自动逐层分割。首先,根据互信息的计算结果在RGB色彩空间中进行特征选择;其次,采用K-means聚类将图像初步分割为各类组织结构的色彩稳定区域与模糊区域;然后,以色彩稳定区域为训练集采用朴素贝叶斯分类对模糊区域进行进一步分割,得到完整的细胞核、细胞质和胞外间隙这三类组织结构;最后,对细胞核部分进行结合形状和色彩强度的混合分水岭分割得到细胞核间的精确边界,进而量化计算细胞核个数、核占比、核质比等指标。对脑膜瘤HE染色病理图像的分割实验结果表明,所提方法对于染色和细胞形态差异保持较高的鲁棒性,各类组织区域分割误差在5%以内,在细胞核分割精度的对比实验中平均正确率在96%以上,满足临床自动图像分析的要求,其量化结果可以为定量病理分析提供依据。相似文献

5.

基于符号规范化割的半监督学习与图像分割

刘先锋杨予丹石静陈明《小型微型计算机系统》2020,(7):1547-1552

半监督学习利用少量的辅助信息以提升学习器的性能.基于图的学习方法是较为典型的半监督学习实现途径,利用图来表达和分析数据,能够处理复杂的数据分布.不同于这类方法通常所利用的无符号图,符号网络具有更强的表达能力,其负边能够表达额外的信息.本文基于符号网络的规范化割(Signed Normalized Cut,SNCut),提出了可处理成对约束的半监督聚类,通过实验验证了负边给半监督学习带来的附加价值.将SNCut应用于图像分割问题,获得的分割效果明显优于规范化割.为了进一步强化边界对齐性,引入马尔科夫随机场(Markov Random Field,MRF)正则化项,构建SNCut&MRF目标函数,并提出基于界优化和图割的求解算法.结果表明,SNCut&MRF相比一些典型分割方法有更好的分割性能,在边界处表现良好. 相似文献

6.

基于半监督学习和条件概率的膝关节软骨MRI图像分割

下载免费PDF全文

马春帅程远志《计算机系统应用》2025,34(1):100-109

本文提出了一种基于半监督学习和条件概率的膝关节软骨分割方法, 旨在解决医学图像分割中标注样本数量不足和质量差的问题. 现有的标签树嵌入深度学习模型难以对网络输出间的层次关系进行有效建模, 而本文提出了一种条件到无条件的混合训练与任务级一致性结合的方法, 有效地利用了标签之间的层次关系和相似性, 提高了分割精度. 具体来说, 本文使用一个联合预测像素级分割图和目标的几何感知水平集表示的双任务深度网络. 通过可微分的任务变换层, 将水平集表示转换为近似的分割映射. 同时, 本文在标签和未标记数据上引入了基于水平线的分割映射与直接预测的分割映射之间的任务级一致性正则化. 在两个公共数据集上的大量实验表明, 本文的方法可以通过包含未标记的数据来显著提高性能. 相似文献

7.

基于朴素贝叶斯的半监督学习遥感影像分类 总被引：1，自引：0，他引：1

杨伟方涛许刚《计算机工程》2010,36(20):167-169

为提高分类器识别率,减少标注样本使用数量,提出一种基于朴素贝叶斯的半监督学习方法。研究基于该方法的分类器分类效果,采用遥感影像数据作为训练和测试集,与基于朴素贝叶斯的全监督学习分类器分类效果作比较。实验结果表明,当标注样本与非标注样本比例在1:2~1:9时,半监督学习可以利用比全监督学习更少的标注样本,达到更高的分类精度。相似文献

8.

耦合先验拉普拉斯坐标的半监督图像分割算法

曹昀炀王涛《计算机应用》2019,39(9):2695-2700

针对传统半监督图像分割方法难以精确分割分散或细小区域的缺陷,提出了一种耦合标签先验和拉普拉斯坐标模型的半监督图像分割算法。首先,扩展拉普拉斯坐标（LC）模型,通过引入标签先验项进一步精确表征未标记像素点与已标记像素点之间的关系。然后,基于矩阵方程的求导优化,有效估计像素属于标签的后验概率,以实现图像目标分割的任务。得益于标签先验的引入,所提算法对分散或细小区域的分割更加鲁棒。最后,在多个公开的半监督分割数据集上实验结果表明,相比拉普拉斯坐标算法,所提算法的分割准确率获得了显著提升,验证了所提算法的有效性。相似文献

9.

基于新型间谍技术的半监督自训练正例无标记学习

李婷婷吕佳范伟亚《计算机应用》2019,39(10):2822-2828

正例无标记(PU)学习中的间谍技术极易受噪声和离群点干扰,导致划分的可靠正例不纯,且在初始正例中随机选择间谍样本的机制极易造成划分可靠负例时效率低下,针对这些问题提出一种结合新型间谍技术和半监督自训练的PU学习框架。首先,该框架对初始有标记样本进行聚类并选取离聚类中心较近的样本来取代间谍样本,这些样本能有效地映射出无标记样本的分布结构,从而更好地辅助选取可靠负例;然后对间谍技术划分后的可靠正例进行自训练提纯,采用二次训练的方式取回被误分为正例样本的可靠负例。该框架有效地解决了传统间谍技术在PU学习中分类效率易受数据分布干扰以及随机间谍样本影响的问题。通过9个标准数据集上的仿真实验结果表明,所提框架的平均分类准确率和F-值均高于基本PU学习算法(Basic_PU)、基于间谍技术的PU学习算法(SPY)、基于朴素贝叶斯的自训练PU学习算法(NBST)和基于迭代剪枝的PU学习算法(Pruning)。相似文献

10.

关联词约束的半监督文本分类方法

韩红旗朱东华刘嵩汪雪锋《计算机工程与应用》2010,46(4):113-116

提出了一种没有训练集情况下实现对未标注类别文本文档进行分类的问题。类关联词是与类主体相关、能反映类主体的单词或短语。利用类关联词提供的先验信息,形成文档分类的先验概率,然后组合利用朴素贝叶斯分类器和EM迭代算法,在半监督学习过程中加入分类约束条件,用类关联词来监督构造一个分类器,实现了对完全未标注类别文档的分类。实验结果证明,此方法能够以较高的准确率实现没有训练集情况下的文本分类问题,在类关联词约束下的分类准确率要高于没有约束情况下的分类准确率。相似文献

11.

Learning naive Bayes classifiers from positive and unlabelled examples with uncertainty

Jiazhen He Xue Li Peng Shi 《International journal of systems science》2013,44(10):1805-1825

Traditional classification algorithms require a large number of labelled examples from all the predefined classes, which is generally difficult and time-consuming to obtain. Furthermore, data uncertainty is prevalent in many real-world applications, such as sensor network, market analysis and medical diagnosis. In this article, we explore the issue of classification on uncertain data when only positive and unlabelled examples are available. We propose an algorithm to build naive Bayes classifier from positive and unlabelled examples with uncertainty. However, the algorithm requires the prior probability of positive class, and it is generally difficult for the user to provide this parameter in practice. Two approaches are proposed to avoid this user-specified parameter. One approach is to use a validation set to search for an appropriate value for this parameter, and the other is to estimate it directly. Our extensive experiments show that the two approaches can basically achieve satisfactory classification performance on uncertain data. In addition, our algorithm exploiting uncertainty in the dataset can potentially achieve better classification performance comparing to traditional naive Bayes which ignores uncertainty when handling uncertain data. 相似文献

12.

Learning Bayesian classifiers from positive and unlabeled examples

Borja Calvo Pedro Larraaga Jos A. Lozano 《Pattern recognition letters》2007,28(16):2375-2384

The positive unlabeled learning term refers to the binary classification problem in the absence of negative examples. When only positive and unlabeled instances are available, semi-supervised classification algorithms cannot be directly applied, and thus new algorithms are required. One of these positive unlabeled learning algorithms is the positive naive Bayes (PNB), which is an adaptation of the naive Bayes induction algorithm that does not require negative instances. In this work we propose two ways of enhancing this algorithm. On one hand, we have taken the concept behind PNB one step further, proposing a procedure to build more complex Bayesian classifiers in the absence of negative instances. We present a new algorithm (named positive tree augmented naive Bayes, PTAN) to obtain tree augmented naive Bayes models in the positive unlabeled domain. On the other hand, we propose a new Bayesian approach to deal with the a priori probability of the positive class that models the uncertainty over this parameter by means of a Beta distribution. This approach is applied to both PNB and PTAN, resulting in two new algorithms. The four algorithms are empirically compared in positive unlabeled learning problems based on real and synthetic databases. The results obtained in these comparisons suggest that, when the predicting variables are not conditionally independent given the class, the extension of PNB to more complex networks increases the classification performance. They also show that our Bayesian approach to the a priori probability of the positive class can improve the results obtained by PNB and PTAN. 相似文献

13.

Effective spam filtering: A single-class learning and ensemble approach

Chih-Ping Hsueh-Ching Tsang-Hsiang 《Decision Support Systems》2008,45(3):491

The annoyance of spam emails increasingly plagues both individuals and organizations. In response, most of prior research investigates spam filtering as a classical text categorization task, in which training examples must include both spam (positive examples) and legitimate (negative examples) emails. However, in many spam filtering scenarios, obtaining legitimate emails for training purpose can be more difficult than collecting spam and unclassified emails. Hence, it is more appropriate to construct a classification model for spam filtering that uses positive training examples (i.e., spam) and unlabeled instances only and does not require legitimate emails as negative training examples. Several single-class learning techniques, such as PNB and PEBL, have been proposed in the literature. However, they incur inherent limitations with regard to spam filtering. In this study, we propose and develop an ensemble approach, referred to as E2, to address these limitations. Specifically, we follow the two-stage framework of PEBL but extend each stage with an ensemble strategy. The empirical evaluation results from two spam filtering corpora suggest that our proposed E2 technique generally outperforms benchmark techniques (i.e., PNB and PEBL) and exhibits more stable performance than its counterparts. 相似文献

14.

基于分歧的半监督学习 总被引：9，自引：0，他引：9

下载免费PDF全文

周志华《自动化学报》2013,39(11):1871-1878

传统监督学习通常需使用大量有标记的数据样本作为训练例,而在很多现实问题中,人们虽能容易地获得大批数据样本,但为数据提供标记却需耗费很多人力物力.那么,在仅有少量有标记数据时,可否通过对大量未标记数据进行利用来提升学习性能呢？为此,半监督学习成为近十多年来机器学习的一大研究热点.基于分歧的半监督学习是该领域的主流范型之一,它通过使用多个学习器来对未标记数据进行利用, 而学习器间的\"分歧\"对学习成效至关重要.本文将综述简介这方面的一些研究进展. 相似文献

15.

Word Sense Disambiguation by Learning Decision Trees from Unlabeled Data 总被引：1，自引：0，他引：1

Seong-Bae Park Byoung-Tak Zhang Yung Taek Kim 《Applied Intelligence》2003,19(1-2):27-38

In this paper we describe a machine learning approach to word sense disambiguation that uses unlabeled data. Our method is based on selective sampling with committees of decision trees. The committee members are trained on a small set of labeled examples which are then augmented by a large number of unlabeled examples. Using unlabeled examples is important because obtaining labeled data is expensive and time-consuming while it is easy and inexpensive to collect a large number of unlabeled examples. The idea behind this approach is that the labels of unlabeled examples can be estimated by using committees. Using additional unlabeled examples, therefore, improves the performance of word sense disambiguation and minimizes the cost of manual labeling. Effectiveness of this approach was examined on a raw corpus of one million words. Using unlabeled data, we achieved an accuracy improvement up to 20.2%. 相似文献

16.

疾病-病症和病症-治疗物质的关系抽取研究

冯钦林杨志豪林鸿飞《计算机工程与应用》2017,53(10):251-257

随着生物医学文献的快速增长,在海量的生物医学文献中存在大量有关疾病、病症和治疗物质的信息,这些信息对疾病的治疗和药物的研制有着重要的意义。针对疾病与治疗物质之间的信息抽取,重点训练两个模型,即疾病与病症模型和病症与治疗物质模型。疾病与病症模型判断一种疾病是否会存在或者导致一种生理现象的产生;病症与治疗物质模型判断一种物质是否改变人的生理现象或者生理过程。使用半监督学习的Tri-training的方法,利用大量未标注数据辅助少量有标注数据进行训练提高分类性能。实验结果表明,Tri-training方法中利用未标注数据有助于提高实验结果;且在训练过程中使用集成学习的思想将三个分类器器集成在一起,提高了学习性能。相似文献

17.

基于朴素Bayes组合的简易集成分类器

下载免费PDF全文

宋丛威《计算机系统应用》2021,30(2):265-267

朴素Bayes分类器是一种简单有效的机器学习工具.本文用朴素Bayes分类器的原理推导出\"朴素Bayes组合\"公式,并构造相应的分类器.经过测试,该分类器有较好的分类性能和实用性,克服了朴素Bayes分类器精确度差的缺点,并且比其他分类器更加快速而不会显著丧失精确度. 相似文献

18.

一种使用未标记样本聚类信息的自训练方法*

刘伟涛许信顺《计算机应用研究》2010,27(9):3341-3344

为了有效地利用结构信息,提出了一种新的自学习算法,算法中利用聚类方法从自标记样本中选择可信度高的样本,同时用一个数据编辑方法从这些可信度高的样本中剔除被错标的可能性较高的样本。算法在UCI数据上进行了验证,效果和收敛速度比对比算法要好,说明引入聚类选择候选样本是有效的。相似文献

19.

基于图半监督学习的医学图像检索

吴梦麟陈强孙权森《计算机辅助设计与图形学学报》2013,25(9):1354-1360

针对医学图像检索中底层特征不能完整地描述图像的高层语义的问题,提出一种基于图的半监督学习框架的医学图像检索算法.首先根据图像之间的距离关系构建图模型,并在标记传播过程中加入密度相似性约束,得到查询图像的类别归属度,即图像的视觉语义表示;然后提取图像分块SIFT特征,用词袋进行描述,以获取图像的局部特征;最后设计了结合视觉语义和局部特征的相似性度量准则.在ImageCLEFmed上的实验结果表明,该算法能够有效地表达图像的视觉语义,检索效率优于单一底层特征检索. 相似文献