首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 156 毫秒
1.
软件容错是提高软件可靠性的重要手段,可重用软部件必须提供软件容错的措施,本文将Ada提供的异常处理机制作为容错的工具,分析了用这种工具实现多版本程序设计、恢复块技术和防卫式程序设计等容错技术的途径,介绍了在可重用部件中构造异常处理程序直接实现软件容错的方法,最后还讨论了异常处理机制在容错软件中的应用准则。  相似文献   

2.
基于模糊重用库的容错软件开发   总被引:1,自引:0,他引:1  
容错设计是改进软件可靠性的有效途径,然而由于设计多样性的实现很困难且开销大,导致该项技术目前仅应用于一些安全关键系统之中,为此,提出了一种将重用技术引入到容错软件开发过程中的应用框架,框架以模糊重用库为核心,增强的可靠性开发模型为基础,参数化可靠性指标为相异性设计的依据,简化了容错软件的开发过程,在方法和实践上改进了软件系统的可靠性,介绍了重用库结构,增强的可靠性开发方法,基于重用的相异性设计及其容错软件开发的支持。  相似文献   

3.
软件容错技术初探   总被引:3,自引:0,他引:3  
对软件容错中的时间容错和信息容错作了介绍,并探讨了软件容错技术的两种典型结构模型:恢复模块法和N重版本程序结构。对以上两种方法的可靠性进行了分析。  相似文献   

4.
韩振林 《测控技术》2002,21(12):64-66
主要介绍在工业企业网络测量中应用通过磁盘阵列,双机容错软件及代理监测程序实现系统的容错技术,提高了系统的可靠及稳定性。  相似文献   

5.
随着计算机系统复杂度的不断增加,软件失效造成的损失越来越普遍。如何降低系统损失,提高系统的可靠性成为人们面对的主要问题。软件容错技术目前是降低软件失效,提高系统可靠性的主要途径。本文根据软件容错技术的发展现状,在分析传统软件容错技术缺陷的基础上,介绍了一种新的主动软件容错技术--软件自愈,并对基于时间策略的自愈模型进行了详细的分析。  相似文献   

6.
软件容错的方法、模型与实现   总被引:1,自引:0,他引:1  
随着计算机硬件技术的飞速发展,软件开发的低效率与不可靠性已成为阻碍计算技术继续发展的主要障碍.本文分析了计算机系统内部的主要故障来源,研究了软件故障的性质,指出软件容错的基础是程序设计差异,并以此为基础研究了软件容错的基本方法,构造了容错软件模型,介绍并评价了当前流行的软件容错技术,解决了以向后恢复为基础的软件容错技术在分布系统中的实现问题.  相似文献   

7.
计算机系统的容错技术方法   总被引:1,自引:0,他引:1  
随着计算机技术的发展,计算机系统的可靠性越来越受到人们的重视,而容错技术是提高可靠性的一种有效方法。本文研究了计算机容错技术的各种方法,如硬件容错、信息容错、软件容错等,介绍了TMR(三模冗余)的原理及其缺点,详细研究了两种最基本的软件容错技术NVP和RB。这些容错技术可有效提高计算机系统的可靠性。  相似文献   

8.
描述了平台概况,介绍了常用的冗余方法、软件容错技术、双机容错技术和集群技术等容错技术,并结合我区社区管理信息化创新平台建设,阐述了该平台采取的接入层虚拟化双机冗余、软件容错技术、数据库服务器双机容错技术与服务器虚拟化集群技术等多项容错技术.通过采用这些技术显著提高了该信息系统的可靠性,确保了社区管理与综合服务的质量.  相似文献   

9.
高可靠性是计算机系统所要解决和实现的重要目标,而容错技术是提高可靠性的有效途径。章论述了可靠性的基本概念和3种故障模式;介绍了计算机容错设计所采用的硬件冗余、信息冗余和时间冗余技术,着重分析了RAID技术和纠错码技术;指出了计算机容错设计的发展趋势及面临的挑战。  相似文献   

10.
工作站机群系统已成为分布式并行处理发展的主流方向之一,随着机群系统应用领域的逐渐拓展和规模的不断扩大,人们对其可靠性的要求日益提高,设计高可靠的群机系统,需要着重研究其系统容错技术,本文叙述了并行异构回卷恢复和检查点派生,实现透明的可移植容错和负载均衡能力,避免调整检查点就构成全局一致性状态,不仅使BSP应用程序自治容错能力,而且能够在机群(Clusters)间迁移,保持系统负载均衡,重点介绍了检查点设置,检查点派生、卷回、进程迁移技术。  相似文献   

11.
As multiprocessor systems become more complex, their reliability will need to increase as well. In this paper we propose a novel technique which is applicable to a wide variety of distributed real-time systems, especially those exhibiting data parallelism. System-level fault tolerance involves reliability techniques incorporated within the system hardware and software whereas application-level fault tolerance involves reliability techniques incorporated within the application software. We assert that, for high reliability, a combination of system-level fault tolerance and application-level fault tolerance works best. In many systems, application-level fault tolerance can be used to bridge the gap when system-level fault tolerance alone does not provide the required reliability. We exemplify this with the RTHT target tracking benchmark and the ABF beamforming benchmark.  相似文献   

12.
软件避错是提高软件可靠性的主要方法之一,它包含程序检验,测试,正确性证明等技术,然而,随  相似文献   

13.
With the scaling up of high-performance computing systems in recent years, their reliability has been descending continuously. Therefore, system resilience has been regarded as one of the critical challenges for large-scale HPC systems. Various techniques and systems have been proposed to ensure the correct execution and completion of parallel programs. This paper provides a comprehensive survey of existing software resilience approaches. Firstly, a classification of software resilience approaches is presented; then we introduce major approaches and techniques, including checkpointing, replication, soft error resilience, algorithm-based fault tolerance, fault detection and prediction. In addition, challenges exposed by system-scale and heterogeneous architecture are also discussed.  相似文献   

14.
本文考虑软件故障严重程度,并采用C&K面向对象度量集,以支持向量机分析方法为数学工具,建立一种基于面向对象软件易发性故障预测模型。实验结果表明,与基于朴素贝叶斯的预测模型、随机预测模型和NNge预测模型相比,本文提出的预测模型对于高严重程度故障、低严重程度故障以及未划分故障严重程度的情形均获得较好的预测效果。  相似文献   

15.
一种中间件服务容错配置管理方法   总被引:1,自引:0,他引:1  
李军国  黄罡  邹键  梅宏 《计算机学报》2007,30(10):1696-1704
提出一种基于运行时刻软件体系结构的容错管理方法,支持开发者和管理员针对不同中间件服务失效定制合适的故障检测和修复机制.首先,运行时刻软件体系结构自动构造构件依赖视图和错误传播①视图,为理解和分析整个系统的可靠性提供全局视图;然后,操作运行时刻软件体系结构配置容错机制;最后利用AOP技术将容错机制插装到中间件中,使其具备指定的容错能力.上述过程在一个可视化工具的辅助下半自动实施,并在J2EE中间件上得到验证.  相似文献   

16.
在高可信软件的设计和开发中,软件容错是提高系统可信性的一种实现技术之一. 容错性就是指软件在故障出现时保证提供服务的能力,对退化故障进行容错的一种处理方式就是依靠冗余技术. 本文在分析结构冗余及其对可信性的影响的基础上,在基于构件的可信软件系统中提出了对核心构件进行冗余的机制,包括单个构件的双模冗余结构、组合构件的双模冗余结构和构件的三取二冗余及其扩展结构,并给出了其故障检测和判断方法. 同时,在各种冗余结构的基础上对系统可靠性能进行分析.  相似文献   

17.
In this paper we present an approach to reliable distributed computing, which incorporates fault tolerance into applications at low cost, in terms of both run-time performance and programming effort required to construct reliable application software. In our model fault tolerance is based on distributed consistent checkpointing and rollback-recovery integrated with a user-level reliable transmission protocol. By employing novel techniques 8and algorithms, our approach is distinguished from other consistent checkpointing schemes by the following features: first, minimum communication overhead for constructing a consistent distributed checkpoint and catching messages in transit during checkpointing; second, tolerance to message losses due to site failures or unreliable non-FIFO networks; and third, efficient checkpointing and recovery of persistent state, i.e., user files. Based on the model, a software library prototype called Libra has been implemented for supporting fault tolerance in distributed message-passing applications with file operations. The library provides an easy to use programming interface including message-passing and file I/O primitives, which hides the complexity of both fault-tolerant network communications and checkpointing and recovering user files from the application level. Experience with a number of long-running distributed applications shows that Libra can provide fault tolerance in a cost-effective manner.  相似文献   

18.
软件双冗余容错系统的容错能力和性能分析   总被引:1,自引:0,他引:1  
双冗余是比较常用的冗余容错设计方法.软件双冗余容错系统通过冗余执行完成相同功能的两个软件副本,并检查它们的结果,根据两者结果是否一致来判断是否出现了错误.建立了软件双冗余容错系统的运行时模型,并引入了软件双冗余容错系统的容错能力的概念.根据该模型分析了单个软件副本的容错能力对软件双冗余容错系统的容错能力和性能的影响.分析结果显示,提高单个软件副本的容错能力不仅能够提高软件双冗余容错系统的容错能力,还能够提高系统的性能.但在极端情况下,双冗余容错系统的容错能力也可能会小于单个软件副本的容错能力.  相似文献   

19.
BackgroundSoftware fault prediction is the process of developing models that can be used by the software practitioners in the early phases of software development life cycle for detecting faulty constructs such as modules or classes. There are various machine learning techniques used in the past for predicting faults.MethodIn this study we perform a systematic review of studies from January 1991 to October 2013 in the literature that use the machine learning techniques for software fault prediction. We assess the performance capability of the machine learning techniques in existing research for software fault prediction. We also compare the performance of the machine learning techniques with the statistical techniques and other machine learning techniques. Further the strengths and weaknesses of machine learning techniques are summarized.ResultsIn this paper we have identified 64 primary studies and seven categories of the machine learning techniques. The results prove the prediction capability of the machine learning techniques for classifying module/class as fault prone or not fault prone. The models using the machine learning techniques for estimating software fault proneness outperform the traditional statistical models.ConclusionBased on the results obtained from the systematic review, we conclude that the machine learning techniques have the ability for predicting software fault proneness and can be used by software practitioners and researchers. However, the application of the machine learning techniques in software fault prediction is still limited and more number of studies should be carried out in order to obtain well formed and generalizable results. We provide future guidelines to practitioners and researchers based on the results obtained in this work.  相似文献   

20.
宇宙射线辐射所导致的瞬态故障一直是航天计算面临的最主要挑战之一.而随着集成电路制造工艺的持续进步,现代处理器的性能在大幅度提高的同时,其可信性也正日益面临着瞬态故障的严重威胁.当前针对瞬态故障的容错技术可大致分为两类:基于硬件实现和基于软件实现.相比较前者,后者由于在实现成本和灵活性等方面的优势而备受关注.本文首先概述...  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号