期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

吴向军姜云飞凌应标《软件学报》2007,18(3):490-504

提出了谓词之间的一种相似关系,并用该相似关系得到可实现某谓词的动作集.利用该动作集中所有动作的公共前提谓词和公共效果谓词,提取出隐含在动作描述中的领域知识,并给出了描述领域知识的一种形式化方法.最后,对具体的规划问题,可利用领域知识判断出初始状态或目标状态中存在的矛盾.该领域知识的提取策略已应用于智能规划器StepByStep之中,所获取的领域知识对选择待实现的谓词提供了必要的理论依据. 相似文献

2.

自动获取派生谓词规划领域的通用规划

《计算机学报》2014,(8)

通用规划(解)是针对某个领域的像算法一样的规划解,通过对其的解释可以直接得出具体问题的规划解,而不需要调用任何规划系统.但是目前通用规划的提取只能在一些简单或者特殊的领域中进行,没有推广到复杂或者一般的规划领域.该文提出在包含派生谓词的规划领域自动获取通用规划的方法.与已有获取方法不同的是:首先,基于派生谓词规则,文中方法明确指出派生谓词目标与动作效果之间的依赖关系,用以完善通用规划中动作应用的目的;其次,在提取过程中借助角色来帮助识别规划解中的循环结构.实验结果表明,文中方法不仅容易在派生谓词规划领域中获取通用规划,而且还能够以较好的性能求解一类以派生谓词为主要目标的规划"难"题.该文是在派生谓词规划领域中提取通用规划的首创性工作. 相似文献

3.

特征谓词知识树分解策略的研究

吴向军边芮凌应标姜云飞《计算机研究与发展》2011,48(2)

搜索空间约减是智能规划研究中的重要内容之一.谓词知识树是一种特殊的树状结构,它表达了规划领域中实现同一谓词的所有动作.在规划求解过程中,这些动作的前提条件通常是不能同时得到满足的.因此,提出了知识树的分解原理以及基于特征前提的知识树分解策略,并给出了相应的分解算法.对任意一个规划领域,利用该分解算法可将知识树分解成若干个较小规模的知识子树,使其对具体规划状态具有更强的针对性.在规划求解过程中使用知识子树可以避免一些不必要的动作搜索,提高规划效率.实验结果表明分解算法是有效的. 相似文献

4.

智能规划器StepByStep的研究和开发 总被引：3，自引：0，他引：3

吴向军姜云飞凌应标《软件学报》2008,19(9):2243-2264

智能规划器是智能规划研究成果的重要表现形式,规划器的求解效率和规划质量是智能规划理论研究的直接反映.首先介绍智能规划器的一般结构和StepByStep规划器的总体结构,然后详细阐述StepByStep规划器各组成部分所采用的方法和策略,定义谓词知识树来提取领域知识.在谓词知识树的基础上定义谓词规划树,并用各种策略来提高规划树的生成效率.在谓词规划树的基础上设计StepByStep的规划策略,最后用8个规划器对3个具有代表性的基准规划领域及其规划问题进行实际的求解实验,分析了StepByStep规划器在求解效率和规划质量上的具体表现.实验数据表明,StepByStep规划器的规划策略对3个不同规划领域都具有很好的指导作用,验证了领域知识在规划求解过程中的实际价值. 相似文献

5.

GP——基于规划图的遗传规划算法 总被引：5，自引：2，他引：5

陈蔼祥姜云飞张学农刘国英《计算机学报》2007,30(1):153-160

图规划是智能规划领域近年来出现的一种新的规划方法,对智能规划的发展有着重要的影响.图规划的规划产生过程分为两个主要步骤,首先用动作的前提条件和效果产生一个谓词和动作交错出现的图--规划图,然后在规划图中抽取规划解.而第二步往往更为困难和耗时.文章依据遗传算法对规划图提出一种新的解抽取方法,以一种简明、直观的形式给出染色体的编码方式,并在此基础上定义了各种遗传操作算子,将遗传算法引入图规划算法,充分利用遗传算法的并行全局搜索能力实现规划解的搜索.实验表明,在求解大规模的规划问题时,文中的遗传规划算法在求解速度和找到的规划解的质量两方面均显示出优越性. 相似文献

6.

智能规划中的动作互斥编码方式研究

下载免费PDF全文

杨超吕帅刘磊魏唯张波吴俊《计算机工程》2011,37(9):213-215

以规划领域中的动作为对象,研究规划方法中的动作互斥编码方式。介绍基于规划图的动作互斥编码、利用提取领域相关信息生成动作效果的直接阻碍与间接阻碍编码,以及依赖于域转移图动作间的长距离互斥编码,说明每类动作互斥编码的构造方法及其削减搜索空间、提高求解效率的作用。相似文献

7.

在部分观测环境下学习规划领域的派生谓词规则

饶东宁蒋志华姜云飞邓玉辉《计算机学报》2015,(7)

文中提出了一种在部分观测环境下学习规划领域的派生谓词规则的方法。在规划领域描述语言（PDDL）中,派生谓词用来描述动作的非直接效果,是规划领域模型和搜索控制知识的重要组成部分。然而,对于大多数规划领域而言,从无到有地构造派生谓词规则是不容易的。因此,研究自动获取派生谓词的推导规则是有意义的。已有研究工作提出通过修订一个初始的不完备的领域理论来获取推导规则的方法,但是它们的主要缺点在于待学习谓词的训练例的数量非常少,这是因为训练例按照非常有限的方式来生成。而更本质的原因在于它们假设环境是不可观测的。其实,在现实生活中很多动作的非直接效果是可以观测的,或者通过简单的目测或者通过专门的工具。因此文中提出增加观测来反映动作的非直接效果,以便增加待学习谓词的训练例数目从而改善学习的精准度。此外,为了补充一些在归纳学习过程中学习不到的谓词,文中还提出了一个后处理方法来使得学习到的规则在语义上更完整。通过在派生谓词基准领域上的实验表明,文中所提出的方法是可行有效的。更深远的意义在于,文中的研究工作有利于规划领域的自动建模或者控制知识的自动获取的研究与实现。相似文献

8.

基于Graphplan的ARBAC策略安全分析方法 总被引：3，自引：0，他引：3

刘强姜云飞饶东宁《计算机学报》2009,32(5)

策略安全分析是访问控制系统保持安全状态的重要机制.针对具有角色继承层次和角色静态互斥特征的分布式访问控制系统,文中采用智能规划技术进行策略安全分析.首先,提出了策略安全分析问题向规划问题转换的整体思路,定义"虚动作"模型以描述角色继承关系,使用领域互斥表述静态互斥角色,引入领域公理处理ARBAC策略的开放世界假设问题和前提条件中的负谓词问题.其后,运用图规划(Graphplan)算法求解转换而来的规划问题,重点分析了领域公理对规划图中部分NooP动作的剪枝作用,提出了领域公理在规划图扩展阶段的应用方式以及据此改进的图规划算法,介绍了已开发的面向ARBAC策略安全分析实验型规划系统.最后,进行了应用示例说明. 相似文献

9.

智能规划中基于遗传算法的动作模型学习 总被引：4，自引：0，他引：4

赖志锋姜云飞《计算机学报》2007,30(6):945-953

在动作间的状态未知条件下,利用遗传算法,从不完整的领域描述和规划实例中学习动作模型,并且设计了AMLS-GA(Action Model Learning System Based on Genetic Algorithm)系统来具体实现这一思想.作者为每一个动作构建一个可能谓词集,这个谓词集覆盖了动作前提表、增加表和删除表中的所有谓词.采用二进制编码的方式,把动作模型编码成GA搜索空间中的一个假设,学习过程是在标准的遗传算法框架下进行的.把学习结果的正确性定义为尽可能多的解释规划实例,并且通过实验的方法对比学习到的模型与专家预定义模型之间的差别.实验结果表明,算法能在较短的时间内,学习到一个逼近专家描述的动作模型. 相似文献

10.

模型检测规划中的状态之间的可达关系研究 总被引：1，自引：0，他引：1

文中华黄巍刘任任姜云飞《计算机学报》2012,35(8):1634-1643

当前,对基于模型检测规划研究的算法中存在大量的冗余计算,一些不可能参与构成解的状态动作序偶被反复筛选.文中给出了一种在不确定规划领域求规划解的新思路:在求规划解之前,找到不确定状态转移系统的状态之间的可达关系,从而根据状态之间的可达关系进行约简.提出了不确定状态转移系统的超图、超图的邻接矩阵和可达矩阵等概念,设计了用超图的邻接矩阵求不确定状态转移系统中状态之间可达关系的方法.利用不确定状态转移系统的超图、超图的邻接矩阵和状态之间的可达关系获得了关于弱规划解、强规划解和强循环规划解的一些重要性质.这些性质是关于一些状态动作序偶是否不可能参与构成弱规划解、强规划解和强循环规划解的结论.通过这些性质可以将大量的状态动作序偶直接去掉,从而大幅度简化求规划解的过程,提高求规划解效率. 相似文献

11.

部分观测环境下的时态动作模型学习方法

下载免费PDF全文

刘胜赵彤洲蔡敦波《计算机工程与应用》2018,54(2):29-39

已有的动作模型学习方法针对确定的或不确定的瞬时动作，而未考虑动作模型中的时态关系。提出了在部分观测环境下自动学习时态动作模型的方法。设计了学习动作持续时间表达式一般形式的两阶段线性回归方法。通过分析命题时间戳设计了动作前提、效果与动作之间时态关系算子的构建算法。在“国际智能规划竞赛”的规划问题集上进行了实验，结果表明了该方法的有效性。相似文献

12.

Discovering relational and numerical expressions from plan traces for learning action models

Segura-Muros José Á. Pérez Raúl Fernández-Olivares Juan 《Applied Intelligence》2021,51(11):7973-7989

In this paper, we propose a domain learning process build on a machine learning-based process that, starting from plan traces with (partially known) intermediate states, returns a planning domain with numeric predicates, and expressive logical/arithmetic relations between domain predicates written in the planning domain definition language (PDDL). The novelty of our approach is that it can discover relations with little information about the ontology of the target domain to be learned. This is achieved by applying a selection of preprocessing, regression, and classification techniques to infer information from the input plan traces. These techniques are used to prepare the planning data, discover relational/numeric expressions, or extract the preconditions and effects of the domain’s actions. Our solution was evaluated using several metrics from the literature, taking as experimental data plan traces obtained from several domains from the International Planning Competition. The experiments demonstrate that our proposal—even with high levels of incompleteness—correctly learns a wide variety of domains discovering relational/arithmetic expressions, showing F-Score values above 0.85 and obtaining valid domains in most of the experiments.

相似文献

13.

Synthesizing plans that contain actions with context-dependent effects1

Edwin P. D. Pednault 《Computational Intelligence》1988,4(3):356-372

This paper presents a method of solving planning problems that involve actions whose effects change according to the situations in which they are performed. The approach is an extension of the conventional planning methodology in which plans are constructed through an iterative process of scanning for goals that are not yet satisfied, inserting actions to achieve them, and introducing subgoals to these actions. This methodology was originally developed under the assumption that one would be dealing exclusively with actions that produce the same effects in every situation. The extension involves introducing additional subgoals to actions above and beyond the preconditions of execution normally introduced. These additional subgoals, called secondary preconditions, ensure that the actions are performed in contexts conducive to producing the effects we desire. This paper defines and analyzes secondary preconditions from a mathematically rigorous standpoint and demonstrates how they can be derived from regression operators. 相似文献

14.

在部分观测环境下的不确定动作模型学习

饶东宁蒋志华姜云飞《软件学报》2014,25(1):51-63

近年来,动作模型学习引起了研究人员的极大兴趣.可是,尽管不确定规划已经研究了十几年,动作模型学习的研究仍然集中于经典的确定性动作模型上.提出了在部分观测环境下学习不确定动作模型的算法,该算法可应用于假定人们对转移系统一无所知的情形下进行,输入只有动作-观测序列.在现实世界中,这样的场景很常见.致力于动作是由简单逻辑结构组成的、且观测以一定频率出现的一类问题的研究.学习过程分为3个步骤：首先,计算命题在状态中成立的概率;然后,将命题抽取成效果模式,再抽取前提;最后,对效果模式进行聚类以去除冗余.在基准领域上进行的实验结果表明,动作模型学习技术可推广到不确定的部分观测环境中. 相似文献

15.

Normative Communication Models for Agent

Renée Elio Anita Petrinjak 《Autonomous Agents and Multi-Agent Systems》2005,11(3):273-305

An agent message is an attempted action upon the information state of the receiver that, if successful, would cause the receiver to move to a new information state. A model of normative communication can define when messages are not merely unsuccessful but instead are illegal or impossible actions upon the receiver’s internal state. The model uses the preconditions of the other core message types, coupled with a model of task interdependencies, agent roles, and belief-desire-intention elements, to define the preconditions for sending a canonical not-understood error message. By defining the space of messages that are legal actions on an agent’s internal state, a normative communication model also defines a set of ‘reasons’ that can accompany the error message. A not-understood error message signals a mismatch between agent interaction models and the accompanying reason opens the possibility for agents to realign their respective models. The paper discusses the matters arising from this possibility. This approach assumes that normative communication behavior reflects normative domain behavior. It also assumes that each agent accesses the normative model, in contrast with more centralized frameworks for defining normative interaction among agents and identifying interaction errors. 相似文献

16.

Updating action domain descriptions

Eiter T Erdem E Fink M Senko J 《Artificial Intelligence》2010,174(15):1172-1221

相似文献

17.

一种路径概率感知的无线传感器网络目标跟踪协议

张力王国军彭勇《计算技术与自动化》2009,28(2):107-110

目标跟踪是无线传感器网络的一种重要应用。目标跟踪协议可分为使用预测和不使用预测两种。许多使用预测的协议假设被跟踪对象保持当前的速度和方向继续运动,但是,在目标沿着一定路径运动时,当处于路径拐点或是分叉点,遵循这种假设的算法很有可能失败。因此,提出一种路径概率感知的目标跟踪无线传感器网络协议。利用路径的拐点和分叉点的目标方向概率信息来预测目标运动新的方向,并且根据节点相对于路径以及邻居的位置来分配感应、通信和运算的能量。提出与路径走向概率相适应的调度方式。概率分析表明,本协议提供更好的目标跟踪效果并且节省能量。相似文献

18.

Improving concurrency in temporal plans

Amol Dattatraya Mali Michael Osowski 《Artificial Intelligence Review》2011,35(3):191-209

There is an increasing interest in solving temporal planning problems. Identification and propagation of mutual exclusion relations between actions can significantly enhance the efficiency of a planner. Current definitions of mutually exclusive actions severely restrict their concurrency. In this paper, we report on thirteen groups of permanently mutually exclusive PDDL 2.1, Level 3 actions. We report on sixteen types of potentially-conflicting interactions between two actions where concurrency may be maximized by adjusting starting time of one of the two actions. We discuss several examples where actions can overlap despite conflicting preconditions and/or effects. The processes executing these actions are mostly independent. We report on a new domain-rewriting technique called “baiting” in order to improve the concurrency in temporal plans. Baiting actions lure a temporal planner into improving concurrency. The technique involves splitting user-identified operators. We report on three types of baiting (standard, double and nested) and show their suitability for various types of action interactions. Baiting requires minimal modification to the planning code. Baiting does not increase the branching in search trees. Baiting does not affect the soundness and completeness of a temporal planner. Our empirical evaluation shows that the makespans of plans generated by efficient planner Sapa with baited domain are significantly lower than makespans of plans generated without baiting. 相似文献