首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
基于DDMINER分布式数据库系统中频繁项目集的更新   总被引:13,自引:0,他引:13  
吉根林  杨明  赵斌  孙志挥 《计算机学报》2003,26(10):1387-1392
给出了一种分布式数据挖掘系统的体系结构DDMINER,对分布式数据库系统中频繁项目集的更新问题进行探讨,既考虑了数据库中事务增加的情况,又考虑了事务删除的情况;提出了一种基于DDMINER的局部频繁项目集的更新算法ULF和全局频繁项目集的更新算法UGF.该算法能够产生较少数量的候选频繁项目集,在求解全局频繁项目集过程中,传送候选局部频繁项目集支持数的通信量为O(n);将文章提出的算法用Java语言加以实现,并对算法性能进行了研究;实验结果表明这些算法是正确、可行的,并且具有较高的效率.  相似文献   

2.
Clustering network sites is a vital issue in parallel and distributed database systems DDBS. Grouping distributed database network sites into clusters is considered an efficient way to minimize the communication time required for query processing. However, clustering network sites is still an open research problem since its optimal solution is NP-complete. The main contribution in this field is to find a near optimal solution that groups distributed database network sites into disjoint clusters in order to minimize the communication time required for data allocation. Grouping a large number of network sites into a small number of clusters effectively increases the transaction response time, results in better data distribution, and improves the distributed database system performance. We present a novel algorithm for clustering distributed database network sites based on the communication time as database query processing is time dependent. Extensive experimental tests and simulations are conducted on this clustering algorithm. The experimental and simulation results show that a better network distribution is achieved with significant network servers load balance and network delay, a minor communication time between network sites is realized, and a higher distributed database system performance is recognized.  相似文献   

3.
Shared-nothing并行事务数据库系统中规则的挖掘与更新算法   总被引:1,自引:0,他引:1  
关联规则是数据挖掘中的一个重要研究内容.本文提出了Shared—nothing并行事务数据库系统(简称SNPDBS)中一种快速的关联规则挖掘算法SNPMAR,并考虑当最小支持度发生变化后SNPDBS中关联规则的高效更新问题,提出了一种有效的关联规则更新算法SNPIUA.  相似文献   

4.
One of the key challenges for structure from motion systems in order to make them robust to failure is the ability to handle outliers among the correspondences. In this paper we present two new algorithms that find the optimal solution in the presence of outliers when the camera undergoes a pure translation. The first algorithm has polynomial-time computational complexity, independently of the amount of outliers. The second algorithm does not offer such a theoretical complexity guarantee, but we demonstrate that it is magnitudes faster in practice. No random sampling approaches such as RANSAC are guaranteed to find an optimal solution, while our two methods do. We evaluate and compare the algorithms both on synthetic and real experiments. We also embed the algorithms in a larger system, where we optimize for the rotation angle as well (the rotation axis is measured by other means). The experiments show that for problems with a large amount of outliers, the RANSAC estimates may deteriorate compared to our optimal methods.  相似文献   

5.
The author analyzes and compares the performance of two timestamp ordering concurrency control algorithms, namely, the basic and multiversion, for database systems. The multiversion algorithm improves the performance of the basic timestamp ordering algorithm by keeping multiple versions of data objects. The author discusses the performance enhancement in the multiversion algorithm over the basic algorithm. The author also discusses the storage overhead due to multiple versions of data objects in the multiversion algorithm. The exact performance model of these algorithms is so complex that it is impossible to find a closed-form solution. The author reduces the complexity of the analysis by analyzing a single transaction in isolation and reflects the presence of other transactions on the isolated transaction by the probability of conflict/abort. The analysis provides useful insight into the performance of these algorithms  相似文献   

6.
为了实现有效的能量利用和数据传输,针对无线EH-MIMO协作系统,提出了在节点的多根天线中选择部分天线进行能量收集和部分天线进行数据传输的遍历最优算法。选择结果即是所提出的EH-MIMO协作模型信道容量最大化问题的遍历解决方案,因此在容量性能方面是最优的。为了降低算法的复杂度,进一步提出了递增天线选择和递减天线选择的两种次优天线选择算法,并与最优算法进行了对比。仿真结果表明,次优算法在系统信道容量和能量效率方面接近最优算法,同时具有较低的复杂度。  相似文献   

7.
Replicated databases that use quorum-consensus algorithms to perform majority voting are prone to deadlocks. Due to the P-out-of-Q nature of quorum requests, deadlocks that arise are generalized deadlocks and are hard to detect. We present an efficient distributed algorithm to detect generalized deadlocks in replicated databases. The algorithm performs reduction of a distributed wait-for-graph (WFG) to determine the existence of a deadlock. If sufficient information to decide the reducibility of a node is not available at that node, the algorithm attempts reduction later in a lazy manner. We prove the correctness of the algorithm. The algorithm has a message complexity of 2e messages and a worst-case time complexity of 2d+2 hops, where e is the number of edges and d is the diameter of the WFG. The algorithm is shown to perform significantly better in both time and message complexity than the best known existing algorithms. We conjecture that this is an optimal algorithm, in time and message complexity, to detect generalized deadlocks if no transaction has complete knowledge of the topology of the WFG or the system and the deadlock detection is to be carried out in a distributed manner  相似文献   

8.
稀疏重构算法中凸松弛法在恢复效率方面、贪婪追踪法在恢复精度方面存在不足,基于遗传算法迭代优化的思想,结合模拟退火以及多种群算法的优势,提出了基于模拟退火遗传算法和基于多种群遗传算法的启发式稀疏重构算法。所提算法均从传统遗传算法易陷入局部最优解的缺陷出发,分别通过保持个体间的差异性和提高种群多样性来搜索待求稀疏信号的全局最优解,并通过理论分析证明了所提算法参数选取及搜索策略的有效性。此外,以阵列信号处理中空间信源的波达方向(DOA)估计问题为例,验证所提算法的有效性。仿真结果表明,相较于正交匹配追踪OMP算法和基于l1范数奇异值分解的l1-SVD算法,所提算法提高了DOA估计的精度,且降低了运算复杂度,使其快速收敛至全局最优解。  相似文献   

9.
分布式数据库系统由于数据的分布和冗余使得分布式查询处理增加了许多新的内容和复杂性,因此分布式查询处理的优化显得尤为重要。本文简要介绍分布式查询优化的目标、策略,并针对分布式数据库系统的查询优化,讲述三个典型的算法:INGRES算法、SystemR*算法、SDD-1算法,并进行对比、优化、总结,最后对SDD-1算法进行改进。  相似文献   

10.
In this paper, we propose an ordinal optimization (OO) theory-based algorithm to solve the yet to be explored distributed state estimation with continuous and discrete variables problems (DSECDP) of large distributed power systems. The proposed algorithm copes with a huge amount of computational complexity problem in large distributed systems and obtains a satisfactory solution with high probability based on the OO theory. There are two contributions made in this paper. First, we have developed an OO theory-based algorithm for DSECDP in a deregulated environment. Second, the proposed algorithm is implemented in a distributed power system to select a good enough discrete variable solution. We have tested the proposed algorithm for numerous examples on the IEEE 118-bus and 244-bus with four subsystems using a 4-PC network and compared the results with other competing approaches: Genetic Algorithm, Tabu Search, Ant Colony System and Simulated Annealing methods. The test results demonstrate the validity, robustness and excellent computational efficiency of the proposed algorithm in obtaining a good enough feasible solution.  相似文献   

11.
Preventive measures sometimes fail to defect malicious attacks. With attacks on data-intensive applications becoming an ever more serious threat, intrusion tolerant database systems are a significant concern. The main objective of such systems is to detect attacks, and to assess and repair the damage in a timely manner. This paper focuses on efficient damage assessment and repair in distributed database systems. The complexity caused by data partition, distributed transaction processing, and failures makes intrusion recovery much more challenging than in centralized database systems. This paper identifies the key challenges and presents an efficient algorithm for distributed damage assessment and repair.  相似文献   

12.
传统分布式的网络架构制约路由算法的创新,软件定义网络的出现为路由算法的优化提供了新思路。已有研究中,启发式算法广泛应用于服务质量路由,但由于计算复杂度高而无法在大型网络中应用。而其他算法均存在不同程度的问题,要么复杂度较高,要么算法性能较差,如最短路径算法。基于 SDN 分级分域架构,提出了 LC-LD 路由算法,综合时延条件和代价度量约束并在计算复杂度和算法性能之间保持平衡。仿真分析表明,LC-LD路由算法在有较低的计算复杂度的同时还有较高的服务质量路由选路性能。  相似文献   

13.
A method for concurrency control in distributed database management systems that increases the level of concurrent execution of transactions, called ordering by serialization numbers (OSN), is proposed. The OSN method works in the certifier model and uses time-interval techniques in conjunction with short-term locks to provide serializability and prevent deadlocks. The scheduler is distributed, and the standard transaction execution policy is assumed, that is, the read and write operations are issued continuously during transaction execution. However, the write operations are copied into the database only when the transaction commits. The amount of concurrency provided by the OSN method is demonstrated by log classification. It is shown that the OSN method provides more concurrency than basic timestamp ordering and two-phase locking methods and handles successfully some logs which cannot be handled by any of the past methods. The complexity analysis of the algorithm indicates that the method works in a reasonable amount of time  相似文献   

14.
Incremental linear discriminant analysis for face recognition.   总被引:3,自引:0,他引:3  
Dimensionality reduction methods have been successfully employed for face recognition. Among the various dimensionality reduction algorithms, linear (Fisher) discriminant analysis (LDA) is one of the popular supervised dimensionality reduction methods, and many LDA-based face recognition algorithms/systems have been reported in the last decade. However, the LDA-based face recognition systems suffer from the scalability problem. To overcome this limitation, an incremental approach is a natural solution. The main difficulty in developing the incremental LDA (ILDA) is to handle the inverse of the within-class scatter matrix. In this paper, based on the generalized singular value decomposition LDA (LDA/GSVD), we develop a new ILDA algorithm called GSVD-ILDA. Different from the existing techniques in which the new projection matrix is found in a restricted subspace, the proposed GSVD-ILDA determines the projection matrix in full space. Extensive experiments are performed to compare the proposed GSVD-ILDA with the LDA/GSVD as well as the existing ILDA methods using the face recognition technology face database and the Carneggie Mellon University Pose, Illumination, and Expression face database. Experimental results show that the proposed GSVD-ILDA algorithm gives the same performance as the LDA/GSVD with much smaller computational complexity. The experimental results also show that the proposed GSVD-ILDA gives better classification performance than the other recently proposed ILDA algorithms.  相似文献   

15.
With the prevalence of Internet services and the increase of their complexity, there is a growing need to improve their operational reliability and availability. While a large amount of monitoring data can be collected from systems for fault analysis, it is hard to correlate this data effectively across distributed systems and observation time. In this paper, we analyze the mass characteristics of user requests and propose a novel approach to model and track transaction flow dynamics for fault detection in complex information systems. We measure the flow intensity at multiple checkpoints inside the system and apply system identification methods to model transaction flow dynamics between these measurements. With the learned analytical models, a model-based fault detection and isolation method is applied to track the flow dynamics in real time for fault detection. We also propose an algorithm to automatically search and validate the dynamic relationship between randomly selected monitoring points. Our algorithm enables systems to have self-cognition capability for system management. Our approach is tested in a real system with a list of injected faults. Experimental results demonstrate the effectiveness of our approach and algorithms  相似文献   

16.
Because distributed manufacturing technology is the foundation of modernized production and traditional heuristic methods exhibit problems of high complexity and low efficiency, this paper designs a scheduling algorithm based on the singular value decomposition heuristic (SVDH) method. The algorithm uses the device distribution and the transportation relationship between devices in a distributed manufacturing system. The algorithm takes the sequence relationship between tasks and the distance between devices as the implicit relationship between the task and the device. The algorithm makes use of the implicit relationship to amend the processing time matrix of the task and corrects the processing time matrix that contains the transportation relationship. Singular value decomposition principal component analysis is performed on the corrected processing time to find the most suitable processing device for each process, and an initial solution matrix is established. The heuristic solution is used to optimize the initial solution to find the optimal scheduling result based on the initial solution matrix. The establishment of the initial solution can effectively reduce the computational complexity of the heuristic solution, realize a parallelizing solution, and improve the efficiency of the heuristic solutions. In addition, the SVDH scheduling result has a lower transfer time between devices due to the consideration of the topology of tasks and devices, that is, the transit time. In this paper, the experiments are conducted on the heuristic performance, scheduling results, and transportation time. The experimental results show the advantages of SVDH over general heuristic algorithms in terms of efficiency and transit time.  相似文献   

17.
A Distribution Design Methodology for Object DBMS   总被引:1,自引:0,他引:1  
The design of distributed databases involves making decisions on the fragmentation and placement of data and programs across the sites of a computer network. The first phase of the distribution design in a top-down approach is the fragmentation phase, which clusters in fragments the information accessed simultaneously by applications. Most distribution design algorithms propose a horizontal or vertical class fragmentation. However, the user has no assistance in the choice between these techniques. In this work we present a detailed methodology for the design of distributed object databases that includes: (i) an analysis phase, to indicate the most adequate fragmentation technique to be applied in each class of the database schema; (ii) a horizontal class fragmentation algorithm, and (iii) a vertical class fragmentation algorithm. Basically, the analysis phase is responsible for driving the choice between the horizontal and the vertical partitioning techniques, or even the combination of both, in order to assist distribution designers in the fragmentation phase of object databases. Experiments using our methodology have resulted in fragmentation schemas offering a high degree of parallelism together with an important reduction of irrelevant data.  相似文献   

18.
Distributed systems generate a large amount of monitoring data such as log files to track their operational status. However, it is hard to correlate such monitoring data effectively across distributed systems and along observation time for system management. In previous work, we proposed a concept named flow intensity to measure the intensity with which internal monitoring data reacts to the volume of user requests. We calculated flow intensity measurements from monitoring data and proposed an algorithm to automatically search constant relationships between flow intensities measured at various points across distributed systems. If such relationships hold all the time, we regard them as invariants of the underlying systems. Invariants can be used to characterize complex systems and support various system management tasks. However, the computational complexity of the previous invariant search algorithm is high so that it may not scale well in large systems with thousands of measurements. In this paper, we propose two efficient but approximate algorithms for inferring invariants in large-scale systems. The computational complexity of new randomized algorithms is significantly reduced, and experimental results from a real system are also included to demonstrate the accuracy and efficiency of our new algorithms.  相似文献   

19.
《Pattern recognition letters》2001,22(6-7):799-811
Feature selection is used to improve the efficiency of learning algorithms by finding an optimal subset of features. However, most feature selection techniques can handle only certain types of data. Additional limitations of existing methods include intensive computational requirements and inability to identify redundant variables. In this paper, we present a novel, information-theoretic algorithm for feature selection, which finds an optimal set of attributes by removing both irrelevant and redundant features. The algorithm has a polynomial computational complexity and is applicable to datasets of a mixed nature. The method performance is evaluated on several benchmark datasets by using a standard classifier (C4.5).  相似文献   

20.
基于比例公平的多用户MIMO-OFDM系统自适应资源分配算法*   总被引:2,自引:1,他引:1  
针对传统多用户MIMO-OFDM系统中自适应资源分配算法计算复杂度较高、实时性不强、无法保证用户间公平性等问题,提出了一种低复杂度的自适应子载波、比特及功率分配算法。在子载波分配上,该算法能够在兼顾比例速率约束的前提下使系统发射功率达到最小化;在比特及功率分配上,该算法将非线性优化问题转换为线性优化问题,在保证系统性能的同时显著降低计算量。仿真结果表明,该算法具有良好的性能,能够有效降低计算量,并使系统容量在用户间分配得更加公平和合理。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号