首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Mining sequential patterns to find ordered events or subsequence patterns is essential in many applications, such as analysis of consumer shopping data, web clickstreams, and biological sequences. Traditional patterns reveal which items are frequently purchased together and in what order. However, information about the time intervals between purchases is missing. Therefore, Yang proposed using multi-time-interval sequential patterns to consider the time intervals between each pair of items in a pattern. For example, 〈Bread, ti1, Milk, (ti2ti1), Jam〉 means that Bread is bought before Milk within an interval of ti1, and Jam is bought after Bread and Milk within intervals of ti2 and ti1, respectively, where ti1 and ti2 are predefined time intervals. Although this new type of pattern considers the intervals between all pairs of items, it contains a sharp boundary problem; that is, when the time interval between two purchases is near the boundary of two predetermined time ranges, we either ignore or overemphasize it. In this study, we applied the concept of fuzzy sets to solve the sharp boundary problem. The discovered patterns, called fuzzy multi-time-interval sequential patterns, describe time intervals in linguistic terms for better understanding. Two algorithms, FuzzMI-Apriori and FuzzMI-PrefixSpan, were developed for mining fuzzy multi-time-interval patterns. Experiments using synthetic and real datasets showed the algorithms’ computational efficiency, scalability, and effectiveness.  相似文献   

2.
Mining non-redundant time-gap sequential patterns   总被引:1,自引:1,他引:0  
Mining sequential patterns is to discover sequential purchasing behaviors for most of the customers from a large amount of customer transactions. An example of such a pattern is that most of the customers purchased item B after purchasing item A, and then they purchased item C after using item B. The manager can use this information to promote item B and item C when a customer purchased item A and item B, respectively. However, the manager cannot know what time the customers will need these products if we only discover the sequential patterns without any extra information. In this paper, we develop a new algorithm to discover not only the sequential patterns but also the time interval between any two items in the pattern. We call this information the time-gap sequential patterns. An example of time-gap sequential pattern is that most of the customers purchased item A, and then they bought item B after m to n days, and then after p to q days, they bought item C. When a customer bought item A, the information about item B can be sent to this customer after m to n days, that is, we can provide the product information in which the customer is interested on the appropriate date.  相似文献   

3.
闫伟  张浩  陆剑峰 《计算机应用》2005,25(7):1584-1586
采用数据挖掘中的时间序列模式对流程企业中的运行数据进行分析,首先采用模糊理论对实际数据进行处理,找出偏离常规运行状态但未到报警界限的参数点,然后采用时间窗对参数离散处理,划分时间间隔得到时间序列数据库。采用TimeSeq_PrefixSpan算法并编程实现,得到了按次序排列且有时间间隔的异常参数点对设备故障影响的规则,起到了对设备故障预警监控的作用。  相似文献   

4.
Many researchers in database and machine learning fields are primarily interested in data mining because it offers opportunities to discover useful information and important relevant patterns in large databases. Most previous studies have shown how binary valued transaction data may be handled. Transaction data in real-world applications usually consist of quantitative values, so designing a sophisticated data-mining algorithm able to deal with various types of data presents a challenge to workers in this research field. In the past, we proposed a fuzzy data-mining algorithm to find association rules. Since sequential patterns are also very important for real-world applications, this paper thus focuses on finding fuzzy sequential patterns from quantitative data. A new mining algorithm is proposed, which integrates the fuzzy-set concepts and the AprioriAll algorithm. It first transforms quantitative values in transactions into linguistic terms, then filters them to find sequential patterns by modifying the AprioriAll mining algorithm. Each quantitative item uses only the linguistic term with the maximum cardinality in later mining processes, thus making the number of fuzzy regions to be processed the same as the number of the original items. The patterns mined out thus exhibit the sequential quantitative regularity in databases and can be used to provide some suggestions to appropriate supervisors.  相似文献   

5.
Radio frequency identification (RFID) technology has been successfully applied to gather customers’ shopping habits from their motion paths and other behavioral data. The customers’ behavioral data can be used for marketing purposes, such as improving the store layout or optimizing targeted promotions to specific customers. Some data mining techniques, such as clustering algorithms can be used to discover customers’ hidden behaviors from their shopping paths. However, shopping path data has peculiar challenges, including variable length, sequential data, and the need for a special distance measure. Due to these challenges, traditional clustering algorithms cannot be applied to shopping path data. In this paper, we analyze customer behavior from their shopping path data by using a clustering algorithm. We propose a new distance measure for shopping path data, called the Operation edit distance, to solve the aforementioned problems. The proposed distance method enables the RFID customer shopping path data to be processed effectively using clustering algorithms. We have collected a real-world shopping path data from a retail store and applied our method to the dataset. The proposed method effectively determined customers’ shopping patterns from the data.  相似文献   

6.
Discovering fuzzy time-interval sequential patterns in sequence databases.   总被引:1,自引:0,他引:1  
Given a sequence database and minimum support threshold, the task of sequential pattern mining is to discover the complete set of sequential patterns in databases. From the discovered sequential patterns, we can know what items are frequently brought together and in what order they appear. However, they cannot tell us the time gaps between successive items in patterns. Accordingly, Chen et al. have proposed a generalization of sequential patterns, called time-interval sequential patterns, which reveals not only the order of items, but also the time intervals between successive items. An example of time-interval sequential pattern has a form like (A, I2, B, I1, C), meaning that we buy A first, then after an interval of I2 we buy B, and finally after an interval of I1 we buy C, where I2 and I1 are predetermined time ranges. Although this new type of pattern can alleviate the above concern, it causes the sharp boundary problem. That is, when a time interval is near the boundary of two predetermined time ranges, we either ignore or overemphasize it. Therefore, this paper uses the concept of fuzzy sets to extend the original research so that fuzzy time-interval sequential patterns are discovered from databases. Two efficient algorithms, the fuzzy time interval (FTI)-Apriori algorithm and the FTI-PrefixSpan algorithm, are developed for mining fuzzy time-interval sequential patterns. In our simulation results, we find that the second algorithm outperforms the first one, not only in computing time but also in scalability with respect to various parameters.  相似文献   

7.
In response to the thriving development in electronic commerce (EC), many on-line retailers have developed Web-based information systems to handle enormous amounts of transactions on the Internet. These systems can automatically capture data on the browsing histories and purchasing records of individual customers. This capability has motivated the development of data-mining applications. Sequential pattern mining (SPM) is a useful data-mining method to discover customers’ purchasing patterns over time. We incorporate the recency, frequency, and monetary (RFM) concept presented in the marketing literature to define the RFM sequential pattern and develop a novel algorithm for generating all RFM sequential patterns from customers’ purchasing data. Using the algorithm, we propose a pattern segmentation framework to generate valuable information on customer purchasing behavior for managerial decision-making. Extensive experiments are carried out, using synthetic datasets and a transactional dataset collected by a retail chain in Taiwan, to evaluate the proposed algorithm and empirically demonstrate the benefits of using RFM sequential patterns in analyzing customers’ purchasing data.  相似文献   

8.
Sequential pattern mining is essential in many applications, including computational biology, consumer behavior analysis, web log analysis, etc. Although sequential patterns can tell us what items are frequently to be purchased together and in what order, they cannot provide information about the time span between items for decision support. Previous studies dealing with this problem either set time constraints to restrict the patterns discovered or define time-intervals between two successive items to provide time information. Accordingly, the first approach falls short in providing clear time-interval information while the second cannot discover time-interval information between two non-successive items in a sequential pattern. To provide more time-related knowledge, we define a new variant of time-interval sequential patterns, called multi-time-interval sequential patterns, which can reveal the time-intervals between all pairs of items in a pattern. Accordingly, we develop two efficient algorithms, called the MI-Apriori and MI-PrefixSpan algorithms, to solve this problem. The experimental results show that the MI-PrefixSpan algorithm is faster than the MI-Apriori algorithm, but the MI-Apriori algorithm has better scalability in long sequence data.  相似文献   

9.
Data-mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. Most conventional data-mining algorithms identify the relationships among transactions using binary values, however, transactions with quantitative values are commonly seen in real-world applications. This paper thus proposes a new data-mining algorithm for extracting interesting knowledge from transactions stored as quantitative values. The proposed algorithm integrates fuzzy set concepts and the apriori mining algorithm to find interesting fuzzy association rules in given transaction data sets. Experiments with student grades at I-Shou University were also made to verify the performance of the proposed algorithm.  相似文献   

10.
Nowadays, many databases record ordered or temporally annotated data, such as Web access logs or genomic sequences. Therefore, sequence mining has become an important research area. Among these data mining approaches, sequential patterns aim at describing frequent behaviors. In the access data of a commercial Web site, one may, for instance, discover that “35% of customers successively buy a PSP then a memory stick and PSP games”. To provide more complete information, fuzzy sequential patterns were designed, including quantitative values within the mining task. Such patterns, considering the previous example, would be “35% of customers buy a PSP, then they buy few games many times, and then they buy a high‐capacity memory stick once.” However, symbolic or fuzzy sequential patterns, in their current form, do not allow to extract temporal tendencies that are typical of sequential data. By means of temporal tendency mining, one may discover in the same access data that “An increasing number of purchases of PSP games during a very short period is frequently followed by a purchase of a high‐capacity memory stick a few days later.” It would be easy to conclude that the users either quickly succeed in registering or make several attempts before they look at the help page within a few seconds. To the best of our knowledge, no method has been designed for discovering this kind of patterns. Therefore, we propose, in this paper, two approaches that extract pattern‐expressing trends or evolution. First, we define evolution patterns that summarize the evolution of the quantities in the data. We explain how they can be obtained from a quantitative sequence database. Second, we define gradual trends in fuzzy sequential data. These trends describe variations in the fulfillment of fuzzy properties according to time. For both kinds of patterns, we developed algorithms that were implemented and tested on real data. © 2009 Wiley Periodicals, Inc.  相似文献   

11.
As dot-com bubble burst in 2002, an uncountable number of small-sized online shopping malls have emerged every day due to many good characteristics of online marketplace, including significantly reduced search costs and menu cost for products or services and easily accessing products or services in the world. However, all the online shopping malls have not continuously flourished. Many of them even vanished because of the lack of customer relationship management (CRM) strategies that fit them. The objective of this paper is to propose CRM strategies for small-sized online shopping mall based on association rules and sequential patterns obtained by analyzing the transaction data of the shop. We first defined the VIP customers in terms of recency, frequency and monetary (RFM) value. Then, we developed a model which classifies customers into VIP or non-VIP, using various data mining techniques such as decision tree, artificial neural network, logistic regression and bagging with each of these as a base classifier. Last, we identified association rules and sequential patterns from the transactions of VIPs, and then these rules and patterns were utilized to propose CRM strategies for the online shopping mall.  相似文献   

12.
Mining interesting user behavior patterns in mobile commerce environments   总被引:6,自引:6,他引:0  
Discovering user behavior patterns from mobile commerce environments is an essential topic with wide applications, such as planning physical shopping sites, maintaining e-commerce on mobile devices and managing online shopping websites. Mobile sequential pattern mining is an emerging issue in this topic, which considers users’ moving paths and purchased items in mobile commerce environments to find the complete set of mobile sequential patterns. However, an important factor, namely users’ interests, has not been considered yet in past studies. In practical applications, users may only be interested in the patterns with some user-specified constraints. The traditional methods without considering the constraints pose two crucial problems: (1) Users may need to filter out uninteresting patterns within huge amount of patterns, (2) Finding the complete set of patterns containing the uninteresting ones needs high computational cost and runtime. In this paper, we address the problem of mining mobile sequential patterns with two kinds of constraints, namely importance constraints and pattern constraints. Here, we consider the importance of an item as its utility (i.e., profit) in the mobile commerce environment. An efficient algorithm, IM-Span (I nteresting M obile S equential Pa tter n mining), is proposed for dealing with the two kinds of constraints. Several effective strategies are employed to reduce the search space and computational cost in different aspects. Experimental results show that the proposed algorithms outperform state-of-the-art algorithms significantly under various conditions.  相似文献   

13.
为了更好地分析购物篮数据,挖掘出潜在客户,序列模式挖掘应运而生。序列模式挖掘是数据挖掘一个重要研究内容,近年来在很多领域得到广泛运用。概述序列模式挖掘的发展现状,研究基本挖掘框架的经典挖掘算法与扩展模型挖掘算法,特别针对近年来出现的新数据形式序列模式挖掘,以及基于零压缩二叉决策图(ZBDD)结构的挖掘算法做了阐述,最后对序列模式挖掘发展趋势进行了展望。  相似文献   

14.
Since Agrawal and Srikant proposed sequential pattern mining in 1995, there have been many scholars working to improve the efficiency and reduce the processing time of algorithms. This study intends to propose a fuzzy AprioriSome algorithm for fuzzy sequential patterns mining with integration with clustering technique, K-means algorithm. Two experiments performed using transaction data provided by a securities firm and foodmarket data from SQL sever 2000 demonstrate the strength of fuzzy AprioriSome sequential pattern mining in mining large quantity of transaction data.  相似文献   

15.
Mining user behavior patterns in mobile environments is an emerging topic in data mining fields with wide applications. By integrating moving paths with purchasing transactions, one can find the sequential purchasing patterns with the moving paths, which are called mobile sequential patterns of the mobile users. Mobile sequential patterns can be applied not only for planning mobile commerce environments but also for analyzing and managing online shopping websites. However, unit profits and purchased numbers of the items are not considered in traditional framework of mobile sequential pattern mining. Thus, the patterns with high utility (i.e., profit here) cannot be found. In view of this, we aim at integrating mobile data mining with utility mining for finding high-utility mobile sequential patterns in this study. Two types of algorithms, namely level-wise and tree-based methods, are proposed for mining high-utility mobile sequential patterns. A series of analyses and comparisons on the performance of the two different types of algorithms are conducted through experimental evaluations. The results show that the proposed algorithms outperform the state-of-the-art mobile sequential pattern algorithms and that the tree-based algorithms deliver better performance than the level-wise ones under various conditions.  相似文献   

16.
Mining fuzzy association rules in a bank-account database   总被引:1,自引:0,他引:1  
This paper describes how we applied a fuzzy technique to a data-mining task involving a large database that was provided by an international bank with offices in Hong Kong. The database contains the demographic data of over 320,000 customers and their banking transactions, which were collected over a six-month period. By mining the database, the bank would like to be able to discover interesting patterns in the data. The bank expected that the hidden patterns would reveal different characteristics about different customers so that they could better serve and retain them. To help the bank achieve its goal, we developed a fuzzy technique, called fuzzy association rule mining II (FARM II). FARM II is able to handle both relational and transactional data. It can also handle fuzzy data. The former type of data allows FARM II to discover multidimensional association rules, whereas the latter data allows some of the patterns to be more easily revealed and expressed. To effectively uncover the hidden associations in the bank-account database, FARM II performs several steps which are described in detail in this paper. With FARM II, the bank discovered that they had identified some interesting characteristics about the customers who had once used the bank's loan services but then decided later to cease using them. The bank translated what they discovered into actionable items by offering some incentives to retain their existing customers.  相似文献   

17.
Sequential Pattern Mining in Multi-Databases via Multiple Alignment   总被引:2,自引:0,他引:2  
To efficiently find global patterns from a multi-database, information in each local database must first be mined and summarized at the local level. Then only the summarized information is forwarded to the global mining process. However, conventional sequential pattern mining methods based on support cannot summarize the local information and is ineffective for global pattern mining from multiple data sources. In this paper, we present an alternative local mining approach for finding sequential patterns in the local databases of a multi-database. We propose the theme of approximate sequential pattern mining roughly defined as identifying patterns approximately shared by many sequences. Approximate sequential patterns can effectively summerize and represent the local databases by identifying the underlying trends in the data. We present a novel algorithm, ApproxMAP, to mine approximate sequential patterns, called consensus patterns, from large sequence databases in two steps. First, sequences are clustered by similarity. Then, consensus patterns are mined directly from each cluster through multiple alignment. We conduct an extensive and systematic performance study over synthetic and real data. The results demonstrate that ApproxMAP is effective and scalable in mining large sequences databases with long patterns. Hence, ApproxMAP can efficiently summarize a local database and reduce the cost for global mining. Furthremore, we present an elegant and uniform model to identify both high vote sequential patterns and exceptional sequential patterns from the collection of these consensus patterns from each local databases.  相似文献   

18.
In this paper, we propose an Interactive Fuzzy Interval Reasoning (FIR) method by combining fuzzy logic with interval computing to better serve Web users in terms of effectiveness and flexibility. Web users may use convenient interval inputs for online shopping. In order to serve different customers based on their preferences, different personalized fuzzy partitions to meet different needs are provided for the different Web customers. The Interactive Fuzzy Interval Reasoning method is used to design the Web shopping agent. Java servlets and Microsoft Access are used to implement the fuzzy Web shopping system.  相似文献   

19.
Sequential pattern mining, including weighted sequential pattern mining, has been attracting much attention since it is one of the essential data mining tasks with broad applications. The weighted sequential pattern mining aims to find more interesting sequential patterns, considering the different significance of each data element in a sequence database. In the conventional weighted sequential pattern mining, usually pre-assigned weights of data elements are used to get the importance, which are derived from their quantitative information and their importance in real world application domains. In general sequential pattern mining, the generation order of data elements is considered to find sequential patterns. However, their generation times and time-intervals are also important in real world application domains. Therefore, time-interval information of data elements can be helpful in finding more interesting sequential patterns. This paper presents a new framework for finding time-interval weighted sequential (TiWS) patterns in a sequence database and time-interval weighted support (TiW-support) to find the TiWS patterns. In addition, a new method of mining TiWS patterns in a sequence database is also presented. In the proposed framework of TiWS pattern mining, the weight of each sequence in a sequence database is first obtained from the time-intervals of elements in the sequence, and subsequently TiWS patterns are found considering the weight. A series of evaluation results shows that TIWS pattern mining is efficient and helpful in finding more interesting sequential patterns.  相似文献   

20.
As more retailers evolve into customer-centric and segment-based business, business intelligence (BI) and customer relationship management (CRM) systems are playing a key role in achieving and maintaining competitive advantage. For the past ten years, the authors have had the rare opportunity of observing and interviewing employees and managers of three different management teams at three separate Fingerhut companies as they experimented with various ITs for their companies. When the first Fingerhut company peaked in 1998, as many as 200 analysts and 40 statisticians mined the database for insights that helped predict consumer shopping patterns and credit behaviour. Data mining and BI helped Fingerhut spot shopping patterns, bring product offerings to the right customers, and nurture customer relationships. By 1998, Fingerhut was the second largest catalogue retailer in the U.S. with revenues nearing $2 billion. However, after Federated acquired Fingerhut in 1999 and made it a subsidiary, Fingerhut Net, it suffered great losses and was eventually liquidated. Finally, a new company, Fingerhut Direct Marketing, was resurrected in 2002 under a new management team, and it once again became successful. What went right? What went wrong? The paper concludes with CRM and BI systems success factors and a discussion of lessons learned.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号