Journal of Applied Science and Engineering

Published by Tamkang University Press

1.30

Impact Factor

2.10

CiteScore

Rui Yang1 and Dong Ye This email address is being protected from spambots. You need JavaScript enabled to view it.1

1School of Electrical Engineering, Zhengzhou University of Science and Technology Zhengzhou 450000 China 


Received: November 12, 2019
Accepted: May 9, 2020
Publication Date: December 1, 2020

 Copyright The Author(s). This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are cited.


Download Citation: ||https://doi.org/10.6180/jase.202012_23(4).0005  

ABSTRACT


Data stream is continuous and uncertain. Frequent pattern mining for data stream will cause that data distributes unevenly and concept drift. In order to improve mining efficiency and decrease data storage, we propose a hybrid time decay model and probability decay window model (HTPDWM) for data stream closed frequent pattern mining. This new method is divided into three steps. First, we adopt mining closed frequent pattern of sliding window model and time decay model in data stream to deal with new and old things. Second, we use probability decay window model and closure operator to calculate expected support degree and improve efficiency of close pattern mining respectively. Third, we use decay factor to correct concept drift and data distributes evenly. Finally, we make experiments to verify the effectiveness of the new method. Results show that HTPDWM can present stable with different sliding window and have better performance when processing time and memory space.


Keywords: Data stream; Frequent pattern mining; Time decay model; Probability decay window model; Closure opera-tor; Decay factor


REFERENCES


  1. [1] H. Wang, P.S. Yu, and J. Han. Mining Concept-Drifting Data Streams. In Data Mining & Knowledge Discovery Handbook, page 789. 2010.
  2. [2] LinT eng,Hang Li,and Shoulin Yin. Modified pyramid dual tree direction filter-based image denoising via curvature scale and non local mean multigrade remnant filter. International Journal of Communication Systems, 31(16), nov 2018.
  3. [3] Shoulin Yin, Ye Zhang, and Shahid Karim. Large Scale Remote Sensing Image Segmentation Based on Fuzzy Region Competition and Gaussian Mixture Model. IEEE Access, 6:26069–26080, may 2018.
  4. [4] Zhen Li, Yahui Yang, Guangxing Zhang, and Guangcheng Qin. An adaptive method for identifying heavy hitters combining sampling and data streaming counting. In ICACTE 2010 - 2010 3rd International Conference on Advanced Computer Theory and Engineering, Proceedings, volume 6, 2010.
  5. [5] Jeffrey Xu Yu, Zhihong Chong, Hongjun Lu, Zhenjie Zhang,andAoyingZhou. A false negative approach to mining frequent item sets from highspeed transactional data streams. Information Sciences, 176(14):1986–2015, jul 2006.
  6. [6] Show Jane Yen, Yue Shi Lee, and Chiu Kuang Wang. An efficient algorithm for incrementally mining frequent closed item sets. Applied Intelligence,40(4):649–668,2014.
  7. [7] H F Li, S Y Lee, and M K Shan. An efficient algorithm for mining frequent itemsets over the entire history of data streams. Proc of First International ..., 2004.
  8. [8] M.Jeya Sutha and F. Ramesh Dhanaseelan. An efficient method for detection of breast cancer based on closed frequent item sets mining. Journal of Medical Imaging and Health Informatics, 5(5):987–994, sep 2015.
  9. [9] Feng Zhang, Min Liu, Feng Gui, Weiming Shen, Abdallah Shami, and Yunlong Ma. A distributed frequent itemset mining algorithm using spark for big data analytics. Cluster Computing, 18(4):1493–1501, 2015.
  10. [10] M. A.S. Srinivas, A. Prasanthi, and Y. Narasimhulu. Analysis of a prey-predator harvesting time delay model with interval biological parameters. International Journal of Mathematical Modelling and Numerical Optimisation, 6(2):114–140, 2015.
  11. [11] Debing Mei, Min Zhao, Hengguo Yu, Chuanjun Dai, and Yi Wang. Nonlinear dynamics of a nutrient phytoplankton model with time delay. Discrete Dynamics in Nature and Society, 2015, 2015.
  12. [12] R Agrawal, R Srikant Proc. 20th int. conf. very large data Bases, and Undefined 1994. Fast Algorithms For Mining Association Rules In Datamining. International Journal of Scientific & Technology Research,2(12):13– 24, 2013.
  13. [13] Jinxia Su, Yanwen Li, and Xuejing Zhao. Data stream clustering by fast density-peak-search. Statistics and its Interface, 11(1):183–189, 2018.
  14. [14] Maciej Jaworski, Leszek Rutkowski, and Miroslaw Pawlak. Hybrid splitting criterion in decision trees for data stream mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 9693, pages 60–72. Springer Verlag, 2016.


    



 

2.1
2023CiteScore
 
 
69th percentile
Powered by  Scopus

SCImago Journal & Country Rank

Enter your name and email below to receive latest published articles in Journal of Applied Science and Engineering.