Abstract
The problem of frequent pattern mining has been widely studied in the literature because of its numerous applications to a variety of data mining problems such as clustering and classification. In addition, frequent pattern mining also has numerous applications in diverse domains such as spatiotemporal data, software bug detection, and biological data. The algorithmic aspects of frequent pattern mining have been explored very widely. This chapter provides an overview of these methods, as it relates to the organization of this book.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
C. Aggarwal. Outlier Analysis, Springer, 2013.
C. Aggarwal. Social Sensing, Managing and Mining Sensor Data, Springer, 2013.
C. C. Aggarwal, and P. S. Yu. Online generation of Association Rules, ICDE Conference, 1998.
R. Agrawal, and R. Srikant. Fast Algorithms for Mining Association Rules in Large Databases, VLDB Conference, pp. 487–499, 1994.
R. Agrawal, and R. Srikant. Mining Sequential Patterns, ICDE Conference, 1995.
C. C. Aggarwal, and P. S. Yu. A New Framework for Itemset Generation, ACM PODS Conference, 1998.
C. Aggarwal and P. Yu. Privacy-preserving data mining: Models and Algorithms, Springer, 2008.
C. C. Aggarwal, and H. Wang. Managing and Mining Graph Data Data. Springer 2010.
C. C. Aggarwal, and C. K. Reddy. Data Clustering: Algorithms and Applications, CRC Press, 2013.
R. Agrawal, T. Imielinski, and A. Swami. Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering, 5(6), pp. 914–925, 1993.
R. Agrawal, J. Gehrke, D. Gunopulos, P. Raghavan. Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications, ACM SIGMOD Conference, 1998.
R. Agarwal, C. C. Aggarwal, and V. V. V. Prasad. Depth-first Generation of Long Patterns, ACM KDD Conference, 2000. Also appears as IBM Research Report, RC, 21538, 1999.
R. Agarwal, C. C. Aggarwal, and V. V. V. Prasad. A Tree Projection Algorithm for Generation of Frequent Itemsets, Journal of Parallel and Distributed Computing, 61(3), pp. 350–371, 2001. Also appears as IBM Research Report, RC 21341, 1999.
C. C. Aggarwal, N. Ta, J. Wang, J. Feng, M. Zaki. Xproj: A framework for projected structural clustering of XML documents, ACM KDD Conference, 2007.
C. C. Aggarwal, Y. Li, J. Wang, J. Feng. Frequent Pattern Mining with Uncertain Data, ACM KDD Conference, 2009.
C. Aggarwal, Y. Li, P. Yu, and R. Jin. On dense pattern mining in graph streams, VLDB Conference, 2010.
R. J. Bayardo Jr. Efficiently mining long patterns from databases. ACM SIGMOD Conference, 1998.
J.-F. Boulicaut, A. Bykowski, and C. Rigotti. Free-sets: A Condensed Representation of Boolean data for the Approximation of Frequency Queries. Data Mining and Knowledge Discovery, 7(1), pp. 5–22, 2003.
G. Buehrer, and K. Chellapilla. A Scalable Pattern Mining Approach to Web Graph Compression with Communities. WSDM Conference, 2009.
T. Calders, and B. Goethals. Mining all non-derivable frequent itemsets, Principles of Knowledge Discovery and Data Mining, 2006.
T. Calders, C. Rigotti, and J. F. Boulicaut. A survey on condensed representations for frequent sets. In Constraint-based mining and inductive databases, pp. 64–80, Springer, 2006.
J. H. Chang, W. S. Lee. Finding Recent Frequent Itemsets Adaptively over Online Data StreamsFinding Recent Frequent Itemsets Adaptively over Online Data Streams. ACM KDD Conference, 2003.
M. Charikar, K. Chen, and M. Farach-Colton. Finding Frequent Items in Data Streams, Automata, Languages and Programming, pp. 693–703, 2002.
M. S. Chen, J. S. Park, and P. S. Yu. Efficient data mining for path traversal patterns, IEEE Transactions on Knowledge and Data Engineering, 10(2), pp. 209–221, 1998.
C. Clifton, M. Kantarcioglu, J. Vaidya, X. Lin, and M. Zhu. Tools for privacy preserving distributed data mining. ACM SIGKDD Explorations Newsletter, 4(2), pp. 28–34, 2002.
E. Cohen. M. Datar, S. Fujiwara, A. Gionis, P. Indyk, R. Motwani, J. Ullman, and C. Yang. Finding Interesting Associations without Support Pruning, IEEE TKDE, 13(1), pp. 64–78, 2001.
G. Cormode, S. Muthukrishnan. What’s hot and what’s not: tracking most frequent items dynamically, ACM TODS, 30(1), pp. 249–278, 2005.
J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. OSDI, pp. 137–150, 2004.
M. Deshpande, M. Kuramochi, N. Wale, and G. Karypis. Frequent substructure-based approaches for classifying chemical compounds. IEEE TKDE., 17(8), pp. 1036–1050, 2005.
A. Evfimievski, R. Srikant, R. Agrawal, and J. Gehrke. Privacy preserving mining of association rules. Information Systems, 29(4), pp. 343–364, 200–4.
M. Garofalakis, R. Rastogi, and K. Shim.: Sequential Pattern Mining with Regular Expression Constraints, VLDB Conference, 1999.
V. Guralnik, and G. Karypis. Parallel tree-projection-based sequence mining algorithms. Parallel Computing, 30(4): pp. 443–472, April 2004.
J. Han, J. Pei, and Y. Yin. Mining Frequent Patterns without Candidate Generation, ACM SIGMOD Conference, 2000.
J. Han, H. Cheng, D. Xin, and X. Yan. Frequent Pattern Mining: Current Status and Future Directions, Data Mining and Knowledge Discovery, 15(1), pp. 55–86, 2007.
J. Han, J. Pei, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. C. Hsu. FreeSpan: frequent pattern-projected sequential pattern mining. ACM KDD Conference, 2000.
J. Han, J. Pei, H. Pinto, B. Mortazavi-Asl, Q. Chen, U. Dayal, and M. C. Hsu. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. ICDE Conference, 2001.
J. Han, J.-G. Lee, H. Gonzalez, X. Li. Mining Massive RFID, Trajectory, and Traffic Data Sets (Tutorial). ACM KDD Conference, 2008. Video of Tutoral Lecture at: http://videolectures.net/kdd08_han_mmrfid/
H. Jeung, M. L. Yiu, X. Zhou, C. Jensen, H. Shen, Discovery of Convoys in Trajectory Databases, VLDB Conference, 2008.
R. Jin, G. Agrawal. Frequent Pattern Mining in Data Streams, Data Streams: Models and Algorithms, pp. 61–84, Springer, 2007.
R. Jin, L. Liu, and C. Aggarwal. Discovering highly reliable subgraphs in uncertain graphs. ACM KDD Conference, 2011.
G. Kuramuchi and G. Karypis. Frequent Subgraph Discovery, ICDM Conference, 2001.
A. R. Leach and V. J. Gillet. An Introduction to Chemoinformatics. Springer, 2003.
W. Lee, S. Stolfo, and P. Chan. Learning Patterns from Unix Execution Traces for Intrusion Detection, AAAI workshop on AI methods in Fraud and Risk Management, 1997.
W. Lee, S. Stolfo, and K. Mok. A Data Mining Framework for Building Intrusion Detection Models, IEEE Symposium on Security and Privacy, 1999.
J.-G. Lee, J. Han, K.-Y. Whang, Trajectory Clustering: A Partition-and-Group Framework, ACM SIGMOD Conference, 2007.
J.-G. Lee, J. Han, X. Li. Trajectory Outlier Detection: A Partition-and-Detect Framework, ICDE Conference, 2008.
J.-G. Lee, J. Han, X. Li, H. Gonzalez. TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1): pp. 1081–1094, 2008.
X. Li, J. Han, and S. Kim. Motion-alert: Automatic Anomaly Detection in Massive Moving Objects, IEEE Conference in Intelligence and Security Informatics, 2006.
X. Li, J. Han, S. Kim and H. Gonzalez. ROAM: Rule- and Motif-based Anomaly Detection in Massive Moving Object Data Sets, SDM Conference, 2007.
Z. Li, B. Ding, J. Han, R. Kays. Swarm: Mining Relaxed Temporal Object Moving Clusters, VLDB Conference, 2010.
C. Liu, X. Yan, H. Lu, J. Han, and P. S. Yu. Mining Behavior Graphs for “backtrace” of non-crashing bugs, SDM Conference, 2005.
B. Liu, W. Hsu, Y. Ma. Integrating Classification and Association Rule Mining, ACM KDD Conference, 1998.
S. Ma, and J. Hellerstein. Mining Partially Periodic Event Patterns with Unknown Periods, IEEE International Conference on Data Engineering, 2001.
H. Mannila, H. Toivonen, and A. I. Verkamo. Discovering Frequent Episodes in Sequences, ACM KDD Conference, 1995.
R. Ng, L. V. S. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. ACM SIGMOD Conference, 1998.
N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Discovering frequent closed itemsets for association rules. International Conference on Database Theory, pp. 398–416, 1999.
J. Pei, and J. Han. Can we push more constraints into frequent pattern mining? ACM KDD Conference, 2000.
J. Pei, J. Han, R. Mao. CLOSET: An Efficient Algorithms for Mining Frequent Closed Itemsets, DMKD Workshop, 2000.
J. Pei, J. Han, H. Lu, S. Nishio, S. Tang, and D. Yang. H-mine: Hyper-structure mining of frequent patterns in large databases. In Data Mining, ICDM Conference, 2001.
J. Pei, J. Han, and L. V. S. Lakshmanan. Mining Frequent Patterns with Convertible Constraints in Large Databases, ICDE Conference, 2001.
J. Pei, J. Han, and W. Wang. Constraint-based Sequential Pattern Mining: The Pattern-Growth Methods, Journal of Intelligent Information Systems, 28(2), pp. 133–160, 2007.
P. Shenoy, J. Haritsa, S. Sudarshan, G. Bhalotia, M. Bawa, D. Shah. Turbo-charging Vertical Mining of Large Databases. ACM SIGMOD Conference, pp. 22–33, 2000.
J. Srivastava, R. Cooley, M. Deshpande, and P. N. Tan. Web usage mining: Discovery and applications of usage patterns from Web data. ACM SIGKDD Explorations Newsletter, 1(2), pp. 12–23, 2000.
Y. Tong, L. Chen, Y. Cheng, P. Yu. Mining Frequent Itemsets over Uncertain Databases. PVLDB, 5(11), pp. 1650–1661, 2012.
V. S. Verykios, A. K. Elmagarmid, E. Bertino, Y. Saygin, and E. Dasseni. Association rule hiding. IEEE Transactions on Knowledge and Data Engineering, pp. 434–447, 16(4), pp. 434–447, 200–4.
J. Vreeken, M. van Leeuwen, and A. Siebes. Krimp: Mining itemsets that compress. Data Mining and Knowledge Discovery, 23(1), pp. 169–214, 2011.
J. Wang, J. Han, and J. Pei. CLOSET+: Searching for the Best strategies for mining frequent closed itemsets. ACM KDD Conference, 2003.
Z. Xing, J. Pei, and E. Keogh. A Brief Survey on Sequence Classification, ACM SIGKDD Explorations, 12(1), 201–0.
X. Yan, P. S. Yu, and J. Han, Graph indexing: A frequent structure-based approach. ACM SIGMOD Conference, 2004.
X. Yan, P. S. Yu, and J. Han. Substructure similarity search in graph databases. ACM SIGMOD Conference, 2005.
X. Yan, F. Zhu, J. Han, and P. S. Yu. Searching substructures with superimposed distance, ICDE Conference, 2006.
M. Zaki. Efficiently mining frequent trees in a forest: Algorithms and applications. IEEE Transactions on Knowledge and Data Engineering, 17(8), pp. 1021–1035, 2005.
M. Zaki, C. Aggarwal. XRules: An Effective Classifier for XML Data, ACM KDD Conference, 2003.
M. Zaki, C. J. Hsiao.: An Efficient Algorithm for Closed Frequent Itemset Mining, SDM Conference, 2002.
S. Zhang, T. Wang. Discovering Frequent Agreement Subtrees from Phylogenetic Data. IEEE Transactions on Knowledge and Data Engineering, 20(1), pp. 68–82, 2008.
Z. Zou, J. Li, H. Gao, and S. Zhang. Mining Frequent Subgraph Patterns from Uncertain Graph Data, IEEE Transactions on Knowledge and Data Engineering, 22(9), pp. 1203–1218, 2010.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Aggarwal, C. (2014). An Introduction to Frequent Pattern Mining. In: Aggarwal, C., Han, J. (eds) Frequent Pattern Mining. Springer, Cham. https://doi.org/10.1007/978-3-319-07821-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-07821-2_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07820-5
Online ISBN: 978-3-319-07821-2
eBook Packages: Computer ScienceComputer Science (R0)