MindMap Gallery Data Mining Principles and Algorithms (3rd Edition) Chapter 1 Introduction
This is a mind map about the introduction to the first chapter of Data Mining Principles and Algorithms (Third Edition), which mainly includes: 1.1 The emergence and development of data mining; 1.2 The development trend of data mining research; 1.3 Data mining concepts; 1.4 Data Classification issues of mining technology; 1.5 Commonly used knowledge representation patterns and methods in data mining; 1.6 Data mining issues under different data storage forms; 1.7 Rough set method and its application in data mining; 1.8 Application analysis of data mining; 1.9 This chapter Summary and literature notes.
Edited at 2021-10-02 16:17:26Avatar 3 centers on the Sully family, showcasing the internal rift caused by the sacrifice of their eldest son, and their alliance with other tribes on Pandora against the external conflict of the Ashbringers, who adhere to the philosophy of fire and are allied with humans. It explores the grand themes of family, faith, and survival.
This article discusses the Easter eggs and homages in Zootopia 2 that you may have discovered. The main content includes: character and archetype Easter eggs, cinematic universe crossover Easter eggs, animal ecology and behavior references, symbol and metaphor Easter eggs, social satire and brand allusions, and emotional storylines and sequel foreshadowing.
[Zootopia Character Relationship Chart] The idealistic rabbit police officer Judy and the cynical fox conman Nick form a charmingly contrasting duo, rising from street hustlers to become Zootopia police officers!
Avatar 3 centers on the Sully family, showcasing the internal rift caused by the sacrifice of their eldest son, and their alliance with other tribes on Pandora against the external conflict of the Ashbringers, who adhere to the philosophy of fire and are allied with humans. It explores the grand themes of family, faith, and survival.
This article discusses the Easter eggs and homages in Zootopia 2 that you may have discovered. The main content includes: character and archetype Easter eggs, cinematic universe crossover Easter eggs, animal ecology and behavior references, symbol and metaphor Easter eggs, social satire and brand allusions, and emotional storylines and sequel foreshadowing.
[Zootopia Character Relationship Chart] The idealistic rabbit police officer Judy and the cynical fox conman Nick form a charmingly contrasting duo, rising from street hustlers to become Zootopia police officers!
Chapter One Introduction
1.1 The emergence and development of data mining
1.1.1 Business demand analysis of data mining technology
1. The main reasons why data mining has aroused the demand of experts, scholars and commercial manufacturers: the widespread use of large-scale data systems and the urgent need to convert data into useful knowledge.
2. Data mining technology changes with the development of the times.
3. Data, information, and knowledge can be regarded as different forms of generalized data expression.
1.1.2 Technical background generated by data mining
1. Technical basis: the development of computers and related technologies.
2. Technical background arising from data mining: the development of information technologies such as databases, data warehouses and the Internet, the improvement of computer performance and the development of advanced architectures, and the research and application of statistics, artificial intelligence and other methods in data analysis.
1.1.3 Analysis of data mining technology requirements in the big data era
1. The development of big data research can be roughly divided into three stages: 2000 and before, called the "budding stage of the big data concept", 2001-2010, called the "big data concept exploration stage", 2011 and later , the concept of big data has been further deepened, has become the focus of academic research, and has become a concept supported by many applications.
2. The four perspectives of the current big data concept: data theory, methodology, environment theory, and attribute theory.
Based on attribute theory, it shows that data mining technology is the core of big data analysis. New challenging technical methods that need to be strengthened include: large-volume data analysis requires data mining technology to support, and high-speed aggregation of big data proposes new methods for data mining. Challenging topics and various types of big data need to be supported by the development of relevant research branches of data mining. Big data with huge value is the research target of data mining technology.
1.2 Development Trend of Data Mining Research
Data mining needs to focus on several major aspects: the smooth integration of data mining technology and specific business logic, the applicability of data mining technology to specific data storage types, the selection and standardization of large data, the architecture and design of data mining systems Interactive mining technology, visualization issues of data mining languages and systems, data mining theory and algorithm research.
1.3 Data mining concepts
1.3.1 Data mining technology from a business perspective
Data mining is essentially a new business information processing technology
1.3.2 Technical meaning of data mining
1. “Knowledge Discovery” (KDD) in Databases
2. Data Mining and Knowledge Discovery (DMKD)
3. The meaning of data mining technology: KDD is regarded as a special case of data mining. Data mining is a step in the KDD process. KDD has the same meaning as Data Mining.
1.1.3 Theoretical basis of data mining research
Some important theoretical frameworks: pattern discovery framework, rule discovery framework, based on probability and statistical theory, microeconomic perspective, based on data compression theory, based on inductive database theory, visual data mining.
1.4 Classification issues of data mining technology
Classification according to mining tasks
Classification according to mining objects
Classification based on mining methods
Classification based on the knowledge that can be discovered through data mining
1.5 Commonly used knowledge representation models and methods in data mining
1.5.1 General knowledge mining
1. Definition: Generalized knowledge mining refers to generalized knowledge that describes category characteristics.
2. Concept description method: essentially summarizing the connotative characteristics of certain objects. Concept description is divided into characteristic description and distinctive description.
3. Multidimensional data analysis can be regarded as an effective method for generalized knowledge mining.
4. Multi-level concept description problems (currently using more concept layering methods): collection grouping layering, rule-based layering, operation derived layering, and pattern layering.
1.5.2 Related knowledge mining
1. Definition: Relevant knowledge reflects the dependence or association between an event and other events.
2. Purpose: Find hidden information in the database.
3. Type: simple association, temporal association, causal association, quantitative association.
4. Broadly speaking, correlation analysis is the essence of data mining.
5. Association rule mining is the most commonly used method for association knowledge discovery.
1.5.3 type of knowledge mining
1. Definition: Class knowledge describes a class of things that have common characteristics in a certain sense and are clearly distinguished from things of different classes.
2. Two basic methods for mining class knowledge
Categories: Decision trees, Bayesian classification, neural networks, genetic algorithms and evolutionary theory, analogical learning, others.
Hierarchy-based clustering method clustering: partition-based clustering method, density-based clustering method, grid-based clustering method, model-based clustering method.
1.5.4 Predictive knowledge mining
Definition: Predictive knowledge mining refers to knowledge generated from historical and current data and able to predict future data trends.
Modes: trend prediction mode, cycle analysis mode, sequence mode, neural network.
1.5.5 Specific knowledge mining
The following questions can help you understand the tasks and methods of specific knowledge mining: isolated point analysis, sequence anomaly analysis, and discovery of specific rules.
1.6 Data mining issues under different data storage forms
1.6.1 Data mining in transaction database: A transaction database is a collection of transaction data.
1.6.2 Data mining in relational databases
Some problems currently being studied: multi-dimensional knowledge mining problems, multi-table mining and quantity mining problems, multi-layer knowledge mining problems, knowledge evaluation problems, and constrained data mining problems.
1.6.3 Data mining in data warehouse: 0LAM, OLAP.
1.6.4 Develop data mining in fifteen new databases based on relational models
1.6.5 Data mining in new application-oriented data sources
Data types of data mining: structured data mining, unstructured data mining, semi-structured data mining.
Wed mining must face some key issues: heterogeneous data source environment, semi-structured data structure, and dynamically changing language environment.
There are three schools of research on Wed mining: Wed structure mining, Wed usage mining, and Wed content mining.
1.6.6 Data mining in Web data sources
1.7 Rough set method and its application in data mining
1.7.1 Some important concepts of rough sets
Rough set theory is a teaching tool for studying imprecise and uncertain knowledge. Rough sets abstract the objective world into an information system.
Reduction: Minimal set of attributes.
1.7.2 Examples of rough set application
1.7.3 Application scope of rough set method in KDD: rule learning and decision table derivation, knowledge reduction, attribute-related analysis, and data preprocessing.
1.8 Application analysis of data mining
1.8.1 Data mining and CRM: Data mining can help companies determine customer characteristics and enable companies to provide targeted services to customers.
1.8.2 Data mining and social networks: Many examples illustrate the integration and development trend of the two.
1.8.3 Analysis of successful cases of data mining applications
Application of data mining in sports competitions
Data mining applied to commercial banks: financial investment, fraud screening.
Data mining applied to telecommunications
Data mining applied to scientific exploration
Application of data mining in information security
1.9 Summary of this chapter and literature notes