MindMap Gallery Machine Learning (Xigua Book) Chapter 1 Mind Map
The mind map of Chapter 1 of Machine Learning (Xigua Book) is dedicated to studying how to use experience to improve the performance of the system itself through calculation methods. Updated from time to time, welcome to like and collect~
Edited at 2024-03-03 15:59:34This is a mind map about bacteria, and its main contents include: overview, morphology, types, structure, reproduction, distribution, application, and expansion. The summary is comprehensive and meticulous, suitable as review materials.
This is a mind map about plant asexual reproduction, and its main contents include: concept, spore reproduction, vegetative reproduction, tissue culture, and buds. The summary is comprehensive and meticulous, suitable as review materials.
This is a mind map about the reproductive development of animals, and its main contents include: insects, frogs, birds, sexual reproduction, and asexual reproduction. The summary is comprehensive and meticulous, suitable as review materials.
This is a mind map about bacteria, and its main contents include: overview, morphology, types, structure, reproduction, distribution, application, and expansion. The summary is comprehensive and meticulous, suitable as review materials.
This is a mind map about plant asexual reproduction, and its main contents include: concept, spore reproduction, vegetative reproduction, tissue culture, and buds. The summary is comprehensive and meticulous, suitable as review materials.
This is a mind map about the reproductive development of animals, and its main contents include: insects, frogs, birds, sexual reproduction, and asexual reproduction. The summary is comprehensive and meticulous, suitable as review materials.
Machine Learning Chapter 1
inductive preference
Inductive preference The preference for a certain type of hypothesis in the learning process is called inductive preference, which can be understood as the value in machine learning.
General principles
Ocam's Razor: If there are multiple hypotheses that are consistent with observations, choose the simplest one.
NFL theorem
Important premise of NFL theorem
development path
reasoning period
knowledge period
learning period
Application status
One of the most active research branches in the computer field
Closely related to ordinary people’s lives
Affect the political life of human society
With natural science exploration color
Reading material
introduction
Committed to studying how to use computing methods and experience to improve the performance of the system itself.
basic terminology
"data set"
This set of records
"instance" or "sample"
Each record is a description of an event or object
"attribute" or "characteristic"
Matters that reflect the performance or nature of an event or object in some aspect
"Attribute value"
Values on attributes, such as "green" and "black"
"attribute space" (attribute space), "sample space" (sample space) or "input space"
The space spanned by attributes. For example: a three-dimensional space describing a watermelon
"Feature vector"
Since each point in space corresponds to a coordinate vector, we also call an example an "Feature vector"
"dimensionality".
Generally, let D={x1,x2,..,xm} represent a data set containing m examples, each example is described by d attributes (for example, the watermelon data above uses 3 attributes), then each example xi=(xi1;xi2;...xid) is a vector in the d-dimensional sample space The value on the attribute is "stiffness"), and d is called the "dimensionality" of sample xi.
"learning" or "training"
The process of learning a model from data is called "learning" or "training". This process is completed by executing a learning algorithm.
"training data" or "training examples" or "training examples"
Data used during training
"Training samples"
Each sample is called a "training sample"
"Training set"
A set of training samples
learned model
The learned model corresponds to a certain potential law about the data, so it is also called a "hypothesis"; this potential law itself is called "truth" or "reality", and the learning process is to find out or approach the truth.
Generally, (xi, yi) is used to represent the i-th example, where yi∈Y is the label of example xi, and Y is the set of all labels, also called "label space" or "output space".
Learning tasks
"Supervised learning"
"Classification"
Predicts discrete values
For “dichotomy” involving only two categories "class" tasks
One of the classes is usually called the "positive class"
The other class is the "anti-class", also known as the "negative class"
"Multiple classification" tasks
Involves multiple categories
"return"
Predicts continuous values
"Unsupervised learning"
"clustering"
"Semi-supervised learning"
When there are both labeled training samples and unlabeled training samples Learning algorithms.
Generalization
The goal of machine learning is to make the learned model well applicable to "new models". ", rather than just the training set, we call the model's ability to adapt to new samples Strength is the ability of generalization.
hypothesis space
"induction"
Induction, generalization from the specific to the general process.
"Learning from examples" is an inductive process, so it is also called "Inductive Learning"
narrow sense
broad sense
"deduction"
deduction, the process from general to "specialization". Machine learning is essentially inductive learning.
hypothesis space
A hypothesis formed from all the values that the attribute can take.
version space
A "set of hypotheses" consistent with the training set.