Week 1 Introduction
Week 1 Introduction
What is Machine Learning?
Tom Mitchell provides a more modern definition: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”(对某一任务表现水平随不断经历某种过程而提高)
Example: playing checkers.
E = the experience of playing many games of checkers
T = the task of playing checkers.
P = the probability that the program will win the next game.
In general, any machine learning problem can be assigned to one of two broad classifications:
Types
Supervised Learning
对于给定的数据,已知其正确结果的形态和输入和输出的关系。
分类:
“regression”回归:连续型(广义,不一定绝对连续,如销量)
”classification”分类:离散型(可能性通常很少,如0-1)
example:
(a) Regression - Given a picture of a person, we have to predict their age on the basis of the given picture
(b) Classification - Given a patient with a tumor, we have to predict whether the tumor is malignant or benign.
Unsupervised Learning
未知结果。可以在不知道中间的各种变量的意义的情况下,根据变量的结构、关系做出推断。
Example:
Clustering: Take a collection of 1,000,000 different genes, and find a way to automatically group these genes into groups that are somehow similar or related by different variables, such as lifespan, location, roles, and so on.
Non-clustering: The “Cocktail Party Algorithm”, allows you to find structure in a chaotic environment. (i.e. identifying individual voices and music from a mesh of sounds at a cocktail party).
Model and Cost Function
Model Representation
代价方程,又称平方误差方程,是衡量预测方程和真实值之间误差的函数。