这是我参与8月更文挑战的第18天，活动详情查看：8月更文挑战

Notes of Andrew Ng’s Machine Learning —— (0) Introduction

Welcome

Machine Learning

Examples:

Database mining Large datasets from growth of automation / web. E.g. Web click data, medical records, biology, engineering
Applications can’t program by hand. E.g. Autonomous helicopter, handwriting recognition, most of Natural Language Progressing (NLP), Computer Vision.
Self-customizing programs E.g. Amazon, Netflix product recommendations
Understanding human learning (brain, real AI)

Arthur Samuel (1959) : Machine Learning: Field of study that gives computers the ability to learn without being explicitly programmed.
Tom Mitchell (1998), a more modern definition : Well-posed Learning Problem: A computer program is said to learn from experience E with respect to some task T and some prformance measure P, if its performance on T, as measured by P, improves with experience E.

Example:

Playing checkers.

E = the experience of playing many games of checkers T = the task of playing checkers. P = the probability that the program will win the next game.
Spam Filter

E = Watching you label emails as spam or not spam T = Classifying emails as spam or not spam P = The number (or fraction) of emails correctly classified as spam / not spam.

Others: Reinforcement learning, recommender system

In supervised learning, we are:

In unsupervised learning we can:

approach problems with little or no idea what our results should look like
derive structure from data where we don't necessarily know the effect of the variables.
derive this structure by clustering the data based on relationships among the variables in the data.

With unsupervised learning there is no feedback based on the prediction results.

E.g.

Clustering:
- Google news looks for tens of thousands of news stories and automatically cluster them together. So, the news stories that are all about the same topic get displayed together.
- Take a collection of 1,000,000 different genes, and find a way to automatically group these genes into groups that are somehow similar or related by different variables, such as lifespan, location, roles, and so on.
Non-clustering:

The "Cocktail Party Algorithm", allows you to find structure in a chaotic environment. (i.e. identifying individual voices and music from a mesh of sounds at a cocktail party).