Social Media, Data Mining & Machine Learning: Bias in Machine Learning

Bias in Machine Learning

Posted by JoSeK at 4:12 PM . Saturday, June 10, 2006

In statistics, the term bias is used in two different ways:

A biased sample is a statistical sample whose members do not all have the same probability of being chosen.
A biased estimator is an estimator that systematically over- or underestimates the quantity being estimated.
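
The second definition can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the data are drawn from a standard normal distribution (true variance 1), and the function names, sample size, and number of trials are hypothetical choices that do not come from the post. It compares the variance estimator that divides by n with the one that divides by n - 1.

```python
import random


def variance_div_n(xs):
    """Variance estimate that divides by n (biased: too small on average)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


def variance_div_n_minus_1(xs):
    """Variance estimate that divides by n - 1 (unbiased)."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)


def mean_of_estimates(estimator, n=5, trials=20000, seed=0):
    """Average an estimator over many independent samples of size n.

    Assumption for this sketch: samples come from N(0, 1), so the
    quantity being estimated (the true variance) is exactly 1.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        total += estimator(sample)
    return total / trials


# Same seed for both calls, so both estimators see identical samples.
biased_mean = mean_of_estimates(variance_div_n)
unbiased_mean = mean_of_estimates(variance_div_n_minus_1)

print(f"divide by n:     {biased_mean:.3f}")
print(f"divide by n - 1: {unbiased_mean:.3f}")
```

With samples of size 5, the divide-by-n estimator should average close to (n - 1)/n = 0.8 rather than 1, which is what "underestimates the quantity to be estimated" means in practice. The divide-by-(n - 1) estimator should average close to the true value.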

In machine learning, the term bias is closer to the notion of a biased estimator, applied to classifiers. As described in [1], the bias can be expressed as the difference between the target function f(x) and the average of the estimate over training samples, reflecting sensitivity to the target function f(x). The bias represents "how closely on average the estimate is able to approximate the target". The bias directly affects the prediction error, which under squared-error loss decomposes into an irreducible noise term, the squared bias, and the variance of the estimate [2].
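
For concreteness, the bias and the error decomposition can be written out for the squared-error case. The notation is an assumption made for illustration and may differ from that used in [1] and [2]: \hat{f}(x) is the estimate learned from a random training sample T, E_T averages over training samples, and the observed response is y = f(x) + \varepsilon, where \varepsilon is zero-mean noise with variance \sigma^2, independent of T.

```latex
% Bias at a point x: average estimate minus the target
\operatorname{bias}(x) = E_T\big[\hat{f}(x)\big] - f(x)

% Expected squared prediction error = noise + squared bias + variance
E\big[(y - \hat{f}(x))^2\big]
  = \sigma^2
  + \big(E_T[\hat{f}(x)] - f(x)\big)^2
  + E_T\Big[\big(\hat{f}(x) - E_T[\hat{f}(x)]\big)^2\Big]
```

The first term does not depend on the learner, the second is the squared bias, and the third is the variance of the estimate across training samples.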

References
[1] J. H. Friedman, "On bias, variance, 0/1 loss, and the curse-of-dimensionality", Data Mining and Knowledge Discovery, vol. 1, no. 1, pp. 55-77, 1997.
[2] G. M. James, "Variance and Bias for General Loss Functions", Machine Learning, vol. 51, no. 2, pp. 115-135, 2003.


©2006-2008 Social Media, Data Mining & Machine Learning
