Learning from large data: bias, variance, sampling, and learning curves
thesis
posted on 2003-01-01, 00:00, authored by Damien Brain

Commercial organisations demand value from data collection and customer identity tracking schemes such as "Fly Buys". This dissertation shows that many commonly used data mining techniques cannot function with the very large data sets that result from such approaches to data collection, and it proposes new approaches to algorithm design for mining such data sets.