Graph explorer

Node harvest

When choosing a suitable technique for regression and classification with multivariate predictor variables, one is often faced with a tradeoff between interpretability and high predictive accuracy. To give a classical example, classification and regression trees are easy to understand and interpret. Tree ensembles like Random Forests provide usually more accurate predictions. Yet tree ensembles are also more difficult to analyze than single trees and are often criticized, perhaps unfairly, as `black box' predictors. Node harvest is trying to reconcile the two aims of interpretability and predictive accuracy by combining positive aspects of trees and tree ensembles. Results are very sparse and interpretable and predictive accuracy is extremely competitive, especially for low signal-to-noise data. The procedure is simple: an initial set of a few thousand nodes is generated randomly. If a new observation falls into just a single node, its prediction is the mean response of all training observation within this node, identical to a tree-like prediction. A new observation falls typically into several nodes and its prediction is then the weighted average of the mean responses across a

5 nodes7 linksoverview mapNode harvest
5 nodes7 links
Node harvest5 visible / 5 total nodes / 7 links
Related contextRelated contextRelated contextAuthorshipTopic signalTopic signalTopic signalWNode harvestpreprint / 2011ANicolai MeinshausenResearcherTMachine Learning49008 worksTMethodology5119 worksTApplications3567 works
PaperSignal 104 links

Node harvest

preprint / 2011

Open