Recursive partitioning

From WikiMD.org
Jump to navigation Jump to search

Recursive partitioning

Recursive partitioning (pronunciation: /rɪˈkɜːrsɪv pɑːrˈtɪʃənɪŋ/) is a statistical method used in machine learning and data mining. It involves the division of data into subsets, which are then divided into further subsets in a recursive manner, hence the name.

Etymology

The term "recursive partitioning" is derived from the words "recursive", which means repeating, and "partitioning", which refers to the division of something into parts. The term reflects the method's process of repeatedly dividing data into smaller subsets.

Methodology

Recursive partitioning begins with a data set and a response variable that needs to be predicted. The data set is split into two subsets based on an explanatory variable that maximizes the difference in response variable values between the two resulting subsets. This process is repeated recursively on each subset until a stopping criterion is met.

Types

There are several types of recursive partitioning, including:

  • Decision tree learning: This is a method used for classification and regression. It creates a model that predicts the value of a target variable by learning simple decision rules inferred from the data features.
  • Random forests: This is a method that operates by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes or mean prediction of the individual trees.
  • Boosted trees: This method builds multiple decision trees in a sequential manner, where each tree is built to correct the errors made by the previous one.

Applications

Recursive partitioning is used in various fields, including:

  • Healthcare: It is used to predict patient outcomes based on various factors such as age, gender, and medical history.
  • Finance: It is used to predict stock prices based on historical data.
  • Marketing: It is used to segment customers into different groups based on their purchasing behavior.

Related Terms

  • Data mining: The process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
  • Machine learning: The study of computer algorithms that improve automatically through experience and by the use of data.

External links

Esculaap.svg

This WikiMD dictionary article is a stub. You can help make it a full article.


Languages: - East Asian 中文, 日本, 한국어, South Asian हिन्दी, Urdu, বাংলা, తెలుగు, தமிழ், ಕನ್ನಡ,
Southeast Asian Indonesian, Vietnamese, Thai, မြန်မာဘာသာ, European español, Deutsch, français, русский, português do Brasil, Italian, polski