Results 1 to 10 of about 796,511 (318)

Feature Selection with the Boruta Package

open access: yesJournal of Statistical Software, 2010
This article describes a R package Boruta, implementing a novel feature selection algorithm for finding emph{all relevant variables}. The algorithm is designed as a wrapper around a Random Forest classification algorithm.
Miron B. Kursa, Witold R. Rudnicki
doaj   +1 more source

Agnostic Feature Selection [PDF]

open access: yes, 2020
Unsupervised feature selection is mostly assessed along a supervised learning setting, depending on whether the selected features efficiently permit to predict the (unknown) target variable. Another setting is proposed in this paper: the selected features aim to efficiently recover the whole dataset.
Doquet, Guillaume Florent   +1 more
openaire   +2 more sources

Online Feature Selection with Streaming Features [PDF]

open access: yesIEEE Transactions on Pattern Analysis and Machine Intelligence, 2013
We propose a new online feature selection framework for applications with streaming features where the knowledge of the full feature space is unknown in advance. We define streaming features as features that flow in one by one over time whereas the number of training examples remains fixed.
Xindong Wu 0001   +4 more
openaire   +2 more sources

Hybrid-Recursive Feature Elimination for Efficient Feature Selection

open access: yesApplied Sciences, 2020
As datasets continue to increase in size, it is important to select the optimal feature subset from the original dataset to obtain the best performance in machine learning tasks.
Hyelynn Jeon, Sejong Oh
doaj   +1 more source

Feature Selection [PDF]

open access: yesACM Computing Surveys, 2017
Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data-mining and machine-learning problems. The objectives of feature selection include building simpler and more comprehensible models, improving data-mining performance, and preparing clean,
Jundong Li   +6 more
openaire   +2 more sources

Biogeography-based optimization for feature selection

open access: yesProceedings of the International Florida Artificial Intelligence Research Society Conference, 2023
Data clustering has many applications in medical sciences, banking, and data mining. K-means is the most popular data clustering algorithm due to its efficiency and simplicity of implementation. However, K-means has some limitations, which may affect its
Mandana Gholami   +2 more
doaj   +1 more source

Ontology-Based Feature Selection: A Survey

open access: yesFuture Internet, 2021
The Semantic Web emerged as an extension to the traditional Web, adding meaning (semantics) to a distributed Web of structured and linked information.
Konstantinos Sikelis   +2 more
doaj   +1 more source

Redundancy Is Not Necessarily Detrimental in Classification Problems

open access: yesMathematics, 2021
In feature selection, redundancy is one of the major concerns since the removal of redundancy in data is connected with dimensionality reduction. Despite the evidence of such a connection, few works present theoretical studies regarding redundancy.
Sebastián Alberto Grillo   +9 more
doaj   +1 more source

Feature Selection Embedded Robust K-Means

open access: yesIEEE Access, 2020
Clustering is one of the most important unsupervised learning problems in machine learning. As one of the most widely used clustering algorithms, K-means has been studied extensively.
Qian Zhang, Chong Peng
doaj   +1 more source

Nested ensemble selection: An effective hybrid feature selection method

open access: yesHeliyon, 2023
It has been shown that while feature selection algorithms are able to distinguish between relevant and irrelevant features, they fail to differentiate between relevant and redundant and correlated features.
Firuz Kamalov   +4 more
doaj   +1 more source

Home - About - Disclaimer - Privacy