Classification

Super RaSE: Super Random Subspace Ensemble Classification

We propose a new ensemble classification algorithm, named Super Random Subspace Ensemble (Super RaSE), to tackle the sparse classification problem. The proposed algorithm is motivated by the Random Subspace Ensemble algorithm (RaSE). The RaSE method …

Imbalanced classification: A paradigm-based review

A common issue for classification in scientific research and industry is the existence of imbalanced classes. When sample sizes of different classes are imbalanced in training data, naively implementing a classification method often leads to …

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension reduction under the ultra-high dimensional setting. Most existing screening methods are designed to rank the predictors according to their individual contributions to the …

RaSE: Random Subspace Ensemble Classification

We propose a new model-free ensemble classification framework, Random Subspace Ensemble (RaSE), for sparse classification. In the RaSE algorithm, we aggregate many weak learners, where each weak learner is a base classifier trained in a subspace …

RaSEn R package

References: Tian, Y. and Feng, Y. (2021). RaSE: Random Subspace Ensemble Classification. Journal of Machine Learning Research. Tian, Y. and Feng, Y. (2021). RaSE: A variable screening framework via random subspace ensembles. Journal of the American Statistical Association. Zhu, J. and Feng, Y. (2021). Super RaSE: Super Random Subspace Ensemble Classification. Journal of Risk and Financial Management. R package. downloads downloads 10K 10K A demo

Neyman-Pearson classification algorithms and NP receiver operating characteristics

In many binary classification applications, such as disease diagnosis and spam detection, practitioners commonly face the need to limit type I error (that is, the conditional probability of misclassifying a class 0 observation as class 1) so that it …

JDINAC: joint density-based non-parametric differential interaction network analysis and classification using high-dimensional sparse omics data

Motivation A complex disease is usually driven by a number of genes interwoven into networks, rather than a single gene product. Network comparison or differential network analysis has become an important means of revealing the underlying mechanism …

A survey on Neyman-Pearson classification and suggestions for future research

In statistics and machine learning, classification studies how to automatically learn to make good qualitative predictions (i.e., assign class labels) based on past observations. Examples of classification problems include email spam filtering, fraud …

Feature Augmentation via Nonparametrics and Selection (FANS) in High Dimensional Classification

We propose a high-dimensional classification method that involves nonparametric feature augmentation. Knowing that marginal density ratios are the most powerful univariate classifiers, we use the ratio estimates to transform the original feature …

Neyman-Pearson Classification under High-Dimensional Settings

Most existing binary classification methods target on the optimization of the overall classification risk and may fail to serve some real-world applications such as cancer diagnosis, where users are more concerned with the risk of misclassifying one …