Structural minimax probability machine

Bin Gu; Xingming Sun; Victor S. Sheng

doi:10.1109/TNNLS.2016.2544779

Structural minimax probability machine

Bin Gu, Xingming Sun, Victor S. Sheng

Computer Science

Research output: Contribution to journal › Article › peer-review

261 Scopus citations

Abstract

Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi-min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.

Original language	English
Article number	7452660
Pages (from-to)	1646-1656
Number of pages	11
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	28
Issue number	7
DOIs	https://doi.org/10.1109/TNNLS.2016.2544779
State	Published - Jul 2017

Keywords

Bayes learning
finite mixture models
kernel methods
second-order cone programming (SOCP)
structural learning

Access to Document

10.1109/TNNLS.2016.2544779

Cite this

@article{55cf4d2e80b9485ab76b371efdc61310,

title = "Structural minimax probability machine",

abstract = "Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi-min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.",

keywords = "Bayes learning, finite mixture models, kernel methods, second-order cone programming (SOCP), structural learning",

author = "Bin Gu and Xingming Sun and Sheng, {Victor S.}",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2017",

month = jul,

doi = "10.1109/TNNLS.2016.2544779",

language = "English",

volume = "28",

pages = "1646--1656",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

number = "7",

}

TY - JOUR

T1 - Structural minimax probability machine

AU - Gu, Bin

AU - Sun, Xingming

AU - Sheng, Victor S.

PY - 2017/7

Y1 - 2017/7

N2 - Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi-min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.

AB - Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi-min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.

KW - Bayes learning

KW - finite mixture models

KW - kernel methods

KW - second-order cone programming (SOCP)

KW - structural learning

UR - http://www.scopus.com/inward/record.url?scp=84963962002&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2016.2544779

DO - 10.1109/TNNLS.2016.2544779

M3 - Article

C2 - 27101618

AN - SCOPUS:84963962002

SN - 2162-237X

VL - 28

SP - 1646

EP - 1656

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 7

M1 - 7452660

ER -

Structural minimax probability machine

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this