TY - GEN
T1 - Investigation on the construction of the Relevance Vector Machine based on cross entropy minimization
AU - Liu, Xiaofang
AU - Li, Ruikang
AU - Cheng, Dansong
AU - Cheng, Kai
N1 - Publisher Copyright:
© 2016 Chinese Automation and Computing Society.
PY - 2016/10/20
Y1 - 2016/10/20
N2 - As a machine learning method under sparse Bayesian framework, classical Relevance Vector Machine (RVM) applies kernel methods to construct Radial Basis Function(RBF) networks using a least number of relevant basis functions. Compared to the well-known Support Vector Machine (SVM), the RVM provides a better sparsity, and an automatic estimation of hyperparameters. However, the performance of the original RVM purely depends on the smoothness of the presumed prior of the connection weights and parameters. Consequently, the sparsity is actually still controlled by the selection of kernel functions or kernel parameters. This may lead to severe underfitting or overfitting in some cases. In the research presented in this paper, we explicitly involve the number of basis functions into the objective of the optimization procedure, and construct the RVM by the minimization of the cross entropy between the 'hypothetical' probability distribution in the forward training pathway and the 'true' probability distribution in the backward testing pathway. The experimental results have shown that our proposed methodology can achieve both the least complexity of structure and goodness of appropriate fit to data.
AB - As a machine learning method under sparse Bayesian framework, classical Relevance Vector Machine (RVM) applies kernel methods to construct Radial Basis Function(RBF) networks using a least number of relevant basis functions. Compared to the well-known Support Vector Machine (SVM), the RVM provides a better sparsity, and an automatic estimation of hyperparameters. However, the performance of the original RVM purely depends on the smoothness of the presumed prior of the connection weights and parameters. Consequently, the sparsity is actually still controlled by the selection of kernel functions or kernel parameters. This may lead to severe underfitting or overfitting in some cases. In the research presented in this paper, we explicitly involve the number of basis functions into the objective of the optimization procedure, and construct the RVM by the minimization of the cross entropy between the 'hypothetical' probability distribution in the forward training pathway and the 'true' probability distribution in the backward testing pathway. The experimental results have shown that our proposed methodology can achieve both the least complexity of structure and goodness of appropriate fit to data.
KW - Bayesian inference
KW - Cross Entropy Minimization
KW - Radial Basis Function (RBF) Network
KW - Relevance vector machine (RVM)
UR - https://www.scopus.com/pages/publications/84999018363
U2 - 10.1109/IConAC.2016.7604924
DO - 10.1109/IConAC.2016.7604924
M3 - 会议稿件
AN - SCOPUS:84999018363
T3 - 2016 22nd International Conference on Automation and Computing, ICAC 2016: Tackling the New Challenges in Automation and Computing
SP - 233
EP - 237
BT - 2016 22nd International Conference on Automation and Computing, ICAC 2016
A2 - Wang, Jing
A2 - Xu, Zhijie
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd International Conference on Automation and Computing, ICAC 2016
Y2 - 7 September 2016 through 8 September 2016
ER -