An Improved Ensemble of Random Vector Functional Link Networks Based on Particle Swarm Optimization with Double Optimization Strategy

PLoS One. 2016 Nov 11;11(11):e0165803. doi: 10.1371/journal.pone.0165803. eCollection 2016.

Abstract

For ensemble learning, how to select and combine the candidate classifiers are two key issues which influence the performance of the ensemble system dramatically. Random vector functional link networks (RVFL) without direct input-to-output links is one of suitable base-classifiers for ensemble systems because of its fast learning speed, simple structure and good generalization performance. In this paper, to obtain a more compact ensemble system with improved convergence performance, an improved ensemble of RVFL based on attractive and repulsive particle swarm optimization (ARPSO) with double optimization strategy is proposed. In the proposed method, ARPSO is applied to select and combine the candidate RVFL. As for using ARPSO to select the optimal base RVFL, ARPSO considers both the convergence accuracy on the validation data and the diversity of the candidate ensemble system to build the RVFL ensembles. In the process of combining RVFL, the ensemble weights corresponding to the base RVFL are initialized by the minimum norm least-square method and then further optimized by ARPSO. Finally, a few redundant RVFL is pruned, and thus the more compact ensemble of RVFL is obtained. Moreover, in this paper, theoretical analysis and justification on how to prune the base classifiers on classification problem is presented, and a simple and practically feasible strategy for pruning redundant base classifiers on both classification and regression problems is proposed. Since the double optimization is performed on the basis of the single optimization, the ensemble of RVFL built by the proposed method outperforms that built by some single optimization methods. Experiment results on function approximation and classification problems verify that the proposed method could improve its convergence accuracy as well as reduce the complexity of the ensemble system.

MeSH terms

  • Algorithms*
  • Brain Neoplasms / chemistry
  • Diabetes Mellitus / metabolism
  • Humans
  • Lung / chemistry
  • Machine Learning*
  • Neural Networks, Computer*
  • Pattern Recognition, Automated / statistics & numerical data*
  • Wine / analysis

Grants and funding

This work was supported by the National Natural Science Foundation of China (Nos.61271385, 61572241, 61572239, 61520106006). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.