Learning of Bayesian Network Classifiers and Sum-Product Networks
Learning of Bayesian Network Classifiers and Sum-Product Networks
Disciplines
Computer Sciences (100%)
Keywords
-
Bayesian Networks,
Discrimninative Learning,
Discriminative Structure Learing,
Bayesian Network Classifiers,
Probabilistic Graphical Models,
Sum-Product Networks
Discriminative learning of Bayesian networks (BNs) for classification tasks is often beneficial compared to generative learning. This is particularly true in case of model mismatch, i.e. when the BN cannot represent the true data distribution. In the past, we developed maximum margin parameter learning for Bayesian network classifiers and Gaussian Mixture models. Furthermore, we used the margin objective for approximate and exact structure learning. This research is extended within this proposal. The focus is three-fold: (i) Extension of margin-based parameter learning to a hybrid paradigm merging the advantages of generative and discriminative learning. We aim at extending our learning framework to semi- supervised, missing features, and latent variable scenarios. This requires efficient inference during iterative parameter optimization. Additionally, both the discriminative and hybrid learning approach are introduced to potentially deep sum-product networks (SPNs). They explicitly represent the inference process, i.e. structures (including latent variables) exhibiting computational benefits for inference can be exploited. (ii) Discriminative search-and-score structure learning in BNs is time-consuming. We are interested in approximating the non-decomposable discriminative score by a decomposable surrogate to ease the computational costs for score evaluation in BNs. Furthermore, we aim at developing structure learning algorithms for SPNs introducing a global scoring function with an inference cost penalty. (iii) To consolidate SPNs with respect to empirical performance we will compare all developed models to popular generative and discriminative models from the deep community, i.e. restricted Boltzmann machine, auto-encoders, deep belief networks, multi-layer perceptron. Additionally, one particularly interesting recent deep model generative stochastic networks is considered.
Discriminative learning of probabilistic graphical models (PGM) for classification tasks is often beneficial compared to generative learning. This is particularly true in case of model mismatch, i.e. when the PGM cannot represent the true data distribution. The focus of this research project is three-fold: (i) Extension of discriminative parameter learning to a hybrid paradigm merging the advantages of generative and discriminative learning. Furthermore, we extended our learning framework to semi- supervised and missing features scenarios (ii) We developed structure learning algorithms for sum-product networks (SPNs) a particular type of PGM - to better model data distributions. (iii) We applied the developed models to benchmark data and to single channel source separation (SCSS), and bandwidth extension. Furthermore, we performed an extensive comparison to state-of- the-art deep models.
- Technische Universität Graz - 100%
Research Output
- 383 Citations
- 19 Publications
-
2018
Title Heart Sound SegmentationAn Event Detection Approach Using Deep Recurrent Neural Networks DOI 10.1109/tbme.2018.2843258 Type Journal Article Author Messner E Journal IEEE Transactions on Biomedical Engineering Pages 1964-1974 -
2018
Title Hybrid generative-discriminative training of Gaussian mixture models DOI 10.1016/j.patrec.2018.06.014 Type Journal Article Author Roth W Journal Pattern Recognition Letters Pages 131-137 Link Publication -
2016
Title OPTIMAL CONTROL OF AN ENERGY STORAGE FACILITY UNDER A CHANGING ECONOMIC ENVIRONMENT AND PARTIAL INFORMATION DOI 10.1142/s0219024916500266 Type Journal Article Author Shardin A Journal International Journal of Theoretical and Applied Finance Pages 1650026 Link Publication -
2016
Title An explicit upper bound for |L(1,?)| when ?(2) = 1 and ? is even DOI 10.1142/s1793042116501372 Type Journal Article Author Eddin S Journal International Journal of Number Theory Pages 2299-2315 Link Publication -
2019
Title TRIGGERING SUBORDINATE INNOVATION BEHAVIOR: THE INFLUENCE OF LEADERS’ DARK PERSONALITY TRAITS AND LEVEL 5 LEADERSHIP BEHAVIOR DOI 10.1142/s1363919619500452 Type Journal Article Author Strobl A Journal International Journal of Innovation Management Pages 1950045 -
2019
Title Finite groups with an automorphism inverting, squaring or cubing a non-negligible fraction of elements DOI 10.1142/s0219498819500555 Type Journal Article Author Bors A Journal Journal of Algebra and Its Applications Pages 1950055 Link Publication -
2019
Title Crack problem within the context of implicitly constituted quasi-linear viscoelasticity DOI 10.1142/s0218202519500118 Type Journal Article Author Itou H Journal Mathematical Models and Methods in Applied Sciences Pages 355-372 -
2022
Title Blind Speech Separation and Dereverberation using neural beamforming DOI 10.1016/j.specom.2022.03.004 Type Journal Article Author Pfeifenberger L Journal Speech Communication Pages 29-41 Link Publication -
2017
Title Products of two proportional primes DOI 10.1142/s1793042117501445 Type Journal Article Author Moree P Journal International Journal of Number Theory Pages 2583-2596 Link Publication -
2017
Title Fixed Points of Belief Propagation—An Analysis via Polynomial Homotopy Continuation DOI 10.1109/tpami.2017.2749575 Type Journal Article Author Knoll C Journal IEEE Transactions on Pattern Analysis and Machine Intelligence Pages 2124-2136 Link Publication -
2017
Title Diophantine equations in separated variables and lacunary polynomials DOI 10.1142/s179304211750110x Type Journal Article Author Kreso D Journal International Journal of Number Theory Pages 2055-2074 Link Publication -
2017
Title Respiratory Airflow Estimation from Lung Sounds Based on Regression DOI 10.1109/icassp.2017.7952331 Type Conference Proceeding Abstract Author Messner E Pages 1123-1127 -
2017
Title Emergence of the Quantum from the Classical, Mathematical Aspects of Quantum Processes DOI 10.1142/q0121 Type Book Author De Gosson M Publisher World Scientific Publishing -
2020
Title Multi-channel lung sound classification with convolutional recurrent neural networks DOI 10.1016/j.compbiomed.2020.103831 Type Journal Article Author Messner E Journal Computers in Biology and Medicine Pages 103831 Link Publication -
2015
Title Representation Learning for Single-Channel Source Separation and Bandwidth Extension DOI 10.1109/taslp.2015.2470560 Type Journal Article Author Zöhrer M Journal IEEE/ACM Transactions on Audio, Speech, and Language Processing Pages 2398-2409 -
2015
Title Multi-Channel Speech Processing Architectures for Noise Robust Speech Recognition: 3RD CHiME Challenge Results DOI 10.1109/asru.2015.7404830 Type Conference Proceeding Abstract Author Pfeifenberger L Pages 452-459 -
2018
Title Bayesian Neural Networks with Weight Sharing Using Dirichlet Processes DOI 10.1109/tpami.2018.2884905 Type Journal Article Author Roth W Journal IEEE Transactions on Pattern Analysis and Machine Intelligence Pages 246-252 -
2016
Title On the Latent Variable Interpretation in Sum-Product Networks DOI 10.1109/tpami.2016.2618381 Type Journal Article Author Peharz R Journal IEEE Transactions on Pattern Analysis and Machine Intelligence Pages 2030-2044 Link Publication -
2016
Title Maximum margin hidden Markov models for sequence classification DOI 10.1016/j.patrec.2016.03.017 Type Journal Article Author Mutsam N Journal Pattern Recognition Letters Pages 14-20