Deep Generative Bayesian Networks in Machine Learning

Slide 1: Deep Generative Bayesian Network

Slide 2: Summary
1. Neural Networks vs Bayesian Neural Networks
2. Advantages of Bayesian Neural Networks
3. Bayesian Theory
4. Preliminary Results

Slide 3: Section 1. Neural Networks vs Bayesian Neural Networks

Slide 4: Regular Neural Networks
- Fixed weights and biases
- Kinda boring
- One set of features: one result

Slide 5: Bayesian Neural Network
- The weights and biases are Gaussian distributions
- Awesome Monte Carlo sampling: the weights and biases are drawn from these distributions (see the sketch below)
- One set of features: different results possible
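
What drawing the weights from Gaussian distributions can look like in code: a minimal PyTorch sketch of a Bayesian linear layer. The class and parameter names are mine, not the deck's; its own implementation appears only as screenshots on slides 20-21.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class BayesianLinear(nn.Module):
        # A linear layer whose weights and biases are Gaussian distributions
        # instead of fixed tensors. All names here are illustrative.
        def __init__(self, in_features, out_features):
            super().__init__()
            # Variational parameters: one mean and one log-std per weight/bias.
            self.w_mu = nn.Parameter(torch.zeros(out_features, in_features))
            self.w_logstd = nn.Parameter(torch.full((out_features, in_features), -3.0))
            self.b_mu = nn.Parameter(torch.zeros(out_features))
            self.b_logstd = nn.Parameter(torch.full((out_features,), -3.0))

        def forward(self, x):
            # Monte Carlo step: draw a fresh weight sample on every forward pass
            # (reparameterization trick, so the sampling stays differentiable).
            w = self.w_mu + torch.randn_like(self.w_mu) * self.w_logstd.exp()
            b = self.b_mu + torch.randn_like(self.b_mu) * self.b_logstd.exp()
            return F.linear(x, w, b)

Because the weights are resampled on every call, the same input gives different outputs:

    layer = BayesianLinear(2, 1)
    x = torch.randn(5, 2)
    y1, y2 = layer(x), layer(x)   # differ: one set of features, different results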

Slide 6: Section 2. Advantages of Bayesian Neural Networks

Slide 7: Advantages vs Disadvantages
Advantages:
- Robust to small datasets (less prone to overfitting)
- "Conscious" of its uncertainties
- Gives a probability distribution as an output
- Can adapt easily to regular neural network architectures
Disadvantages:
- More computationally demanding
- More difficult to implement
- More complex theory

Slide 8: Section 3. Bayesian Theory

Slide 9: Comparison to Regular Neural Network Theory
Regular theory:
- Minimize the loss: θ* = argmin_θ L(θ) = argmin_θ [-log p(D | θ)]
- Equivalent to maximizing the likelihood: θ* = argmax_θ p(D | θ)
Bayesian theory:
- Calculate the posterior distribution p(θ | D)
- Bayes' rule (exact inference): p(θ | D) = p(D | θ) p(θ) / p(D)

Slide 10: Approximation
- The exact posterior p(θ | D) is intractable, so it is approximated by a parametrized distribution q_φ(θ)
- Kullback-Leibler divergence: KL(q_φ(θ) ‖ p(θ | D)) = E_{q_φ}[log q_φ(θ) - log p(θ | D)]

Slide 11: Loss function
- Evidence Lower BOund (ELBO): ELBO(φ) = E_{q_φ(θ)}[log p(D | θ)] - KL(q_φ(θ) ‖ p(θ))
- Minimizing -ELBO as the loss minimizes the KL divergence to the true posterior (see the derivation below)
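
Why minimizing the negative ELBO works as a loss: the standard variational-inference identity ties it to the KL divergence from slide 10. In LaTeX:

    \begin{aligned}
    \mathrm{KL}\left(q_\phi(\theta) \,\|\, p(\theta \mid D)\right)
      &= \mathbb{E}_{q_\phi}\left[\log q_\phi(\theta) - \log p(\theta \mid D)\right] \\
      &= \mathbb{E}_{q_\phi}\left[\log q_\phi(\theta) - \log p(D \mid \theta) - \log p(\theta)\right] + \log p(D) \\
      &= \log p(D) - \mathrm{ELBO}(\phi)
    \end{aligned}

Since log p(D) does not depend on φ, maximizing the ELBO is the same as minimizing the KL divergence between q_φ(θ) and the true posterior.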

Slide 12: Section 4. Custom dataset

Slide 13: Building custom rings
- Adding noise to the dataset
- The goal is to reproduce this noise (a generation sketch follows below)
(Figure: "True ring" vs. "Noised ring")
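
A minimal NumPy sketch of how such a ring dataset could be built. The function and its parameters (center, radius, width, angle range) are illustrative stand-ins for the features named on slide 14, not the deck's actual code.

    import numpy as np

    def make_ring(n=1000, center=(0.0, 0.0), radius=1.0, width=0.05,
                  angle_range=(0.0, 2 * np.pi), seed=0):
        # Sample points on a ring, then smear the radius with Gaussian noise.
        rng = np.random.default_rng(seed)
        angles = rng.uniform(*angle_range, size=n)
        radii = radius + rng.normal(0.0, width, size=n)   # the noise to reproduce
        x = center[0] + radii * np.cos(angles)
        y = center[1] + radii * np.sin(angles)
        return np.stack([x, y], axis=1)

    true_ring = make_ring(width=0.0)     # noiseless reference
    noised_ring = make_ring(width=0.05)  # training data with radial noise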

Slide 14: The noise distribution
- Depends on the features: mean, width, position, angles
- Make the model fit this distribution

Slide 15: Fitting the noise distribution
- Working first on a simpler distribution
- By minimizing the ELBO loss we can fit the distribution (a minimal training-loop sketch follows below)
(Figure: blue = predicted distribution, orange = true distribution)
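
A minimal sketch of such a training loop, assuming the BayesianLinear layer and noised_ring data sketched above, a squared-error data term, and a standard-normal prior on the weights; the deck's actual model and loss code are shown only as screenshots.

    import torch

    # Assumes BayesianLinear and noised_ring from the earlier sketches.
    model = torch.nn.Sequential(BayesianLinear(2, 64), torch.nn.ReLU(),
                                BayesianLinear(64, 2))
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    data = torch.tensor(noised_ring, dtype=torch.float32)

    def kl_to_prior(module):
        # Closed-form KL(q_phi || N(0, 1)) summed over all Gaussian parameters.
        kl = 0.0
        for layer in module:
            if isinstance(layer, BayesianLinear):
                for mu, logstd in ((layer.w_mu, layer.w_logstd),
                                   (layer.b_mu, layer.b_logstd)):
                    kl = kl + 0.5 * ((2 * logstd).exp() + mu**2 - 1 - 2 * logstd).sum()
        return kl

    for step in range(5000):
        pred = model(data)                  # one Monte Carlo weight sample per step
        recon = ((pred - data) ** 2).sum()  # data-fit term (Gaussian log-likelihood)
        loss = recon + kl_to_prior(model)   # -ELBO = data term + KL(q || prior)
        opt.zero_grad()
        loss.backward()
        opt.step()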

Slide 16: Updates

Slide 17: No convergence
(Figure: blue = predicted distribution, orange = true distribution)

Slide 18: Partial convergence
(Figure: blue = predicted distribution, orange = true distribution)

Slide 19: Real distribution
- Discrete probability distribution
(Figure: blue = predicted distribution, orange = true distribution; a plotting sketch follows below)
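
How a predicted-vs-true comparison like the blue/orange plots could be produced, reusing the model and data from the sketches above; the quantity plotted (here, radial distance) is my guess, purely illustrative.

    import matplotlib.pyplot as plt
    import torch

    with torch.no_grad():
        samples = torch.stack([model(data) for _ in range(100)])  # MC forward passes
    pred_radii = samples.norm(dim=-1).flatten().numpy()
    true_radii = data.norm(dim=-1).numpy()

    # Matplotlib's default color cycle draws the first histogram in blue and
    # the second in orange, matching the slides' legend.
    plt.hist(pred_radii, bins=50, density=True, alpha=0.6, label="predicted")
    plt.hist(true_radii, bins=50, density=True, alpha=0.6, label="true")
    plt.legend()
    plt.show()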

Slide 20: The CODE
(Code shown as a screenshot)

Slide 21: The CODE
(Code shown as a screenshot)

Slide 22: Image loss and KL loss
(Figure: blue = factor switch, orange = kl_factor adjust; a weighting sketch follows below)
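
The slide appears to compare two schedules for weighting the KL loss against the image loss: an abrupt "factor switch" and a gradual "kl_factor adjust". A hypothetical sketch of both; the function name, arguments, and schedules are mine, not the deck's.

    def kl_factor(step, mode="adjust", switch_at=2000, warmup=4000):
        # Weight on the KL term in: loss = image_loss + kl_factor * kl_loss.
        # "switch" turns the KL term on abruptly at a fixed step;
        # "adjust" anneals it in linearly over a warm-up period.
        if mode == "switch":
            return 1.0 if step >= switch_at else 0.0
        return min(1.0, step / warmup)

    # Inside the training loop:
    # loss = image_loss + kl_factor(step, mode="adjust") * kl_loss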

This deck explores the differences between regular neural networks and Bayesian neural networks, the advantages of the latter (including robustness on small datasets and easy adaptation of existing architectures), the Bayesian theory behind these networks, and preliminary results on fitting the noise distribution of a custom ring dataset.

  • Machine Learning
  • Bayesian Networks
  • Neural Networks
  • Bayesian Theory
  • Uncertainties
