Understanding Convolutional Neural Networks: Architectural Characterizations for Accuracy Inference

undefined

Duc Hoang – Rhodes College

Supervisor: Dr. Gabriel N. Perdue

SIST – Final Presentation

1 August 2019

’

I. MINERvA experiment

a.

Neutrinos and MINERvA detector

b.

Vertex finding and hadron multiplicity problem

8/01/2019

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

IV. Summary & Outlook

III. Inferring CNNs’ accuracy before training time

a.

Architectural characterizations

b.

Predicting CNNs’ accuracy based on characterizations – why

is it useful?

II. Deep Learning

a.

Deep Neural Networks

b.

Convolutional Neural Networks (CNNs)

c.

CNN’s design difficulties

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

•

•

•

8/17/2024

•

•

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

Hadron

Showers

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

Fully-connected architecture

•

•

•

•

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

There is no universal CNN design for every tasks.

•

And designing an appropriate structure/architecture of CNN

takes a lot of time and effort even for the experts.

•

There is no systematic way to design CNNs: mainly rely on

human intuition and random/grid search.

•

Computationally expensive to train a CNN model.

8/17/2024

Presenter | Presentation Title or Meeting Title

I.

’

’

II.

8/17/2024

Examples of architectural

attributes we extracted (32 in

total):



Number of convolutional

layers.



Number of rectified linear

unit (ReLU) activated

convolutional layers.



Average depth

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

Presenter | Presentation Title or Meeting Title

Architectural

characterizations

CNN performance (accuracy)

Machine

Learning models

(classification

and regression)

Important architecture

of CNN for physics task

Interpret the models





’

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

8/17/2024

•

Were also able to extract important features to suggest

architectural changes to CNNs.

More details in paper.

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

’

▶︎

8/17/2024

•

Models fitted on two populations of vertex finding CNNs:

Duc Hoang | Inferring Convolutional Neural Network’s accuracy



Proposed a systematic language to characterize

convolutional neural networks architecture.



Successfully demonstrated that we can use those parameters

to predict whether a network is “good”.



There are limitations to predict the exact accuracy, but initial

results are promising. Extension of the attributes set might

help in the future.



One of the early studies about relationship between CNN’s

architecture and its performance.



More details in our up-coming paper.

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy



Special thanks to my supervisor, Dr. Gabriel N. Perdue.



To members of MINERvA’s machine learning group for their

mentorship and useful conversations:



Anushree



Steven (Oak Ridge)



Nafis



Luis



To members of SIST committee and my mentor group and

other awesome interns:



Laura Fields



Sandra Charles



Judy Nunez

8/17/2024



Alexander Martinez



 Raul Campos



 Matthew Alvarez

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

•

Neural networks and Convolutional Networks visualization:

Source

•

Random Forest:

Source

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

•

Different deep learning

problems require

different network

architecture.

•

However, selecting an

appropriate architecture

for CNNs is usually done

by human intuition or

random search

•

If we have a way to

uniformly characterize a

network architecture,

then it would be

particularly useful.

8/17/2024

Good

for the

task?

Duc Hoang | Inferring Convolutional Neural Network’s accuracy

8/17/2024

Presenter | Presentation Title or Meeting Title

Slide Note

Embed Share

Download Presentation

This presentation by Duc Hoang from Rhodes College explores inferring the accuracy of Convolutional Neural Networks (CNNs) based on their architectural characterizations. The talk covers the MINERvA experiment, deep learning concepts including CNNs, and the significance of predicting CNN accuracy before training. It delves into the challenges of designing CNNs and emphasizes the importance of architectural characterizations in optimizing network performance.

luther Follow

Uploaded on Aug 17, 2024 | 0 Views

Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript

Inferring Convolutional Neural Networks accuracy from its architectural characterizations Duc Hoang Rhodes College Supervisor: Dr. Gabriel N. Perdue SIST Final Presentation 1 August 2019

Outline I. MINERvA experiment a. Neutrinos and MINERvA detector b. Vertex finding and hadron multiplicity problem II. Deep Learning a. Deep Neural Networks b. Convolutional Neural Networks (CNNs) c. CNN s design difficulties III. Inferring CNNs accuracy before training time a. Architectural characterizations b. Predicting CNNs accuracy based on characterizations why is it useful? IV. Summary & Outlook 8/01/2019 Duc Hoang | Inferring Convolutional Neural Network s accuracy 2

Neutrinos Neutrinos are fundamental. They are electrically neutral "partners" of the familiar charged leptons (e.g., electrons). They are very light, They very rarely interactwith other particles Proton UU D Quarks Protons Neutrons Electrons 3 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

MINERvA Nuclear effects with a variety of target materials ranging from Helium to Lead. Fine-grained resolution for excellent kinematic measurements. Target 5 Target 4 Water Target Target 3 Target 2 Target 1 He Target 4 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Vertex Finding and Hadron Multiplicity problem strip number Hadron Showers True vertex plane number 5 meters 8/17/2024 5 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Deep Neural Networks Fully-connected architecture Each input multiplied by a weight. Weighted values are summed, Bias is added. Non-linear activation function is applied Trained by varying the parameters to minimize a loss function (quantifies how many mistakes the network makes) 8/17/2024 6 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Convolutional Neural Networks (CNNs) Similar concept to Deep Neural Networks, but highly effective for image inputs, and modern neutrino detectors are imaging detectors. U X V 7 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Difficulty There is no universal CNN design for every tasks. And designing an appropriate structure/architecture of CNN takes a lot of time and effort even for the experts. There is no systematic way to design CNNs: mainly rely on human intuition and random/grid search. Computationally expensive to train a CNN model. 8/17/2024 Presenter | Presentation Title or Meeting Title 8

Objectives Examples of architectural attributes we extracted (32 in total): Number of convolutional layers. Number of rectified linear unit (ReLU) activated convolutional layers. Average depth I. Propose a systematic language to characterize CNN s architecture, and demonstrate that they can be predictive of a CNN s accuracy. II. Suggest architectural changes to CNNs for different physics tasks (vertex finding and hadron multiplicity) 8/17/2024 9 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Method Important architecture of CNN for physics task Interpret the models Machine Learning models (classification and regression) Architectural characterizations CNN performance (accuracy) 10 8/17/2024 Presenter | Presentation Title or Meeting Title

Classification Divide data set into broken and healthy networks. Healthy Broken Use Random Forest and Extremely Randomized Tree to predict each CNN s class. 11 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Classification results Machine Learning models perform significantly better than random guessing (50% when there is no class overflow): Average 5-fold cross- validation scores Model Accuracy on test set 70.3 0.006 % 70.6 % Random Forest 70.2 0.003 % 70.5 % Extremely Randomized Tree Were also able to extract important features to suggest architectural changes to CNNs. More details in paper. 12 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Regression on just healthy networks Fitted an non-linear Ordinary Least Square model on just the healthy networks. 8/17/2024 13 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Regression results Models fitted on two populations of vertex finding CNNs: ?? of OLS Population Number of Healthy CNNs First 0.426 49276 Second 0.295 21415 Combined 0.961 70691 Limitations: Still not enough parameters to characterize the detailed relationship between CNN s architecture and its performance. Planning to extend attribute set in the future. 14 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Outlook & Summary Proposed a systematic language to characterize convolutional neural networks architecture. Successfully demonstrated that we can use those parameters to predict whether a network is good . There are limitations to predict the exact accuracy, but initial results are promising. Extension of the attributes set might help in the future. One of the early studies about relationship between CNN s architecture and its performance. More details in our up-coming paper. 15 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Acknowledgements Special thanks to my supervisor, Dr. Gabriel N. Perdue. To members of MINERvA s machine learning group for their mentorship and useful conversations: Anushree Steven (Oak Ridge) Nafis Luis To members of SIST committee and my mentor group and other awesome interns: Laura Fields Sandra Charles Judy Nunez Alexander Martinez Raul Campos Matthew Alvarez 8/17/2024 16 Duc Hoang | Inferring Convolutional Neural Network s accuracy

References Neural networks and Convolutional Networks visualization: Source Random Forest: Source 8/17/2024 17 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Back-ups 18 8/17/2024 Duc Hoang | Inferring Convolutional Neural Network s accuracy

K-folds cross validation 8/17/2024 19 Duc Hoang | Inferring Convolutional Neural Network s accuracy

Architectural characterizations of CNNs Different deep learning problems require different network architecture. However, selecting an appropriate architecture for CNNs is usually done by human intuition or random search If we have a way to uniformly characterize a network architecture, then it would be particularly useful. Good for the task? ? 8/17/2024 20 Duc Hoang | Inferring Convolutional Neural Network s accuracy