What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

Pasha Khosravi, Yitao Liang, YooJung Choi and Guy Van den Broeck
Computer Science Department, UCLA

August 15, 2019

Motivation

Train a classifier (ex. Logistic Regression) on complete data, then predict on test samples with missing features.

Common Approaches

The common approach is to fill in the missing features, i.e., to do imputation.
Simple imputations make unrealistic assumptions (mean, median, etc.).
More sophisticated methods such as MICE don't scale to bigger problems (and also make assumptions).
We want a more principled way of dealing with missing features.

Generative vs Discriminative Models

Notation:
Generative model: defines a joint distribution over features and class, P(X, Y). Example: Naïve Bayes.
Discriminative model: defines the class conditional distribution, P(Y | X). Example: Logistic Regression.

Generative vs Discriminative Models

(Comparison figure: Generative Models, ex. Naïve Bayes, vs. Discriminative Models, ex. Logistic Regression, along two axes: classification accuracy and missing features.)
Discriminative models typically classify more accurately; generative models can handle missing features by marginalizing them out.

Generative vs Discriminative Models

What if we train two models:
1. A generative model for the feature distribution P(X).
2. A discriminative model for the classifier F(x) = P(Y | x).

Now what? How do we make predictions using the two models?

Expected Prediction

Expected Prediction is a principled way to reason about the outcome of a classifier F(x) with respect to a feature distribution P(X).

M: missing features
y: observed features

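The formula on this slide did not survive extraction; as a reconstruction from the definitions above (not verbatim from the slide), the expected prediction can be written in LaTeX as:

    % Expected prediction: average the classifier F over completions m of
    % the missing features M, weighted by the feature distribution given y.
    \mathbb{E}_{m \sim P(M \mid y)}\big[\, F(m, y) \,\big]
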
Expected Prediction Intuition

Imputation techniques: replace the missing-ness uncertainty with one or multiple possible inputs, and evaluate the model on them.
Expected Prediction: consider all possible inputs and compute a weighted average of the evaluations.

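To make the contrast concrete, here is a minimal sketch (my own illustration, not the authors' code; all numbers are made up, and the feature distribution is assumed fully factorized over binary features rather than a full Naïve Bayes model):

    import itertools
    import math

    weights = [1.5, -2.0, 0.7]   # made-up logistic regression weights
    bias = -0.2
    p_feature = [0.3, 0.6, 0.5]  # made-up P(X_i = 1), independent features

    def predict(x):
        """Logistic regression prediction for a complete input x."""
        z = bias + sum(w * xi for w, xi in zip(weights, x))
        return 1.0 / (1.0 + math.exp(-z))

    def mean_imputation(observed):
        """Fill each missing feature (None) with its mean, evaluate once."""
        filled = [p_feature[i] if v is None else v
                  for i, v in enumerate(observed)]
        return predict(filled)

    def expected_prediction(observed):
        """Average predictions over all completions, weighted by P."""
        missing = [i for i, v in enumerate(observed) if v is None]
        total = 0.0
        for bits in itertools.product([0, 1], repeat=len(missing)):
            x = list(observed)
            prob = 1.0
            for i, b in zip(missing, bits):
                x[i] = b
                prob *= p_feature[i] if b == 1 else 1.0 - p_feature[i]
            total += prob * predict(x)
        return total

    obs = [1, None, None]  # feature 0 observed, features 1 and 2 missing
    print(mean_imputation(obs))      # F at a single imputed point
    print(expected_prediction(obs))  # E[F]; differs since F is nonlinear

Because the sigmoid is nonlinear, the two quantities differ; note also that the enumeration over completions is exponential in the number of missing features, which foreshadows the hardness result on the next slide.
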
Hardness of Taking Expectations

How can we compute the expected prediction?
In general, it is intractable for arbitrary pairs of discriminative and generative models.
We proved that even for the simple case where F is a Logistic Regression and P is a Naïve Bayes model, the task is NP-hard.

What to do instead?

What if we only have a single generative model that is good at both:
1) Classifying   2) Handling missing features.

Conformant learning: given a discriminative classifier, learn a generative model that
1. Conforms to the classifier.
2. Maximizes the likelihood of the joint feature distribution P(X).

With no missing features, it gives the same quality of classification; with missing features, there is no problem: just do inference.

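Written out (a reconstruction from the two conditions above, not a formula taken from the slide), conformant learning solves a constrained maximum-likelihood problem:

    % Among generative models P_Theta that agree with the classifier F on
    % every complete input x, pick the one maximizing data likelihood.
    \Theta^{*} = \operatorname*{arg\,max}_{\Theta} \log P_{\Theta}(\mathcal{D})
    \quad \text{subject to} \quad
    P_{\Theta}(Y \mid x) = F(x) \ \text{for all complete inputs } x
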
Naïve Conformant Learning (NaCL)

We focus on the case of conformant learning involving Logistic Regression and Naïve Bayes.

Given an NB model, there is a unique LR model that conforms to it (illustrated in the sketch below).
Given an LR model, there are many NB models that conform to it.

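A minimal sketch of the first direction (my own illustration with made-up parameters, binary features, two classes; not code from the NaCL repository): reading off a Naïve Bayes model's log-odds yields the unique conforming logistic regression.

    import math

    prior1 = 0.4                 # P(Y=1); all numbers are made up
    theta1 = [0.8, 0.3]          # P(X_i = 1 | Y = 1)
    theta0 = [0.5, 0.6]          # P(X_i = 1 | Y = 0)

    def nb_to_lr(prior1, theta1, theta0):
        """Return (bias, weights) of the unique LR matching NB's P(Y=1|x)."""
        bias = math.log(prior1 / (1 - prior1))
        weights = []
        for t1, t0 in zip(theta1, theta0):
            bias += math.log((1 - t1) / (1 - t0))   # x_i = 0 contribution
            weights.append(math.log(t1 / t0) - math.log((1 - t1) / (1 - t0)))
        return bias, weights

    def nb_posterior(x):
        """P(Y=1 | x) computed directly from the Naïve Bayes parameters."""
        lik = lambda t, xi: t if xi == 1 else 1 - t
        j1 = prior1 * math.prod(lik(t, xi) for t, xi in zip(theta1, x))
        j0 = (1 - prior1) * math.prod(lik(t, xi) for t, xi in zip(theta0, x))
        return j1 / (j1 + j0)

    bias, w = nb_to_lr(prior1, theta1, theta0)
    for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
        z = bias + sum(wi * xi for wi, xi in zip(w, x))
        lr = 1 / (1 + math.exp(-z))
        assert abs(lr - nb_posterior(x)) < 1e-12  # agree on every input
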
Naïve Conformant Learning (NaCL)

We showed that the Naïve Conformant Learning optimization task can be written as a Geometric Program.
Solving geometric programs has an exact and efficient algorithm, and modern GP solvers can handle large problems.
For NaCL, the program has O(nk) constraints, where n is the number of features and k is the number of classes.

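For readers unfamiliar with geometric programs, here is a toy GP — not the NaCL program itself — adapted from cvxpy's documented DGP example, showing the form such solvers consume: positive variables with monomial/posynomial constraints.

    import cvxpy as cp  # pip install cvxpy

    # Toy geometric program, NOT the NaCL objective.
    x = cp.Variable(pos=True)
    y = cp.Variable(pos=True)
    z = cp.Variable(pos=True)

    objective = cp.Maximize(x * y * z)
    constraints = [
        4 * x * y * z + 2 * x * z <= 10,  # posynomial <= constant
        x <= 2 * y,
        y <= 2 * x,
        z >= 1,
    ]
    problem = cp.Problem(objective, constraints)
    problem.solve(gp=True)  # solved via the log-log convex transform
    print(problem.value, x.value, y.value, z.value)
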
Naïve Conformant Learning (NaCL)

Logistic Regression weights  -->  NaCL  -->  "Best" conforming Naïve Bayes

GitHub: github.com/UCLA-StarAI/NaCL

Experiments: Fidelity to Original Classifier

We use cross entropy to compare the probabilities of the original classifier against the probabilities of NaCL's learned model.
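As a sketch of the fidelity metric (my own minimal illustration with made-up numbers, not the paper's evaluation code):

    import math

    def cross_entropy(p_original, p_learned, eps=1e-12):
        """Average binary cross entropy between two classifiers' predicted
        probabilities over a set of inputs; lower means higher fidelity."""
        total = 0.0
        for p, q in zip(p_original, p_learned):
            q = min(max(q, eps), 1 - eps)  # guard against log(0)
            total += -(p * math.log(q) + (1 - p) * math.log(1 - q))
        return total / len(p_original)

    p_lr   = [0.91, 0.10, 0.55]  # original logistic regression outputs
    p_nacl = [0.90, 0.12, 0.53]  # conforming Naïve Bayes posteriors
    print(cross_entropy(p_lr, p_nacl))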
 
Experiments: Classification Accuracy

More experimental results in the paper.

Other Applications

We saw that Expected Prediction is very effective in handling missing features. What else can we do?

Explanations
Feature Selection
Fairness

Local Explanations using Missing-ness

Goal: explain an instance of classification.
Support features: making them missing, the predicted probability goes down.
Opposing features: making them missing, the predicted probability goes up.

Sufficient Explanations: remove the maximum number of supporting features until the expected classification is about to change, then show the remaining support features. (A greedy sketch follows below.)
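A minimal greedy sketch of this idea (my own illustration, not the authors' exact algorithm; expected_prediction is assumed to be a routine like the enumeration sketch earlier in this deck):

    def sufficient_explanation(observed, expected_prediction, threshold=0.5):
        """Greedily drop observed features while the expected classification
        stays on the same side of the decision threshold; return the indices
        of the features that must remain observed."""
        label = expected_prediction(observed) >= threshold
        current = list(observed)
        changed = True
        while changed:
            changed = False
            for i, v in enumerate(current):
                if v is None:
                    continue                  # already missing
                trial = list(current)
                trial[i] = None               # tentatively hide feature i
                if (expected_prediction(trial) >= threshold) == label:
                    current = trial           # decision unchanged: drop it
                    changed = True
        return [i for i, v in enumerate(current) if v is not None]

    # e.g. sufficient_explanation([1, None, 0], expected_prediction)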
 
Conclusion

Expected Prediction is an effective tool for several applications, such as handling missing data and generating explanations.

We introduced NaCL which, given a logistic regression model, learns a Naïve Bayes model that conforms with the LR model and also maximizes the joint feature likelihood.

Future work includes looking at more expressive pairs of models, and potentially choosing models that make the expected prediction tractable.

Thank You

What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features
GitHub: github.com/UCLA-StarAI/NaCL