Scalable QoS Prediction for Service Adaptation in Service-Based Applications

undefined

Towards Online, Accurate, and

Scalable QoS Prediction for

Runtime Service Adaptation

Jieming Zhu, Pinjia He, Zibin Zheng,

and Michael R. Lyu

The Chinese University of Hong Kong

ICDCS 2014

Madrid, Spain 30 June-3 July 2014

Outline













Introduction



[ref. http://www.priceline.com]

Introduction



Car rental s

ervices provided by

different companies

Introduction





Response time, throughput, failure probability





Service failures / SLA violations

Failure

Introduction







Loose coupling and dynamic binding



Make use of redundant services



Become resilient against failures of component services

Introduction





When to trigger an adaptation action?



Which working services to be replaced?



Which candidate services to employ?





QoS for working services



Existing work:

e.g., monitoring



QoS for candidate services



Our work:

unsolved problem

Outline













Observations





Dynamic:



Users are distributed worldwide



The workload of service is varying



Network is dynamic



User-specific:



Different users may perceive different QoS





A large number of users as well as services



Prohibitive overhead at runtime

Challenges



Predict the

missing values

Outline













Collaborative Filtering (CF)





User-movie rating prediction (Netflix challenge)



Similar users (e.g., similar preferences)



Similar movies (e.g., similar themes)

movies

users

Rating

matrix

CF vs QoS Prediction





Classic model for CF







Usually solved by gradient descend algorithm (batch mode)

Sum of squared error

Regularization terms

Limitations of MF for QoS prediction





Mismatch with the probabilistic assumption for MF



Degrade its prediction accuracy





Existing QoS values can be continuously updated



However, MF work offline, and cannot adapt to new

observed QoS values

Response Time

Throughput

Limitations of MF for QoS prediction





Users and services may join or leave the environment



MF works on a matrix with a fixed size, not scalable





Our approach: adaptive matrix factorization



Aim to meet the requirements of being online, accurate,

and scalable

Adaptive Matrix Factorization





Box-Cox transformation (to address limitation 1)



Stabilize data variance



Rank-preserving

Key Techniques 1: Data Transformation

Response Time

Throughput

Response Time

Throughput



Online learning (to address limitation 2)



Stochastic gradient descent (SGD)



Adapt to each newly observed data sample



Update a user vector and a service vector at each step



Key Techniques 2: Online Learning

SGD update rules

Online mode





Become robust



Existing users and services keep stable



New users and services converge fast



Unique learning rate for each user/service



Large for new vectors, small for converged vectors

Key Techniques 3: Adaptive Weights

Outline

















Response time (RT):

user-perceived delay of service

invocation (sec)



Throughput (TP):

data transmission rate (kbps)



142 * 4500 * 64

QoS matrix



 users (Planetlab nodes)



4,500

real-world Web services



 time slices, at

min time interval

Experiments





MAE (Mean Absolute Error):

to measure the average

prediction accuracy



MRE (Median Relative Error):

a key metric to identify the

error effect of different magnitudes of prediction values



NMRE (Ninety-Percentile Relative Error)

：

NPRE takes the

90th percentile of all the pairwise relative errors

Experiments





Compared approaches:



UPCC, IPCC, UIPCC:

conventional CF baselines



PMF:

convectional matrix factorization approach



These approaches cannot perform online



Matrix density:

means how many historical data we use

Experiments

Experiments





Compared approaches:



UIPCC



PMF

Experiments

Re-train the entire model

at each time slice

AMF: continuously

and incremental

updating





80% of users and services at time slice 1 as existing users and

services



Add the remaining 20% into the model

Experiments

Robust to new users and services

Outline

















AMF: Adaptive Matrix Factorization



Data transformation, online learning, and adaptive weights



Online, accurate, and scalable





Implement our QoS prediction approach together with

service adaptation mechanisms



Real-world evaluation on case studies

Conclusions

undefined

Our data & code are available at:

http://wsdream.github.io/AMF

Thank you!

Slide Note

Embed Share

Download

This study delves into the challenge of predicting Quality of Service (QoS) for runtime service adaptation in service-based applications. It explores collaborative filtering and adaptive matrix factorization techniques for accurate QoS prediction, aiming towards enhancing service resilience and reducing failures in a dynamic operating environment.

kinnunen_e Follow

Uploaded on Sep 14, 2024 | 0 Views

Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript

ICDCS 2014 Madrid, Spain 30 June-3 July 2014 Towards Online, Accurate, and Scalable QoS Prediction for Runtime Service Adaptation Zhu, Pinjia Pinjia He, and Michael R. and Michael R. Lyu The Chinese University of Hong Kong The Chinese University of Hong Kong Jieming Jieming Zhu, He, Zibin Zibin Zheng, Lyu Zheng,

Outline Introduction QoS Prediction Problem Collaborative Filtering Adaptive Matrix Factorization Experiments Conclusion & Future Work 2

Introduction Service-based applications: built on a set of component services Service Service Service Service 3 [ref. http://www.priceline.com]

Introduction Redundant services: functionally-equivalent services provided in the cloud Car rental services provided by different companies 4

Introduction Failure Quality-of-Service (QoS): user requirements Response time, throughput, failure probability Complex operating environment Service failures / SLA violations 5

Introduction Service adaptation: switch a working service to a candidate service at runtime (e.g., B1 B2) Loose coupling and dynamic binding Make use of redundant services Become resilient against failures of component services 6

Introduction Decisions for service adaptation When to trigger an adaptation action? Which working services to be replaced? Which candidate services to employ? Need available QoS information of component services QoS for working services Existing work: e.g., monitoring QoS for candidate services Our work: unsolved problem 7

Outline Introduction QoS Prediction Problem Collaborative Filtering Adaptive Matrix Factorization Experiments Conclusions & Future Work 8

Observations QoS Attributes Dynamic: Users are distributed worldwide The workload of service is varying Network is dynamic User-specific: Different users may perceive different QoS Monitor all QoS values: straightforward yet impractical A large number of users as well as services Prohibitive overhead at runtime 9

Challenges QoS prediction: a promising approach Predict the missing values 10

Outline Introduction QoS Prediction Problem Collaborative Filtering Adaptive Matrix Factorization Experiments Conclusion & Future Work 11

Collaborative Filtering (CF) Collaborative filtering problem User-movie rating prediction (Netflix challenge) Similar users (e.g., similar preferences) Similar movies (e.g., similar themes) movies Rating matrix users 12

CF vs QoS Prediction User-perceived QoS prediction Collaborative filtering for QoS prediction? Collaborative filtering QoS Prediction User- movie rating matrix Rows users Columns movies User-service QoS matrix Rows users Columns services Latent factors: preferences, topics Latent factors: network, workload 13

Classic model for CF Matrix factorization (MF): Minimization formulation: Regularization terms Sum of squared error Usually solved by gradient descend algorithm (batch mode) 14

Limitations of MF for QoS prediction Limitation 1: skewed QoS value distributions Mismatch with the probabilistic assumption for MF Degrade its prediction accuracy Response Time Throughput Limitation 2: time varying QoS values Existing QoS values can be continuously updated However, MF work offline, and cannot adapt to new observed QoS values 15

Limitations of MF for QoS prediction Limitation 3: scalability on new users and services Users and services may join or leave the environment MF works on a matrix with a fixed size, not scalable How to address these limitations? Our approach: adaptive matrix factorization Aim to meet the requirements of being online, accurate, and scalable 16

Adaptive Matrix Factorization Algorithm overview QoS data stream collection Data transformation Online learning and updating Return predicted QoS values 17

Key Techniques 1: Data Transformation Box-Cox transformation (to address limitation 1) Stabilize data variance Rank-preserving Response Time Throughput Response Time 18 Throughput

Key Techniques 2: Online Learning Online learning (to address limitation 2) Stochastic gradient descent (SGD) Adapt to each newly observed data sample Update a user vector and a service vector at each step Online mode SGD update rules Extensible to new users and services 19

Key Techniques 3: Adaptive Weights Adaptive weights (to address limitation 3) Become robust Existing users and services keep stable New users and services converge fast Unique learning rate for each user/service Large for new vectors, small for converged vectors 1.0 1.5 20

Outline Introduction QoS Prediction Problem Collaborative Filtering Adaptive Matrix Factorization Experiments Conclusion & Future Work 21

Experiments Dataset collection Response time (RT): user-perceived delay of service invocation (sec) Throughput (TP): data transmission rate (kbps) 142 * 4500 * 64 QoS matrix 142 users (Planetlab nodes) 4,500 real-world Web services 64 time slices, at 15min time interval 22

Experiments Evaluation Metrics MAE (Mean Absolute Error): to measure the average prediction accuracy MRE (Median Relative Error): a key metric to identify the error effect of different magnitudes of prediction values NMRE (Ninety-Percentile Relative Error) NPRE takes the 90th percentile of all the pairwise relative errors 23

Experiments Performance Comparison Compared approaches: UPCC, IPCC, UIPCC: conventional CF baselines PMF: convectional matrix factorization approach These approaches cannot perform online Matrix density: means how many historical data we use 24

Experiments Impact of data transformation Compared approaches PMF (without data transformation) AMF(? = 1, reduce to linear normalization) AMF (? can be tuned automatically ) 25

Experiments Efficiency analysis Compared approaches: UIPCC Re-train the entire model at each time slice PMF AMF: continuously and incremental updating 26

Outline Introduction QoS Prediction Problem Collaborative Filtering Adaptive Matrix Factorization Experiments Conclusion & Future Work 28

Conclusions QoS prediction for candidate services AMF: Adaptive Matrix Factorization Data transformation, online learning, and adaptive weights Online, accurate, and scalable Future work Implement our QoS prediction approach together with service adaptation mechanisms Real-world evaluation on case studies 29