Predictability and Utilisation Trade-off in Video Stream Decoding Management

 
Predictability and Utilisation Trade-off
in the Dynamic Management of
Multiple Video Stream Decoding on
Network-on-Chip based Homogeneous
Embedded Multi-cores
 
Hashan Roshantha Mendis (Rosh),
Leandro Soares Indrusiak,
Neil Audsley
 
R
h
e
o
n
M
e
d
i
a
 
(Real-time Systems Group, University of York)
 
RTNS 2014 - Versailles, France
 
09/10/2014
 
Outline
 
Motivation
System overview
Application model
Platform model
Task mapping
Admission control tests
Deterministic
Heuristic based
Evaluation method
Results
Conclusion and future work
 
RTNS 2014 - Versailles, France
 
2
 
09/10/2014
 
Motivation
 
Context
NoC based 
soft real-time
 embedded systems
Heuristic
 
based
 admission control of multiple
video stream decoding tasks.
Objectives
Make 
predictable 
admission control decisions
of video stream decoding requests, while
maintaining high system 
utilisation
.
 
RTNS 2014 - Versailles, France
 
3
 
09/10/2014
 
Motivation
 
Metrics definition
Improving predictability:
Minimising the maximum and mean lateness of
the admitted video stream tasks
Reducing the number of dropped task (due to
platform global input buffer overflow)
 
Improving utilisation:
Reduce idle time of processing elements
 
RTNS 2014 - Versailles, France
 
4
 
09/10/2014
 
System overview
 
RTNS 2014 - Versailles, France
 
5
 
09/10/2014
 
System overview - Application model
 
RTNS 2014 - Versailles, France
 
6
 
09/10/2014
 
System overview - Application model
 
RTNS 2014 - Versailles, France
 
7
 
09/10/2014
 
System overview - Application model
 
RTNS 2014 - Versailles, France
 
8
 
09/10/2014
 
Assumptions
Frame-rate = 25 fps
Fixed GoP structure - each job has the
same predefined dependency pattern
End-to-end deadline of job (D
e2e
)
known
Aperiodic tasks
Fixed priority
All tasks of a job arrive at the same
time.
Video stream start/end not known
Task execution times unknown. Worst-
case/average-case can be estimated
 
System overview - Platform model
 
09/10/2014
 
RTNS 2014 - Versailles, France
 
9
 
Processing Elements
Homogeneous
Separate individual kernel
instances.
Same application code
Fixed priority-preemptive
local task scheduler
 
Network-on-Chip
2D Mesh
XY routing
Fixed priority-preemptive
arbitration
 
System overview - Platform model
 
09/10/2014
 
RTNS 2014 - Versailles, France
 
10
Tasks are held until
dependencies are
met and PE task
queue is not full.
 
Processing Elements
Homogeneous
Separate individual kernel
instances.
Same application code
Fixed priority-preemptive
local task scheduler
 
Network-on-Chip
2D Mesh
XY routing
Fixed priority-preemptive
arbitration
 
System overview – Task mapping
 
RTNS 2014 - Versailles, France
 
11
 
09/10/2014
 
Semi-dynamic mapping
The same task mapping
pattern is used for every
subsequent job in a given
video stream.
This 
job-level task to core
mapping
 is established by
the RM upon admission of
a new video stream.
Shortest task queue first
(STQF)
 
Admission control tests
 
Deterministic
Using classical schedulability tests
Heuristic based
Based on subtask-lateness
 
09/10/2014
 
RTNS 2014 - Versailles, France
 
12
 
Admission control tests - Deterministic
 
RTNS 2014 - Versailles, France
 
13
 
09/10/2014
[1] N. Audsley, A. Burns, M. Richardson, K. Tindell, and A. J. Wellings. 
Applying new scheduling theory to static priority pre-emptive scheduling
. Software Eng. Journal, Sept. 1993
[2] Z. Shi, A. Burns, and L. S. Indrusiak. 
Schedulability analysis for real time on-chip communication with wormhole switching
. Int. Journal of Embedded and Real-Time Comms. Systems, 2010.
[3] L. S. Indrusiak. 
End-to-end schedulability tests for multiprocessor embedded systems based on networks-on-chip with priority-preemptive arbitration
. Journal of Sys. Arch., May 2014.
 
Eq 1:
 
Eq 2:
 
Admission control tests - Deterministic
 
RTNS 2014 - Versailles, France
 
14
 
09/10/2014
[1] N. Audsley, A. Burns, M. Richardson, K. Tindell, and A. J. Wellings. 
Applying new scheduling theory to static priority pre-emptive scheduling
. Software Eng. Journal, Sept. 1993
[2] Z. Shi, A. Burns, and L. S. Indrusiak. 
Schedulability analysis for real time on-chip communication with wormhole switching
. Int. Journal of Embedded and Real-Time Comms. Systems, 2010.
[3] L. S. Indrusiak. 
End-to-end schedulability tests for multiprocessor embedded systems based on networks-on-chip with priority-preemptive arbitration
. Journal of Sys. Arch., May 2014.
 
Eq 1:
 
Eq 2:
 
Admission control tests - Deterministic
 
RTNS 2014 - Versailles, France
 
15
 
09/10/2014
 
Schedulability analysis – Non-interferers
We assume there is no overlap between executions
and invocations of different jobs within the same
video stream.
Precedence constraints are taken into account when
determining interfering tasks
Exclusion of possible non-interferers makes analysis
tighter but still safe for deterministic AC decisions.
 
Admission control tests – Heuristic based
 
RTNS 2014 - Versailles, France
 
16
 
09/10/2014
 
Eq 3:
 
Eq 4:
 
Admission control tests – Heuristic based
 
RTNS 2014 - Versailles, France
 
17
 
09/10/2014
 
Instantaneous task lateness
Previous work of determining the subtask
deadline  -  by Kao and Garcia-Molina [4]; termed
Deadline
 
Equal flexibility scheme 
(D-EQF)
Total remaining slack is divided among the
subtasks in proportion to their estimated
execution times (WCET).
[4] B. Kao and H. Garcia-Molina. Deadline assignment in a distributed soft real-time system. IEEE Trans. on Parallel and Distributed Systems, Dec. 1997
 
Admission control tests – buffer overflow
 
RTNS 2014 - Versailles, France
 
18
 
09/10/2014
 
Free space in buffers and task queues
If global input buffers or task queues do not have
sufficient space, taskset will be 
dropped
 without
receiving
Hence buffer overflow is also a cause for rejecting
new video streams.
 
 
Evaluation method
 
RTNS 2014 - Versailles, France
 
19
 
09/10/2014
 
Admission control (AC) tests compared :
Baselines : No AC test, Deterministic  AC,
Heuristic based (0.1 ≤ (
IBL
α
 
, 
TQL
α
 
) ≤ 1.0)
Measurements:
Number of video streams admitted/rejected/admitted
but deadlines missed
Job lateness
Overall average PE busy time
High and Low workloads
Abstract system-level simulation with lightweight NoC
simulation component [5].
35 simulation runs
 
 
 
[5] L. S. Indrusiak and O. M. dos Santos. Fast and accurate transaction-level model of a wormhole network-on-chip with priority preemptive virtual channel arbitration. In DATE 2011, IEEE, 2011.
 
Experiment results
 
RTNS 2014 - Versailles, France
 
20
 
09/10/2014
 
Video stream admission summary
 
As the Heuristic ratios increase, the number of admitted video streams increase;
however at the cost of, increased late streams and many jobs being dropped.
Heu (D-EQF) offers best predictability guarantees : no late streams, higher
admission rate than Deterministic AC-test.
 
Experiment results
 
RTNS 2014 - Versailles, France
 
21
 
09/10/2014
 
Completed job (i.e. GoP) lateness
 
As the Heuristic ratios increase, the distribution, mean and maximum job
lateness increases.
No-AC test shows very high job lateness levels.
 
Experiment results
 
RTNS 2014 - Versailles, France
 
22
 
09/10/2014
 
Percentage system busy time (system utilisation)
 
As the Heuristic ratios increase, the average system utilisation levels increase –
more admitted video streams.
Very low utilisation levels when deterministic AC test is used.
 
Experiment results - summary
 
RTNS 2014 - Versailles, France
 
23
 
09/10/2014
 
Heuristic based
AC tests
(IBL
α
 
, TQL
α
)
 
Higher value of
IBL
α
 
combined
with a mid/high
value of TQL
α
provides best
trade-off
 
Conclusion and future work
 
RTNS 2014 - Versailles, France
 
24
 
09/10/2014
 
Application specific
 task model
Heuristic
 based admission control approach.
Subtask lateness
 evaluated as a ratio of the end-to-end
deadline of the overall taskset.
Improved
 
utilisation
 over the 
deterministic AC test 
and
better predictability guarantees
 than when 
No-AC
 is
used.
The 
IBL
α
 and TQL
α
 
thresholds can be chosen depending
on the specific requirements .
Future work : 
explore better dynamic/semi-dynamic
mapping approaches, compare other ACs
 
 
 
Thank You !
 
Questions ?
 
09/10/2014
 
RTNS 2014 - Versailles, France
 
25
 
 
 
EXTRA SLIDES
 
09/10/2014
 
RTNS 2014 - Versailles, France
 
26
 
Evaluation method
 
RTNS 2014 - Versailles, France
 
27
 
09/10/2014
 
 
Simulation experiment parameters
 
Admission control tests - Deterministic
 
RTNS 2014 - Versailles, France
 
28
 
09/10/2014
 
Example task and flow timeline
 
Admission control tests – Heuristic based
 
RTNS 2014 - Versailles, France
 
29
 
09/10/2014
[4] B. Kao and H. Garcia-Molina. Deadline assignment in a distributed soft real-time system. IEEE Trans. on Parallel and Distributed Systems, Dec. 1997
 
Eq 5:
Slide Note

This paper mainly focuses on heuristic based admission control approaches to balance predictability and utilisation for soft real-time video processing applications running on network on chip based embedded multicores.

Embed
Share

This study explores the trade-off between predictability and utilisation in managing multiple video stream decoding tasks on Network-on-Chip based embedded multi-cores. The objective is to make predictable admission control decisions while maintaining high system utilisation. Metrics include minimizing lateness of tasks and reducing dropped tasks, while improving processing element utilisation. The system overview, application model, and assumptions in the context of soft real-time embedded systems are discussed.

  • Video Stream Decoding
  • Network-on-Chip
  • Embedded Systems
  • Predictability
  • Utilisation

Uploaded on Sep 21, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Predictability and Utilisation Trade-off in the Dynamic Management of Multiple Video Stream Decoding on Network-on-Chip based Homogeneous Embedded Multi-cores Hashan Roshantha Mendis (Rosh), Leandro Soares Indrusiak, Neil Audsley (Real-time Systems Group, University of York) RheonMedia 09/10/2014 RTNS 2014 - Versailles, France

  2. Outline Motivation System overview Application model Platform model Task mapping Admission control tests Deterministic Heuristic based Evaluation method Results Conclusion and future work 2 09/10/2014 RTNS 2014 - Versailles, France

  3. Motivation Context NoC based soft real-time embedded systems Heuristicbased admission control of multiple video stream decoding tasks. Objectives Make predictable admission control decisions of video stream decoding requests, while maintaining high system utilisation. 3 09/10/2014 RTNS 2014 - Versailles, France

  4. Motivation Metrics definition Improving predictability: Minimising the maximum and mean lateness of the admitted video stream tasks Reducing the number of dropped task (due to platform global input buffer overflow) Improving utilisation: Reduce idle time of processing elements 4 09/10/2014 RTNS 2014 - Versailles, France

  5. System overview 5 09/10/2014 RTNS 2014 - Versailles, France

  6. System overview - Application model 6 09/10/2014 RTNS 2014 - Versailles, France

  7. System overview - Application model 7 09/10/2014 RTNS 2014 - Versailles, France

  8. System overview - Application model Assumptions Frame-rate = 25 fps Fixed GoP structure - each job has the same predefined dependency pattern End-to-end deadline of job (De2e) known Aperiodic tasks Fixed priority All tasks of a job arrive at the same time. Video stream start/end not known Task execution times unknown. Worst- case/average-case can be estimated 8 09/10/2014 RTNS 2014 - Versailles, France

  9. System overview - Platform model Processing Elements Homogeneous Separate individual kernel instances. Same application code Fixed priority-preemptive local task scheduler Network-on-Chip 2D Mesh XY routing Fixed priority-preemptive arbitration 9 09/10/2014 RTNS 2014 - Versailles, France

  10. System overview - Platform model Processing Elements Homogeneous Separate individual kernel instances. Same application code Fixed priority-preemptive local task scheduler Tasks are held until dependencies are met and PE task queue is not full. Network-on-Chip 2D Mesh XY routing Fixed priority-preemptive arbitration 10 09/10/2014 RTNS 2014 - Versailles, France

  11. System overview Task mapping Semi-dynamic mapping The same task mapping pattern is used for every subsequent job in a given video stream. This job-level task to core mapping is established by the RM upon admission of a new video stream. Shortest task queue first (STQF) 11 09/10/2014 RTNS 2014 - Versailles, France

  12. Admission control tests Deterministic Using classical schedulability tests Heuristic based Based on subtask-lateness 12 09/10/2014 RTNS 2014 - Versailles, France

  13. Admission control tests - Deterministic ?? response time of task ?? ?? worst-case execution time of task ?? ?? period of task ?? ? ? higher priority tasks blocking ?? ?? response time of flow Msg? ??? interference jitter of flow Msg? ?? basic latency of flow Msg? ??? direct interferers of flow Msg? Schedulability analysis Worst-case response time of a task ?? [1]: ??? ?? ???+1= ??+ ?? Eq 1: ?? ? ? Worst case response time of a flow ???? [2][3]: ? ???+ ??+ ?? ?? ???+1= ??+ Eq 2: ?? ???? ??? A job is considered schedulable if the end-to-end response time of its critical path is less than or equal to the end-to-end deadline (De2e) [1] N. Audsley, A. Burns, M. Richardson, K. Tindell, and A. J. Wellings. Applying new scheduling theory to static priority pre-emptive scheduling. Software Eng. Journal, Sept. 1993 [2] Z. Shi, A. Burns, and L. S. Indrusiak. Schedulability analysis for real time on-chip communication with wormhole switching. Int. Journal of Embedded and Real-Time Comms. Systems, 2010. [3] L. S. Indrusiak. End-to-end schedulability tests for multiprocessor embedded systems based on networks-on-chip with priority-preemptive arbitration. Journal of Sys. Arch., May 2014. 13 09/10/2014 RTNS 2014 - Versailles, France

  14. Admission control tests - Deterministic ?? response time of task ?? ?? worst-case execution time of task ?? ?? period of task ?? ? ? higher priority tasks blocking ?? ?? response time of flow Msg? ??? interference jitter of flow Msg? ?? basic latency of flow Msg? ??? direct interferers of flow Msg? Schedulability analysis Worst-case response time of a task ?? [1]: ??? ?? ???+1= ??+ ?? Eq 1: ?? ? ? Worst case response time of a flow ???? [2][3]: ? ???+ ??+ ?? ?? ???+1= ??+ Eq 2: ?? ???? ??? A job is considered schedulable if the end-to-end response time of its critical path is less than or equal to the end-to-end deadline (De2e) [1] N. Audsley, A. Burns, M. Richardson, K. Tindell, and A. J. Wellings. Applying new scheduling theory to static priority pre-emptive scheduling. Software Eng. Journal, Sept. 1993 [2] Z. Shi, A. Burns, and L. S. Indrusiak. Schedulability analysis for real time on-chip communication with wormhole switching. Int. Journal of Embedded and Real-Time Comms. Systems, 2010. [3] L. S. Indrusiak. End-to-end schedulability tests for multiprocessor embedded systems based on networks-on-chip with priority-preemptive arbitration. Journal of Sys. Arch., May 2014. 14 09/10/2014 RTNS 2014 - Versailles, France

  15. Admission control tests - Deterministic Schedulability analysis Non-interferers We assume there is no overlap between executions and invocations of different jobs within the same video stream. Precedence constraints are taken into account when determining interfering tasks Exclusion of possible non-interferers makes analysis tighter but still safe for deterministic AC decisions. 15 09/10/2014 RTNS 2014 - Versailles, France

  16. Admission control tests Heuristic based Instantaneous task lateness We assume the individual task deadline is a ratio of the overall end-to- end deadline of the job. We make admission decisions based on the instantaneous lateness of the tasks (??) in the Global input buffers and the PE task queues. ????????????? = ?? ?? ??2? ??? Eq 3: ??????????? = ?? ?? ??2? ??? Eq 4: ?? current time ?? task dispatch time ??? 0 ??? 1}, ??? 0 ??? 1} If any of the tasks in the global input buffers or the PE task queues are late, then the new video stream admission request is rejected. 16 09/10/2014 RTNS 2014 - Versailles, France

  17. Admission control tests Heuristic based Instantaneous task lateness Previous work of determining the subtask deadline - by Kao and Garcia-Molina [4]; termed DeadlineEqual flexibility scheme (D-EQF) Total remaining slack is divided among the subtasks in proportion to their estimated execution times (WCET). [4] B. Kao and H. Garcia-Molina. Deadline assignment in a distributed soft real-time system. IEEE Trans. on Parallel and Distributed Systems, Dec. 1997 17 09/10/2014 RTNS 2014 - Versailles, France

  18. Admission control tests buffer overflow Free space in buffers and task queues If global input buffers or task queues do not have sufficient space, taskset will be dropped without receiving Hence buffer overflow is also a cause for rejecting new video streams. 18 09/10/2014 RTNS 2014 - Versailles, France

  19. Evaluation method Admission control (AC) tests compared : Baselines : No AC test, Deterministic AC, Heuristic based (0.1 (IBL , TQL ) 1.0) Measurements: Number of video streams admitted/rejected/admitted but deadlines missed Job lateness Overall average PE busy time High and Low workloads Abstract system-level simulation with lightweight NoC simulation component [5]. 35 simulation runs [5] L. S. Indrusiak and O. M. dos Santos. Fast and accurate transaction-level model of a wormhole network-on-chip with priority preemptive virtual channel arbitration. In DATE 2011, IEEE, 2011. 19 09/10/2014 RTNS 2014 - Versailles, France

  20. Experiment results Video stream admission summary As the Heuristic ratios increase, the number of admitted video streams increase; however at the cost of, increased late streams and many jobs being dropped. Heu (D-EQF) offers best predictability guarantees : no late streams, higher admission rate than Deterministic AC-test. 20 09/10/2014 RTNS 2014 - Versailles, France

  21. Experiment results Completed job (i.e. GoP) lateness As the Heuristic ratios increase, the distribution, mean and maximum job lateness increases. No-AC test shows very high job lateness levels. 21 09/10/2014 RTNS 2014 - Versailles, France

  22. Experiment results Percentage system busy time (system utilisation) As the Heuristic ratios increase, the average system utilisation levels increase more admitted video streams. Very low utilisation levels when deterministic AC test is used. 22 09/10/2014 RTNS 2014 - Versailles, France

  23. Experiment results - summary Heuristic based AC tests (IBL , TQL ) Higher value of IBL combined with a mid/high value of TQL provides best trade-off 23 09/10/2014 RTNS 2014 - Versailles, France

  24. Conclusion and future work Application specific task model Heuristic based admission control approach. Subtask lateness evaluated as a ratio of the end-to-end deadline of the overall taskset. Improvedutilisation over the deterministic AC test and better predictability guarantees than when No-AC is used. The IBL and TQL thresholds can be chosen depending on the specific requirements . Future work : explore better dynamic/semi-dynamic mapping approaches, compare other ACs 24 09/10/2014 RTNS 2014 - Versailles, France

  25. Thank You ! Questions ? 25 09/10/2014 RTNS 2014 - Versailles, France

  26. EXTRA SLIDES 26 09/10/2014 RTNS 2014 - Versailles, France

  27. Evaluation method Simulation experiment parameters Number of workflows High (=16), Low(=8) Heuristic ratios (combinations) 0.1 (IBL , TQL ) 1.0 35 Simulation runs Number of PEs 9 (3x3) PE task queue size 10 Dependency buff size 10 Global input buffer size 12 NoC frequency 1000MHz Videos per workflow (min/max) (6,7) GoPs per video stream (min/max) (7,8) 27 09/10/2014 RTNS 2014 - Versailles, France

  28. Admission control tests - Deterministic Example task and flow timeline 28 09/10/2014 RTNS 2014 - Versailles, France

  29. Admission control tests Heuristic based Instantaneous task lateness Previous work of determining the subtask deadline (di) by Kao and Garcia-Molina [4]; termed Equal flexibility scheme (EQF) ? ?? ??= ??+ ??+ ??2? ?? ?? Eq 5: ??? ?=1 ?=1 ?: number of tasks in the taskset ??: absolute deadline of subtask Total remaining slack is divided among the subtasks in proportion to their estimated execution times (WCET). [4] B. Kao and H. Garcia-Molina. Deadline assignment in a distributed soft real-time system. IEEE Trans. on Parallel and Distributed Systems, Dec. 1997 29 09/10/2014 RTNS 2014 - Versailles, France

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#