CROSSJACK

 
 
C
R
O
S
S
J
A
C
K
J
o
b
 
L
e
v
e
l
 
M
e
t
r
i
c
 
A
c
c
o
u
n
t
i
n
g
/
V
i
s
u
a
l
i
s
a
t
i
o
n
 
f
o
r
 
S
C
A
R
F
 
R
e
q
u
i
r
e
m
e
n
t
s
 
M
a
i
n
 
g
o
a
l
 
f
o
r
 
n
e
w
 
p
l
a
t
f
o
r
m
 
i
s
 
t
o
 
h
e
l
p
 
u
s
e
r
s
 
o
p
t
i
m
i
s
e
 
t
h
e
i
r
 
j
o
b
s
 
Job feedback must be simple and consolidated
 
Platform must be maintainable and easy to implement new features
 
Currently no ‘easy’ way to get metrics specific to your job ...
 
 
G
a
n
g
l
i
a
 
 
o
v
e
r
a
l
l
 
v
i
s
u
a
l
i
s
a
t
i
o
n
s
 
 
 
S
L
U
R
M
 
c
o
m
m
a
n
d
s
 
sacct
 
seff                                                                           scontrol
 
C
h
o
s
e
n
 
T
e
c
h
n
o
l
o
g
i
e
s
 
G
r
a
f
a
n
a
 
f
o
r
 
v
i
s
u
a
l
i
s
a
t
i
o
n
 
P
r
o
m
e
t
h
e
u
s
 
w
i
t
h
 
c
g
r
o
u
p
s
/
n
o
d
e
 
e
x
p
o
r
t
e
r
s
 
f
o
r
 
d
a
t
a
 
c
o
l
l
e
c
t
i
o
n
/
a
c
c
o
u
n
t
i
n
g
 
J
o
b
s
t
a
t
s
 
f
o
r
 
j
o
b
 
f
e
e
d
b
a
c
k
 
Demo…
 
 
W
h
a
t
s
 
n
e
x
t
?
 
Multi node setup
 
GPU metrics and more
 
‘Productionise’
 
 
L
i
n
k
s
:
 
Jobstats
 - 
https://github.com/PrincetonUniversity/jobstats
 
Node Exporter - 
https://github.com/prometheus/node_exporter
 
Cgroups
 Exporter - 
https://github.com/treydock/cgroup_exporter
 
Documentation – confluence
 
Slide Note
Embed
Share

This project focuses on creating a platform to help users optimize their job metrics efficiently. With a goal to provide simple and consolidated job feedback, the platform aims to be maintainable, easy to implement new features, and provide specific job metrics. Utilizing technologies like Grafana, Prometheus, and more, the project aims to enhance job visualization and user experience.

  • Job Metrics
  • Feedback Optimization
  • Data Visualization
  • User Platform
  • Technology Integration

Uploaded on Feb 19, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. CROSSJACK Job Level Metric Accounting/Visualisation for SCARF

  2. Requirements Main goal for new platform is to help users optimise their jobs Job feedback must be simple and consolidated Platform must be maintainable and easy to implement new features Currently no easy way to get metrics specific to your job ...

  3. Ganglia overall visualisations

  4. SLURM commands sacct seff scontrol

  5. Chosen Technologies Grafana for visualisation Prometheus with cgroups/node exporters for data collection/accounting Jobstats for job feedback Demo

  6. Whats next? Multi node setup GPU metrics and more Productionise

  7. Links: Jobstats - https://github.com/PrincetonUniversity/jobstats Node Exporter - https://github.com/prometheus/node_exporter Cgroups Exporter - https://github.com/treydock/cgroup_exporter Documentation confluence

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#