The Concept of Information Packages for Data Warehousing

undefined
Information Package
The concept to gather
requirement in DW
OLAP
Online Analytical Processing (OLAP),
mengandung dua tipe data dasar
(Mulyana, 2014):
1.
M
e
a
s
u
r
e
s
:
 
d
a
t
a
 
b
i
l
a
n
g
a
n
 
y
a
n
g
 
t
e
r
u
k
u
r
m
i
s
a
l
k
a
n
 
k
u
a
n
t
i
t
a
s
,
 
h
a
r
g
a
,
 
n
i
l
a
i
 
r
a
t
a
-
r
a
t
a
,
j
u
m
l
a
h
 
d
s
b
.
 
 
a
d
a
 
d
i
 
t
a
b
e
l
 
f
a
k
t
a
2.
D
i
m
e
n
s
i
o
n
:
 
k
a
t
e
g
o
r
i
 
y
a
n
g
 
d
i
g
u
n
a
k
a
n
u
n
t
u
k
 
m
e
n
g
a
t
u
r
 
m
e
a
s
u
r
e
s
,
 
t
e
r
d
a
p
a
t
t
i
n
g
k
a
t
a
n
 
(
l
e
v
e
l
)
.
 
M
i
s
a
l
n
y
a
 
d
i
m
e
n
s
i
 
w
a
k
t
u
,
d
e
n
g
a
n
 
t
i
n
g
k
a
t
a
n
 
t
a
h
u
n
,
 
k
u
a
r
t
a
l
,
 
b
u
l
a
n
d
a
n
 
h
a
r
i
.
 
 
a
d
a
 
d
i
 
t
a
b
e
l
 
d
i
m
e
n
s
i
Information Package
Why?
Because the requirements of DW 
cannot
be fully determined.
We have noted that the users tend to think in
terms of business dimensions and analyze
measurements along such business
dimensions.
You come up with what is known as an
information package for the specific subject.
An Information Package For
Analyzing Sales
Dimensions
A 
dimension
 is a structure that categorizes
data in order to enable users to answer
business questions.
Example of dimensions are customers,
products, and time.
Hierarchy of Dimension
1:n relationships between  the
levels of a hierarchy.
Going up a level in the hierarchy is
called rolling up and going down a
level in the hierarchy is called
drilling down.
Within the customer dimension,
customers roll up to city. Then
cities roll up to state. Then states
roll up to country. Then countries
roll up to subregion. Finally,
subregions roll up to region.
Roll Up
Drill Down
Granularity
Granularity refers to 
the level of detail or
summarization
 of the units of data in the
data warehouse.
The more detail there is, the lower the level of
granularity. The less detail there is, the higher
the level of granularity.
For example, a simple transaction would be
at a low level of granularity. A summary of all
transactions for the month would be at a high
level of granularity.
Granularity
Granularity
Example of Hotel Occupancy
In this case, we want to come up with an information
package for a hotel chain. The subject in this case is
hotel occupancy
.
We want to analyze occupancy of the rooms in the
various 
branches of the hotel 
chain.
We want to analyze the occupancy by individual
hotels and by 
room types
. So, hotel and room type
are critical business dimensions for the analysis.
As in the other case, we also need to include the 
time
dimension.
I
n
 
t
h
e
 
h
o
t
e
l
 
o
c
c
u
p
a
n
c
y
 
i
n
f
o
r
m
a
t
i
o
n
 
p
a
c
k
a
g
e
,
 
t
h
e
d
i
m
e
n
s
i
o
n
s
 
t
o
 
b
e
 
i
n
c
l
u
d
e
d
 
a
r
e
 
h
o
t
e
l
,
 
r
o
o
m
 
t
y
p
e
,
 
a
n
d
t
i
m
e
.
Information Package Diagram
of Hotel Occupancy
 
The information package diagrams
crystallize the information requirements
for the data warehouse.
They contain the critical metrics
measuring the performance of the
business units, the business dimensions
along which the metrics are analyzed,
and the details of how drill-down and
roll-up analyses are done.
Information Package Diagram
of Automaker Sales
Data pada facts dijadikan measurement pada database OLAP. 
Data facts dijadikan field pada tabel fakta. 
Information Package Diagram
of Automaker Sales
Setiap dimensi dijadikan tabel dimensi yang berelasi dengan tabel fakta. 
Tabel dimensi dapat dinormalisasi (snowflake) maupun tidak (star). 
Referensi
Paulraj Ponniah
Oracle Data Warehouse Guide
Pentaho: Solusi Open Source untuk
membangun Data Warehouse
(Mulyana, 2014)
Tugas 2
BigBook, Inc. is a large book distributor with domestic
and international distribution channels. The
company orders from publishers and distributes
publications to all the leading booksellers. Initially,
you want to build a data warehouse to analyze
shipments that are made from the company’s many
warehouses. Determine the metrics or facts and the
business dimensions. Draw an information package
diagram.
Slide Note
Embed
Share

Gather requirements in a Data Warehouse (DW) by utilizing information packages. Online Analytical Processing (OLAP) involves measures and dimensions for organizing data effectively. Information packages are essential as DW requirements may not be fully determined initially. Explore dimensions, hierarchies, and granularity levels to enhance data analysis. Example scenarios like analyzing sales and hotel occupancy showcase the importance of information packages in driving business insights.

  • Data Warehousing
  • Information Packages
  • OLAP
  • Dimensions
  • Granularity

Uploaded on Oct 04, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Information Package The concept to gather requirement in DW

  2. OLAP Online Analytical Processing (OLAP), mengandung dua tipe data dasar (Mulyana, 2014): 1. Measures: data bilangan yang terukur misalkan kuantitas, harga, nilai rata-rata, jumlah dsb. ada di tabel fakta 2. Dimension: kategori yang digunakan untuk mengatur measures, terdapat tingkatan (level). Misalnya dimensi waktu, dengan tingkatan tahun, kuartal, bulan dan hari. ada di tabel dimensi #

  3. Information Package Why? Because the requirements of DW cannot be fully determined. We have noted that the users tend to think in terms of business dimensions and analyze measurements along such business dimensions. You come up with what is known as an information package for the specific subject. #

  4. An Information Package For Analyzing Sales #

  5. Dimensions A dimension is a structure that categorizes data in order to enable users to answer business questions. Example of dimensions are customers, products, and time. #

  6. Hierarchy of Dimension 1:n relationships between the levels of a hierarchy. Going up a level in the hierarchy is called rolling up and going down a level in the hierarchy is called drilling down. Within the customer dimension, customers roll up to city. Then cities roll up to state. Then states roll up to country. Then countries roll up to subregion. Finally, subregions roll up to region. Drill Down Roll Up #

  7. Granularity Granularity refers to the level of detail or summarization of the units of data in the data warehouse. The more detail there is, the lower the level of granularity. The less detail there is, the higher the level of granularity. For example, a simple transaction would be at a low level of granularity. A summary of all transactions for the month would be at a high level of granularity. #

  8. Granularity #

  9. Granularity #

  10. Example of Hotel Occupancy In this case, we want to come up with an information package for a hotel chain. The subject in this case is hotel occupancy. We want to analyze occupancy of the rooms in the various branches of the hotel chain. We want to analyze the occupancy by individual hotels and by room types. So, hotel and room type are critical business dimensions for the analysis. As in the other case, we also need to include the time dimension. In the hotel occupancy information package, the dimensions to be included are hotel, room type, and time. #

  11. Information Package Diagram of Hotel Occupancy #

  12. The information package diagrams crystallize the information requirements for the data warehouse. They contain the critical metrics measuring the performance of the business units, the business dimensions along which the metrics are analyzed, and the details of how drill-down and roll-up analyses are done. #

  13. Information Package Diagram of Automaker Sales # Data pada facts dijadikan measurement pada database OLAP. Data facts dijadikan field pada tabel fakta.

  14. Information Package Diagram of Automaker Sales # Setiap dimensi dijadikan tabel dimensi yang berelasi dengan tabel fakta. Tabel dimensi dapat dinormalisasi (snowflake) maupun tidak (star).

  15. Referensi Paulraj Ponniah Oracle Data Warehouse Guide Pentaho: Solusi Open Source untuk membangun Data Warehouse (Mulyana, 2014) #

  16. Tugas 2 BigBook, Inc. is a large book distributor with domestic and international distribution channels. The company orders from publishers and distributes publications to all the leading booksellers. Initially, you want to build a data warehouse to analyze shipments that are made from the company s many warehouses. Determine the metrics or facts and the business dimensions. Draw an information package diagram. #

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#