Data Mining

D
a
t
a
 
M
i
n
i
n
g
 
D
a
t
a
 
m
i
n
i
n
g
 
i
s
 
t
h
e
 
p
r
o
c
e
s
s
 
o
f
 
d
i
s
c
o
v
e
r
i
n
g
i
n
t
e
r
e
s
t
i
n
g
 
p
a
t
t
e
r
n
s
 
a
n
d
 
k
n
o
w
l
e
d
g
e
 
f
r
o
m
l
a
r
g
e
 
a
m
o
u
n
t
s
 
o
f
 
d
a
t
a
.
T
h
e
 
d
a
t
a
 
s
o
u
r
c
e
s
 
c
a
n
 
i
n
c
l
u
d
e
 
d
a
t
a
b
a
s
e
s
,
d
a
t
a
 
w
a
r
e
h
o
u
s
e
s
,
 
t
h
e
 
W
e
b
,
 
o
t
h
e
r
 
i
n
f
o
r
m
a
t
i
o
n
r
e
p
o
s
i
t
o
r
i
e
s
,
 
o
r
 
d
a
t
a
 
t
h
a
t
 
a
r
e
 
s
t
r
e
a
m
e
d
 
i
n
t
o
t
h
e
 
s
y
s
t
e
m
 
d
y
n
a
m
i
c
a
l
l
y
.
D
a
t
a
 
m
i
n
i
n
g
 
a
l
s
o
 
s
a
i
d
 
t
o
 
b
e
 
k
n
o
w
l
e
d
g
e
m
i
n
i
n
g
 
f
o
r
m
 
d
a
t
a
 
o
r
 
k
n
o
w
l
e
d
g
e
 
d
i
s
c
o
v
e
r
y
f
r
o
m
 
d
a
t
a
T
h
e
 
k
n
o
w
l
e
d
g
e
 
d
i
s
c
o
v
e
r
y
 
p
r
o
c
e
s
s
 
i
n
v
o
l
v
e
s
a
n
 
i
t
e
r
a
t
i
v
e
 
s
e
q
u
e
n
c
e
T
h
e
 
E
x
p
l
o
s
i
v
e
 
G
r
o
w
t
h
 
o
f
 
D
a
t
a
F
r
o
m
 
t
e
r
a
b
y
t
e
s
 
(
1
0
0
0
4
 
t
o
 
y
o
t
t
a
b
y
t
e
d
s
 
(
1
0
0
0
8
)
S
c
i
e
n
c
e
B
i
o
i
n
f
o
r
m
a
t
i
c
s
S
c
i
e
n
t
i
f
i
c
 
s
t
i
m
u
l
a
t
i
o
n
M
e
d
i
c
a
l
 
r
e
s
e
a
r
c
h
W
i
t
h
 
t
h
e
 
r
i
s
e
 
o
f
 
h
i
g
h
-
t
h
r
o
u
g
h
p
u
t
 
(
H
T
P
)
t
e
c
h
n
o
l
o
g
i
e
s
 
i
n
 
t
h
e
 
l
i
f
e
 
s
c
i
e
n
c
e
s
,
p
a
r
t
i
c
u
l
a
r
l
y
 
i
n
 
m
o
l
e
c
u
l
a
r
 
b
i
o
l
o
g
y
,
 
t
h
e
a
m
o
u
n
t
 
o
f
 
c
o
l
l
e
c
t
e
d
 
d
a
t
a
 
h
a
s
 
g
r
o
w
n
 
i
n
 
a
n
e
x
p
o
n
e
n
t
i
a
l
 
f
a
s
h
i
o
n
 
D
a
t
a
 
r
i
c
h
 
b
u
t
 
i
n
f
o
r
m
a
t
i
o
n
 
p
o
o
r
W
h
a
t
 
d
o
e
s
 
t
h
o
s
e
 
d
a
t
a
 
m
e
a
n
?
H
o
w
 
t
o
 
a
n
a
l
y
s
i
s
 
d
a
t
a
?
D
a
t
a
 
m
i
n
i
n
g
 
 
A
u
t
o
m
a
t
e
d
 
a
n
a
l
y
s
i
s
 
o
f
 
m
a
s
s
i
v
e
d
a
t
a
 
s
e
t
s
1. Data cleaning 
(to remove noise and
inconsistent data)
2. Data integration 
(where multiple data
sources may be combined)
3. Data selection 
(where data relevant to the
analysis task are retrieved from the 
database)
4. Data transformation 
(where data are
transformed and consolidated into forms
appropriate for mining by performing
summary or aggregation operations)
5. Data mining 
(an essential process where
intelligent methods are applied to extract 
data
patterns)
 
6. Pattern evaluation 
(to identify the truly
interesting patterns representing knowledge
based)
7. Knowledge presentation 
(where
visualization and knowledge representation
techniques are used to present mined
knowledge to users)
S
t
e
p
s
 
1
 
t
h
r
o
u
g
h
 
4
 
a
r
e
 
d
i
f
f
e
r
e
n
t
 
f
o
r
m
s
 
o
f
 
d
a
t
a
p
r
e
-
p
r
o
c
e
s
s
i
n
g
,
 
w
h
e
r
e
 
d
a
t
a
 
a
r
e
 
p
r
e
p
a
r
e
d
 
f
o
r
m
i
n
i
n
g
T
h
e
 
d
a
t
a
 
m
i
n
i
n
g
 
s
t
e
p
 
m
a
y
 
i
n
t
e
r
a
c
t
 
w
i
t
h
 
t
h
e
u
s
e
r
 
o
r
 
a
 
k
n
o
w
l
e
d
g
e
 
b
a
s
e
T
h
e
 
i
n
t
e
r
e
s
t
i
n
g
 
p
a
t
t
e
r
n
s
 
a
r
e
 
p
r
e
s
e
n
t
e
d
 
t
o
 
t
h
e
u
s
e
r
 
a
n
d
 
m
a
y
 
b
e
 
s
t
o
r
e
d
 
a
s
 
n
e
w
 
k
n
o
w
l
e
d
g
e
 
i
n
t
h
e
 
k
n
o
w
l
e
d
g
e
 
b
a
s
e
.
Data Warehousing
 
Slide Note
Embed
Share

Data mining is the process of discovering patterns and knowledge from vast data sources such as databases, data warehouses, the web, and more. It involves knowledge discovery and iterative sequences, enabling automated analysis of massive datasets to extract valuable insights. The explosive growth of data presents challenges in understanding and analyzing information effectively. Steps like data cleaning, integration, selection, transformation, and mining are crucial for pattern evaluation and knowledge presentation.

  • Data Mining
  • Knowledge Discovery
  • Big Data
  • Data Analysis
  • Information Retrieval

Uploaded on Feb 19, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Data Mining

  2. Data mining is the process of discovering interesting patterns and knowledge from large amounts of data. The data sources can include databases, data warehouses, the Web, other information repositories, or data that are streamed into the system dynamically.

  3. Data mining also said to be knowledge mining form data or knowledge discovery from data The knowledge discovery process involves an iterative sequence

  4. The Explosive Growth of Data From terabytes (10004to yottabyteds (10008) Science Bioinformatics Scientific stimulation Medical research With the rise of high-throughput (HTP) technologies in particularly in molecular amount of collected data has grown in an exponential fashion the life sciences, biology, the

  5. Data rich but information poor What does those data mean? How to analysis data? Data mining Automated analysis of massive data sets

  6. 1. Data cleaning (to remove noise and inconsistent data) 2. Data integration (where multiple data sources may be combined) 3. Data selection (where data relevant to the analysis task are retrieved from the database)

  7. 4. Data transformation (where data are transformed and consolidated into forms appropriate for mining summary or aggregation operations) by performing 5. Data mining (an essential process where intelligent methods are applied to extract data patterns)

  8. 6. Pattern evaluation (to identify the truly interesting patterns representing knowledge based) 7. Knowledge presentation (where visualization and knowledge representation techniques are used knowledge to users) to present mined

  9. Steps 1 through 4 are different forms of data pre-processing, where data are prepared for mining The data mining step may interact with the user or a knowledge base The interesting patterns are presented to the user and may be stored as new knowledge in the knowledge base.

  10. Data Warehousing

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#