Importance of Data and Statistics in Process Control Training

 
Statistical Process Control (SPC)
Training Guide
 
W
h
a
t
 
i
s
 
d
a
t
a
?
 
Data is factual information (as measurements or statistics)
used as a basic for reasoning, discussion or calculation. 
(Merriam-
Webster Dictionary, m-w.com)
What does this mean?
Data allows us to make educated decisions and take action!
 
W
h
y
 
i
s
 
D
a
t
a
 
I
m
p
o
r
t
a
n
t
?
 
If we don’t collect data about a process then what?
Without data we don’t understand the process / product
 
So what?
Without understanding of the process / product we can’t control the outcome
 
Is it important to control the outcome?
When you can’t control the outcome you are dependent on chance.
You may have a good outcome, you may not.
Without data collection you may not know either way.
 
U
s
e
 
o
f
 
D
a
t
a
 
I’m not collecting data because its non value add.
Without data collection there is no way to identify problems, continuously improve or
ensure you are meeting the voice of the customer.
I’m collecting data, but not looking at it. Is that okay?
No, the collection of data without analysis is a bigger waste than not collecting data
in the first place.
What should I be doing with the data?
DATA
Statistics
Information that can
be understood and
acted on.
 
W
h
a
t
 
a
r
e
 
S
t
a
t
i
s
t
i
c
s
 
A branch of mathematics dealing with the collection, analysis,
interpretation and presentation of masses of numerical data
(Merriam-Webster Dictionary, m-w.com)
What does this mean?
Once data is collected we can use appropriate statistical methods to
describe (understand) our process / product and control (predict) the
process / product outcome
 
S
t
a
t
i
s
t
i
c
s
 
A
l
l
o
w
 
u
s
 
t
o
 
Describe our process / product
 
 
 
 
 
Control (predict) process / product outcomes
 
Sample
(125
Pieces)
Population
(All Product)
 
I
m
p
o
r
t
a
n
t
 
N
o
t
e
s
 
1.
Statistical conclusions require useful data
We need to measure the right thing
Determining what data to collect and how to collect it are important steps in the APQP / continuous
improvement process
 
2.
We need to have confidence the data collected is accurate
A good measurement system is required to collect data
Be sure the measurement system analysis (MSA) is acceptable before collecting data
There is a separate training available for MSA
 
3.
Reduce / Eliminate Waste
Data that doesn’t provide useful information drives waste
We want to gain the most useful information from the least amount of data possible!
 
T
y
p
e
s
 
o
f
 
D
a
t
a
 
Variable Data (Continuous Data)
Measurements on a continuous scale
Examples
Product Dimensions
Weight
Time
Cost
Process parameters (cutting speed, injection pressure, etc.)
 
Attribute Data (Discrete Data)
Data by counting
Examples
Count of defective parts from production
Number of chips on a painted part
 
T
y
p
e
s
 
o
f
 
A
t
t
r
i
b
u
t
e
 
D
a
t
a
 
Binomial Distribution
Pass / Fail (PPM)
Number of defective bezels
Number of defective castings
 
Poisson Distribution
Number of defects (DPMO)
Number of defects 
per
 bezel
Number of defects 
per
 casting
 
W
h
i
c
h
 
T
y
p
e
 
o
f
 
D
a
t
a
 
i
s
 
B
e
t
t
e
r
?
 
Variable Data
 
Pros:
Provides useful information with smaller sample sizes
Can identify common cause concerns at low defect
rates
Can be used to predict product / process outcomes
(trends)
Very useful for continuous improvement activities
(DOE, Regression analysis, etc.)
 
Cons:
Data collection can be more difficult, requiring specific
gauges or measurement methods
Analysis of data requires some knowledge of
statistical methods (Control charting, Regression
analysis, etc.)
 
Attribute Data
 
Pros:
Very easy to obtain
Calculations are simple
Data is usually readily available
Good for metrics reporting / management review
Good for baseline performance
 
Cons:
Data collection can be more difficult, requiring specific
gauges or measurement methods
Analysis of data requires some knowledge of
statistical methods (Control charting, Regression
analysis, etc.)
 
I
n
t
r
o
d
u
c
t
i
o
n
 
t
o
 
S
t
a
t
i
s
t
i
c
s
 
Goal:
1.
Define basic statistical tools
2.
Define types of distributions
3.
Understand the Central Limit Theorem
4.
Understand normal distributions
 
D
e
f
i
n
i
t
i
o
n
s
 
Population: A group of 
all
 possible objects
 
Subgroup (Sample): One or more observations used to analyze the performance of a
process.
 
Distribution: A method of describing the output of a stable source of variation, where
individual values as a group form a pattern that can be described in terms of its
location, spread and shape.
 
M
e
a
s
u
r
e
s
 
o
f
 
D
a
t
a
 
D
i
s
t
r
i
b
u
t
i
o
n
 
Measures of Location
Mean (Average)
The sum of all data values divided by the number of data points
Mode
The most frequently occurring number in a data set
Median (Midpoint)
 
Measures of Spread
Range
The difference between the largest and smallest values of a data set
Standard Deviation
The square root of the squared distances of each data point from the mean, divided by the sample
size. Also known as Sigma (σ)
 
D
i
s
t
r
i
b
u
t
i
o
n
 
T
y
p
e
s
 
Discrete
Binomial
Poisson
Continuous
Normal
Exponential
Weibull
Uniform
 
C
e
n
t
r
a
l
 
L
i
m
i
t
 
T
h
e
o
r
e
m
 
C
L
T
 
P
r
o
p
e
r
t
y
 
1
 
The distribution of the 
sample means
of any population will approximate a
normal distribution as the sample
size increases, even if the population
is non-normal.
 
Because of this property control
charts (for averages) is based on the
normal distribution.
 
Population
 
Population
 
Population
 
Population
 
Distribution of sample
means from any population
T
h
e
 
N
o
r
m
a
l
 
D
i
s
t
r
i
b
u
t
i
o
n
 
µ ± 1σ
 
=
 
68.3%
µ ± 2σ
 
=
 
95.4%
µ ± 3σ
 
=
 
99.7%
 
 2%   14%  34%  34%  14%   2%
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
Goal
Understand process capability and specification limits
Understand the procedure of calculating process capability
Understand C
p
, C
pk
, P
p
 and P
pk
 indices
Estimating percentage of process beyond specification limits
Understanding non-normal data
Example capability calculation
 
S
p
e
c
i
f
i
c
a
t
i
o
n
 
L
i
m
i
t
s
 
Individual features
(dimensions) on a product are assigned
specification limits.
 
How do we determine a process is able to
produce a part that meets specification limits?
 
Process Capability!
 
L
S
L
 
U
S
L
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
a
n
d
 
S
p
e
c
i
f
i
c
a
t
i
o
n
 
L
i
m
i
t
s
 
Process capability is the ability of a process to meet customer requirements.
 
L
S
L
 
U
S
L
 
Acceptable capability
 
L
S
L
 
U
S
L
 
Product outside of spec limits
 
C
a
l
c
u
l
a
t
i
n
g
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
D
e
t
e
r
m
i
n
e
 
S
t
a
b
i
l
i
t
y
 
All sample
means and
ranges and in
control and do
not indicate
obvious trends
 
D
e
t
e
r
m
i
n
e
 
N
o
r
m
a
l
i
t
y
 
Placing all data in a
histogram 
may
 be used
to help determine
normality.  If the data
represents a normal
curve.
 
D
e
t
e
r
m
i
n
e
 
N
o
r
m
a
l
i
t
y
 
A more statistical method is to use the Anderson Darling test for
normality
In Minitab go to:
Stat > Basic Statistics > Normality Test
Select Anderson Darling and click ok
 
I
n
t
e
r
p
r
e
t
i
n
g
 
N
o
r
m
a
l
i
t
y
 
The p-Value must be greater than or
equal to 0.05 to match the normal
distribution.
 
P
r
o
c
e
s
s
 
I
n
d
i
c
e
s
 
Indices of process variation only, in regard to specification;
C
p
 and P
p
Indices of process variation and centering combined, in regard to
specification;
C
pk
 and P
pk
 
N
o
t
e
 
 
Before calculating capability or performance indices we need to make sure
of a couple of things!
1.
The process needs to be stable (in control)
2.
The process needs to be normal
3.
A completed MSA needs to prove the measurement system is
acceptable
 
If the above items are not met understand the results of the capability studies may be inaccurate.
Also, per the AIAG PPAP manual (4
th
 Edition) if the above items are not met corrective actions may be
required prior to PPAP submittal.
 
C
p
 
O
v
e
r
v
i
e
w
 
T
h
i
s
 
i
n
d
e
x
 
i
n
d
i
c
a
t
e
s
 
p
o
t
e
n
t
i
a
l
 
p
r
o
c
e
s
s
 
c
a
p
a
b
i
l
i
t
y
.
Cp is not impacted by the process location and can only be calculated
for bilateral tolerance.
 
C
p
 
C
a
l
c
u
l
a
t
i
o
n
 
I
n
t
e
r
p
r
e
t
i
n
g
 
C
p
<1
1
2
 
Cp = Number of times the car
(distribution) fits in the garage
(specification limit)
 
L
S
L
 
U
S
L
 
Garage Door Width
 
Car
 
C
p
k
 
O
v
e
r
v
i
e
w
 
C
pk
 is a capability index. It takes the process location 
and
 the capability into
account.
C
pk
 can be calculated for both single sided (unilateral) and two sided (bilateral)
tolerances.
For bilateral tolerances C
pk
 is the minimum of CPU and CPL where:
 
C
p
k
 
E
x
a
m
p
l
e
 
= 1.372
 
C
p
 
/
 
C
p
k
 
R
e
v
i
e
w
 
C
p
 
i
n
d
i
c
a
t
e
s
 
h
o
w
 
m
a
n
y
 
p
r
o
c
e
s
s
 
d
i
s
t
r
i
b
u
t
i
o
n
 
w
i
d
t
h
s
 
c
a
n
 
f
i
t
 
w
i
t
h
i
n
s
p
e
c
i
f
i
c
a
t
i
o
n
 
l
i
m
i
t
s
.
 
 
I
t
 
d
o
e
s
 
n
o
t
 
c
o
n
s
i
d
e
r
 
p
r
o
c
e
s
s
 
l
o
c
a
t
i
o
n
.
 
 
B
e
c
a
u
s
e
o
f
 
t
h
i
s
 
i
t
 
o
n
l
y
 
i
n
d
i
c
a
t
e
s
 
p
o
t
e
n
t
i
a
l
 
p
r
o
c
e
s
s
 
c
a
p
a
b
i
l
i
t
y
,
 
n
o
t
 
a
c
t
u
a
l
p
r
o
c
e
s
s
 
c
a
p
a
b
i
l
i
t
y
.
 
C
p
k
 
i
n
d
i
c
a
t
e
s
 
a
c
t
u
a
l
 
p
r
o
c
e
s
s
 
c
a
p
a
b
i
l
i
t
y
,
 
t
a
k
i
n
g
 
i
n
t
o
 
a
c
c
o
u
n
t
 
b
o
t
h
p
r
o
c
e
s
s
 
l
o
c
a
t
i
o
n
 
a
n
d
 
t
h
e
 
w
i
d
t
h
 
o
f
 
t
h
e
 
p
r
o
c
e
s
s
 
d
i
s
t
r
i
b
u
t
i
o
n
.
 
P
p
 
O
v
e
r
v
i
e
w
 
T
h
i
s
 
i
n
d
e
x
 
i
n
d
i
c
a
t
e
s
 
p
o
t
e
n
t
i
a
l
 
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
.
 
 
I
t
 
c
o
m
p
a
r
e
s
t
h
e
 
m
a
x
i
m
u
m
 
a
l
l
o
w
a
b
l
e
 
v
a
r
i
a
t
i
o
n
 
a
s
 
i
n
d
i
c
a
t
e
d
 
b
y
 
t
o
l
e
r
a
n
c
e
 
t
o
 
t
h
e
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
 
P
p
 is not impacted by the process location and can only be
calculated for bilateral tolerance.
 
P
p
 must be used when reporting capability for PPAP
 
Where:
σ     = Standard deviation
USL = Upper specification limit
LSL = Lower specification limit
 
P
p
 
a
n
d
 
C
p
 
P
p
 
C
p
 takes into account within subgroup variation (average range)
P
p
 takes into account between subgroup variation
 
P
p
k
 
O
v
e
r
v
i
e
w
 
P
pk
 is a performance index.  It indicates actual process
performance, taking into account both process location and
overall process variation.
 
P
pk
 shows if a process is actually meeting customer
requirements.
 
P
pk
 can be used for both unilateral and bilateral tolerances.
 
P
pk
 must be used when reporting capability for PPAP
 
P
p
k
 
C
a
l
c
u
l
a
t
i
o
n
 
Ppk is the minimum of PPU and PPL where:
 
P
p
 
/
 
P
p
k
 
R
e
v
i
e
w
 
P
p
 
i
n
d
i
c
a
t
e
s
 
h
o
w
 
m
a
n
y
 
p
r
o
c
e
s
s
 
d
i
s
t
r
i
b
u
t
i
o
n
 
w
i
d
t
h
s
 
c
a
n
 
f
i
t
 
w
i
t
h
i
n
s
p
e
c
i
f
i
c
a
t
i
o
n
 
l
i
m
i
t
s
.
 
 
I
t
 
d
o
e
s
 
n
o
t
 
c
o
n
s
i
d
e
r
 
p
r
o
c
e
s
s
 
l
o
c
a
t
i
o
n
.
 
 
B
e
c
a
u
s
e
 
o
f
t
h
i
s
 
i
t
 
o
n
l
y
 
i
n
d
i
c
a
t
e
s
 
p
o
t
e
n
t
i
a
l
 
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
,
 
n
o
t
 
a
c
t
u
a
l
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
.
 
P
p
k
 
i
n
d
i
c
a
t
e
s
 
a
c
t
u
a
l
 
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
,
 
t
a
k
i
n
g
 
i
n
t
o
 
a
c
c
o
u
n
t
 
b
o
t
h
p
r
o
c
e
s
s
 
l
o
c
a
t
i
o
n
 
a
n
d
 
t
h
e
 
w
i
d
t
h
 
o
f
 
t
h
e
 
p
r
o
c
e
s
s
 
d
i
s
t
r
i
b
u
t
i
o
n
.
 
A
 
F
e
w
 
N
o
t
e
s
 
C
p
k
 
a
n
d
 
P
p
k
 
C
o
m
p
a
r
e
d
 
These two data sets contain the
same data.
 
The top data set is from an
immature process that contains
special cause variation.
 
The bottom data set has the same
within group variation, but has
between group variation removed.
 
The bottom data set shows a
process that is in statistical control.
 
This example may be found in the AIAG SPC (Second
Edition) Manual on page 136
P
r
o
c
e
s
s
 
I
n
d
i
c
e
s
 
R
e
v
i
e
w
Process Indices
1.
Cp = 4.01, Cpk = 4.01
Pp = 4.00 , Ppk = 3.99
2.
Cp = 0.27, Cpk = 0.26
Pp = 0.25, Ppk = 0.24
3.
Cp = 4.01, Cpk = -2.00
Pp = 3.99, Ppk = -2.01
4.
Cp = 4.01, Cpk = 4.00
Pp = 2.01, Ppk = 2.00
Interpretation
 
1.
This process is stable and
produces almost all parts
within specification
2.
Significant common cause
variation exists
3.
Significant special cause
variation exists
4.
Improvement can be made
by centering the process
E
s
t
i
m
a
t
i
n
g
 
P
e
r
c
e
n
t
 
O
u
t
 
o
f
 
S
p
e
c
i
f
i
c
a
t
i
o
n
To estimate the percentage of
product that falls outside of the
specification limits we need to
compute Z
upper
 and Z
lower
For this example assume an
average range of 8.4 from a
stable process using a sample
size of 5
USL = 182
LSL = 160
Z
upper
 is the number of
standard deviations
between the average
and the USL
Z
lower
 is the number of
standard deviations
between the average
and the LSL
189.4
178.6
167.8
E
s
t
i
m
a
t
i
n
g
 
P
e
r
c
e
n
t
 
O
u
t
 
o
f
 
S
p
e
c
i
f
i
c
a
t
i
o
n
USL = 182
LSL = 160
189.4
178.6
167.8
E
s
t
i
m
a
t
i
n
g
 
P
e
r
c
e
n
t
 
o
u
t
 
o
f
 
S
p
e
c
i
f
i
c
a
t
i
o
n
USL = 182
LSL = 160
Z
upper
 = 0.94
189.4
178.6
167.8
 
Next, we need to reference a z table.
From the z table we find 0.94, which
corresponds to a proportion of 0.1736
 
This convert to 17.36% defective or
173,600 PPM
 
U
n
d
e
r
s
t
a
n
d
i
n
g
 
N
o
n
-
N
o
r
m
a
l
 
D
a
t
a
 
What happens when data used for capability is not normally distributed?
 
C
p
 and P
p
 indices are robust in their accuracy with regards to non-normal
data.
 
C
pk
 and P
pk
 indices are 
not
 robust with regards to non-normal data.
 
Calculating (and making decisions based on) P
pk
 and C
pk
 indices assuming
normality with non-normal data can be misleading.
 
U
n
d
e
r
s
t
a
n
d
i
n
g
 
N
o
n
-
N
o
r
m
a
l
 
D
a
t
a
 
What do I do if my data is not normal?
Gather more data
This may or may not be an option given timing and cost.
The Central Limit Theorem (covered earlier) states that 
all
 population means resemble the
normal distribution with larger sample size
 
Transform the data
Using either the Box-Cox or Johnson transformations we can transform non-normal data to a
near normal form
This allows use to accurately calculate Cpk and Ppk indices and the proportion nonconforming
 
Calculate capabilities based on different distributions
 
B
o
x
-
C
o
x
 
T
r
a
n
s
f
o
r
m
a
t
i
o
n
 
B
o
x
-
C
o
x
 
i
n
 
P
r
a
c
t
i
c
e
 
The Box-Cox transformation can be used in Minitab’s capability analysis of normal data.
When running a capability study in Minitab
 select:
Stat > Quality Tools > Capability Sixpack > Normal
Then select “Transform” and check “Box-Cox power Transformation”
 
J
o
h
n
s
o
n
 
T
r
a
n
s
f
o
r
m
a
t
i
o
n
 
The Johnson Transformation uses a system of transformations that yield
approximate normality.
The Johnson Transformation covers:
Bounded data
Log normal data
Unbounded data
This system of transformations cover all possible unimodal distribution
forms (skewness-kurtosis)
This transformation can also be ran in Minitab
 
J
o
h
n
s
o
n
 
T
r
a
n
s
f
o
r
m
a
t
i
o
n
 
i
n
 
P
r
a
c
t
i
c
e
 
The Johnson Transformation can be used in Minitab’s capability analysis of
normal data.
When running a capability study in Minitab
 select:
Stat > Quality Tools > Capability Sixpack > Normal
Then select “Transform” and check “Box-Cox power Transformation”
 
E
x
a
m
p
l
e
 
P
r
o
c
e
s
s
 
C
a
p
a
b
i
l
i
t
y
 
Using the data below captured from an early pre-
production run calculate initial process capability.
 
The process specification is 0.642 +0.001 / -0.002
30 data points were collected (subgroup of 1)
 
I
s
 
t
h
e
 
D
a
t
a
 
S
t
a
b
l
e
?
 
I
s
 
t
h
e
 
D
a
t
a
 
N
o
r
m
a
l
?
 
C
a
l
c
u
l
a
t
e
 
C
a
p
a
b
i
l
i
t
y
 
A
n
a
l
y
z
e
 
R
e
s
u
l
t
s
 
Process Stable?
Yes!
 
 
 
Data set normal?
Yes!
 
 
Capability
Acceptable?
Yes!
 
S
t
a
t
i
s
t
i
c
a
l
 
P
r
o
c
e
s
s
 
C
o
n
t
r
o
l
 
(
S
P
C
)
 
S
P
C
S
T
A
T
I
S
T
I
C
A
L
 
P
R
O
C
E
S
S
 
c
o
n
t
r
o
l
The process talks
through the
control chart
The process
responds
The organization
Listens and reacts
 
-Root cause
-Learning (Understanding)
-Corrective Action
-Share knowledge
W
h
y
 
d
o
 
w
e
 
n
e
e
d
 
p
r
o
c
e
s
s
 
c
o
n
t
r
o
l
?
 
Why prevention instead of detection?
 
Detection tolerates waste, Prevention avoids it.
 
V
a
r
i
a
t
i
o
n
 
There are two types of variation
1.
Common Cause
The many factors that result in variation that is consistently acting on a process.  If only
common causes are present in a process it is considered stable and in control.  If only
common cause variation is present in a process the process outcome is predictable.
 
2.
Special Cause
Also known as assignable causes, special causes are factors that result in variation that only affect
some of the process output.  Special cause variation results in one or more points outside of
controls limits and / or non-random patterns of points within control limits. Special cause
variation is not predictable and, if present in a process, results in instability and out of control
conditions.
C
o
m
m
o
n
 
C
a
u
s
e
 
V
a
r
i
a
t
i
o
n
 
Common cause variation can result in,
for example, slight variation in the size of
a bolt (exaggerated visual)
 
If only common cause
variation is present in a
process it will result in an
output that is stable over
time and predictable
Slide Note
Embed
Share

Data is crucial in making informed decisions and taking action. Without data collection, understanding processes and products becomes difficult, leading to a lack of control over outcomes. Proper data analysis is essential for continuous improvement and meeting customer needs. Statistics play a vital role in describing and predicting process/product outcomes, emphasizing the importance of data accuracy and measurement systems. Utilizing statistical methods enables effective process control and quality management.

  • Data
  • Statistics
  • Process Control
  • Training
  • Quality Management

Uploaded on Jul 02, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Statistical Process Control (SPC) Training Guide GHSP Confidential

  2. What is data? Data is factual information (as measurements or statistics) used as a basic for reasoning, discussion or calculation. (Merriam- Webster Dictionary, m-w.com) What does this mean? Data allows us to make educated decisions and take action! GHSP Confidential

  3. Why is Data Important? If we don t collect data about a process then what? Without data we don t understand the process / product So what? Without understanding of the process / product we can t control the outcome Is it important to control the outcome? When you can t control the outcome you are dependent on chance. You may have a good outcome, you may not. Without data collection you may not know either way. GHSP Confidential

  4. Use of Data I m not collecting data because its non value add. Without data collection there is no way to identify problems, continuously improve or ensure you are meeting the voice of the customer. I m collecting data, but not looking at it. Is that okay? No, the collection of data without analysis is a bigger waste than not collecting data in the first place. What should I be doing with the data? GHSP Confidential

  5. DATA Statistics Information that can be understood and acted on. GHSP Confidential

  6. What are Statistics A branch of mathematics dealing with the collection, analysis, interpretation and presentation of masses of numerical data (Merriam-Webster Dictionary, m-w.com) What does this mean? Once data is collected we can use appropriate statistical methods to describe (understand) our process / product and control (predict) the process / product outcome GHSP Confidential

  7. Statistics Allow us to Describe our process / product Control (predict) process / product outcomes Sample (125 Pieces) Population (All Product) GHSP Confidential

  8. Important Notes 1. Statistical conclusions require useful data We need to measure the right thing Determining what data to collect and how to collect it are important steps in the APQP / continuous improvement process 2. We need to have confidence the data collected is accurate A good measurement system is required to collect data Be sure the measurement system analysis (MSA) is acceptable before collecting data There is a separate training available for MSA 3. Reduce / Eliminate Waste Data that doesn t provide useful information drives waste We want to gain the most useful information from the least amount of data possible! GHSP Confidential

  9. Types of Data Variable Data (Continuous Data) Measurements on a continuous scale Examples Product Dimensions Weight Time Cost Process parameters (cutting speed, injection pressure, etc.) Attribute Data (Discrete Data) Data by counting Examples Count of defective parts from production Number of chips on a painted part GHSP Confidential

  10. Types of Attribute Data Binomial Distribution Pass / Fail (PPM) Number of defective bezels Number of defective castings Poisson Distribution Number of defects (DPMO) Number of defects per bezel Number of defects per casting GHSP Confidential

  11. Which Type of Data is Better? Variable Data Attribute Data Pros: Provides useful information with smaller sample sizes Can identify common cause concerns at low defect rates Can be used to predict product / process outcomes (trends) Very useful for continuous improvement activities (DOE, Regression analysis, etc.) Pros: Very easy to obtain Calculations are simple Data is usually readily available Good for metrics reporting / management review Good for baseline performance Cons: Data collection can be more difficult, requiring specific gauges or measurement methods Analysis of data requires some knowledge of statistical methods (Control charting, Regression analysis, etc.) Cons: Data collection can be more difficult, requiring specific gauges or measurement methods Analysis of data requires some knowledge of statistical methods (Control charting, Regression analysis, etc.) GHSP Confidential

  12. Introduction to Statistics Goal: 1. Define basic statistical tools 2. Define types of distributions 3. Understand the Central Limit Theorem 4. Understand normal distributions GHSP Confidential

  13. Definitions Population: A group of all possible objects Subgroup (Sample): One or more observations used to analyze the performance of a process. Distribution: A method of describing the output of a stable source of variation, where individual values as a group form a pattern that can be described in terms of its location, spread and shape. GHSP Confidential

  14. Measures of Data Distribution Measures of Location Mean (Average) The sum of all data values divided by the number of data points Mode The most frequently occurring number in a data set Median (Midpoint) Measures of Spread Range The difference between the largest and smallest values of a data set Standard Deviation The square root of the squared distances of each data point from the mean, divided by the sample size. Also known as Sigma ( ) GHSP Confidential

  15. Distribution Types Discrete Binomial Poisson Continuous Normal Exponential Weibull Uniform GHSP Confidential

  16. Central Limit Theorem The Central Limit Theorem is the basis for sampling and control charting (of averages). There are 3 properties associated with the CLT; 1. The distribution of the sample means will approximate a normal distribution as the sample size increases, even if the population is non-normal 2. The average of the sample means will be the same as the population mean 3. The distribution of the sample means will be narrower than the distribution of the individuals by a factor of 1 ?, where n is the sample size. GHSP Confidential

  17. CLT Property 1 The distribution of the sample means of any population will approximate a normal distribution as the sample size increases, even if the population is non-normal. Population Population Population Population Because of this property control charts (for averages) is based on the normal distribution. Distribution of sample means from any population GHSP Confidential

  18. The Normal Distribution 50% 50% 1 = 2 = 3 = 68.3% 95.4% 99.7% 2% 14% 34% 34% 14% 2% -3 -2 -1 0 +1 +2 +3 GHSP Confidential

  19. Process Capability Process Capability GHSP Confidential

  20. Process Capability Goal Understand process capability and specification limits Understand the procedure of calculating process capability Understand Cp, Cpk, Pp and Ppk indices Estimating percentage of process beyond specification limits Understanding non-normal data Example capability calculation GHSP Confidential

  21. Specification Limits Individual features (dimensions) on a product are assigned specification limits. How do we determine a process is able to produce a part that meets specification limits? Process Capability! LSL USL GHSP Confidential

  22. Process Capability and Specification Limits Process capability is the ability of a process to meet customer requirements. USL LSL USL LSL Acceptable capability Product outside of spec limits GHSP Confidential

  23. Calculating Process Capability GHSP Confidential

  24. Determine Stability All sample means and ranges and in control and do not indicate obvious trends GHSP Confidential

  25. Determine Normality Placing all data in a histogram may be used to help determine normality. If the data represents a normal curve. GHSP Confidential

  26. Determine Normality A more statistical method is to use the Anderson Darling test for normality In Minitab go to: Stat > Basic Statistics > Normality Test Select Anderson Darling and click ok GHSP Confidential

  27. Interpreting Normality The p-Value must be greater than or equal to 0.05 to match the normal distribution. GHSP Confidential

  28. Process Indices Indices of process variation only, in regard to specification; Cp and Pp Indices of process variation and centering combined, in regard to specification; Cpk and Ppk GHSP Confidential

  29. Note Before calculating capability or performance indices we need to make sure of a couple of things! 1. The process needs to be stable (in control) 2. The process needs to be normal 3. A completed MSA needs to prove the measurement system is acceptable If the above items are not met understand the results of the capability studies may be inaccurate. Also, per the AIAG PPAP manual (4th Edition) if the above items are not met corrective actions may be required prior to PPAP submittal. GHSP Confidential

  30. Cp Overview ???? ?? ????????????=????? ?? ???????? ????????? Cp = Potential Process Capability = ????? ?? ??????? This index indicates potential process capability. Cp is not impacted by the process location and can only be calculated for bilateral tolerance. GHSP Confidential

  31. Cp Calculation Subgroup Size d2 Cp = ??? ??? =??? ??? 6 2 1.128 6?? ? ?2 3 1.693 4 2.059 5 2.326 Where: 6 2.534 USL = Upper specification limit LSL = Lower specification limit ? = Average Range d2 = a constant value based on subgroup sample size 7 2.704 8 2.847 9 2.970 10 3.078 GHSP Confidential

  32. Interpreting Cp Garage Door Width <1 Car 1 2 LSL USL Cp = Number of times the car (distribution) fits in the garage (specification limit) GHSP Confidential

  33. Cpk Overview Cpk is a capability index. It takes the process location and the capability into account. Cpk can be calculated for both single sided (unilateral) and two sided (bilateral) tolerances. For bilateral tolerances Cpk is the minimum of CPU and CPL where: CPU = ??? ? 3?? 3 ?2 Where: ? = Process average USL = Upper specification limit LSL = Lower specification limit ? = Average Range d2 = a constant value based on subgroup sample size ? ?2 is an estimate of the standard deviation ??? ? ? ? ??? 3?? ? ??? ? ?2 and CPL = = = 3 Note that GHSP Confidential

  34. Cpk Example GHSP is provided a machined casting with a hole diameter of 16.5 1.0mm. Subgroup Size d2 2 1.128 3 1.693 The supplier has collected 25 subgroups of 5 measurements and wants to determine the process capability. With a subgroup size of 5 d2 is 2.326 (table to right). 4 2.059 5 2.326 6 2.534 If the process average ( ?) is 16.507 and the average range ( ?) is 0.561, then the Cpk is calculated as follows: 7 2.704 8 2.847 17.5 16.507 0.561 2.326 16.507 15.5 0.561 2.326 = Minimum CPU = , CPL = 9 2.970 3 3 10 3.078 = Minimum CPU = 1.372 , CPL =1.392 = 1.372 GHSP Confidential

  35. Cp / Cpk Review Cp indicates how many process distribution widths can fit within specification limits. It does not consider process location. Because of this it only indicates potential process capability, not actual process capability. Cpk indicates actual process capability, taking into account both process location and the width of the process distribution. GHSP Confidential

  36. Pp Overview This index indicates potential process performance. It compares the maximum allowable variation as indicated by tolerance to the process performance Pp is not impacted by the process location and can only be calculated for bilateral tolerance. Pp must be used when reporting capability for PPAP Where: = Standard deviation USL = Upper specification limit LSL = Lower specification limit Pp = ??? ??? 6?? GHSP Confidential

  37. Pp and Cp Pp Cp takes into account within subgroup variation (average range) Pp takes into account between subgroup variation GHSP Confidential

  38. Ppk Overview Ppk is a performance index. It indicates actual process performance, taking into account both process location and overall process variation. Ppk shows if a process is actually meeting customer requirements. Ppk can be used for both unilateral and bilateral tolerances. Ppk must be used when reporting capability for PPAP GHSP Confidential

  39. Ppk Calculation Ppk is the minimum of PPU and PPL where: PPU = ??? ? ? ??? 3?? and PPL = 3?? Where: ? = Process average USL = Upper specification limit LSL = Lower specification limit = Standard deviation GHSP Confidential

  40. Pp / Ppk Review Pp indicates how many process distribution widths can fit within specification limits. It does not consider process location. Because of this it only indicates potential process performance, not actual process performance. Ppk indicates actual process performance, taking into account both process location and the width of the process distribution. GHSP Confidential

  41. A Few Notes Cpk will always be smaller than Cp, unless the process is centered. If the process is centered the two value will be the same. Ppk will always be smaller than Pp, unless the process is centered. If the process is centered the two values will be the same. Cp and Pp cannot be calculated for unilateral tolerances. Cpk and Ppk can be calculated for both unilateral and bilateral tolerances. Often, short term capability will be calculated using Cp and Cpk. This is because these indices use calculation tells us how good a process could potentially be at its best performance. Pp and Ppk uses the actual (overall or between subgroup) standard deviation to calculate performance. As such, when reporting capability use Pp and Ppk. ? ?2 to estimate standard deviation, which is a calculation of within subgroup variation. This GHSP Confidential

  42. Cpk and Ppk Compared These two data sets contain the same data. The top data set is from an immature process that contains special cause variation. The bottom data set has the same within group variation, but has between group variation removed. The bottom data set shows a process that is in statistical control. This example may be found in the AIAG SPC (Second Edition) Manual on page 136 GHSP Confidential

  43. Process Indices Review Process Indices Interpretation 1. Cp = 4.01, Cpk = 4.01 Pp = 4.00 , Ppk = 3.99 1. This process is stable and produces almost all parts within specification 2. Significant common cause variation exists 3. Significant special cause variation exists 4. Improvement can be made by centering the process 2. Cp = 0.27, Cpk = 0.26 Pp = 0.25, Ppk = 0.24 3. Cp = 4.01, Cpk = -2.00 Pp = 3.99, Ppk = -2.01 4. Cp = 4.01, Cpk = 4.00 Pp = 2.01, Ppk = 2.00 GHSP Confidential

  44. Estimating Percent Out of Specification To estimate the percentage of product that falls outside of the specification limits we need to compute Zupper and Zlower USL = 182 LSL = 160 For this example assume an average range of 8.4 from a stable process using a sample size of 5 178.6 167.8 189.4 Zlower is the number of standard deviations between the average and the LSL Zupper is the number of standard deviations between the average and the USL GHSP Confidential

  45. Estimating Percent Out of Specification ? ?2 = 8.4 2.326 = 3.6 = USL = 182 LSL = 160 Zupper = ??? ? ? Zupper = 182.0 178.6 = 0.94 3.6 178.6 167.8 189.4 ? ??? ? Zlower = The USL is 0.94 from ? The LSL is 5.17 from ? Zlower = 178.6 160.0 = 5.17 3.6 GHSP Confidential

  46. Estimating Percent out of Specification Next, we need to reference a z table. From the z table we find 0.94, which corresponds to a proportion of 0.1736 USL = 182 LSL = 160 This convert to 17.36% defective or 173,600 PPM 178.6 167.8 189.4 Zupper = 0.94 GHSP Confidential

  47. Understanding Non-Normal Data What happens when data used for capability is not normally distributed? Cp and Pp indices are robust in their accuracy with regards to non-normal data. Cpk and Ppk indices are not robust with regards to non-normal data. Calculating (and making decisions based on) Ppk and Cpk indices assuming normality with non-normal data can be misleading. GHSP Confidential

  48. Understanding Non-Normal Data What do I do if my data is not normal? Gather more data This may or may not be an option given timing and cost. The Central Limit Theorem (covered earlier) states that all population means resemble the normal distribution with larger sample size Transform the data Using either the Box-Cox or Johnson transformations we can transform non-normal data to a near normal form This allows use to accurately calculate Cpk and Ppk indices and the proportion nonconforming Calculate capabilities based on different distributions GHSP Confidential

  49. Box-Cox Transformation The goal of the Box-Cox transformation is to identify a Lambda value ( ). The Lambda value will then be used in the function X to transform the data (X) from a non-normal set into a normal data set. The formula for this transformation is W=X where: -5 5 and = 0 for the natural log transformation = 0.5 for the square root transformation GHSP Confidential

  50. Box-Cox in Practice The Box-Cox transformation can be used in Minitab s capability analysis of normal data. When running a capability study in Minitab select: Stat > Quality Tools > Capability Sixpack > Normal Then select Transform and check Box-Cox power Transformation GHSP Confidential

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#