Statistics and Outliers in Data Analysis

 
STATISTICS AND
OUTLIERS
 
Aaron Saks
Process Advancement Leader
Boise, Inc
10/26/2011
Bio
 
Graduated in 2007 – ChemE and PSE
Started career with Boise Inc, Wallula WA
as a process engineer
October 2008 went to work for Envoy
Development
April 2010 returned to Boise as Process
Advancement Leader, Wallula WA
November 1
st
 2011 move to Boise, ID to
become Project Manager for Boise
Packaging
Statistics and Data
 
Why is it important?
Everything we really know, we know because of
data.
As a new engineer, we know how to apply
math and logic to solve problems.
We don’t really know how anything works.
By focusing on the 
data
 we can learn, solve
problems, and teach others.
There is a lot of data out there.
Statistics = The language of data.
Thinking Statistically
 
We can’t just think in terms of the “average”.
Need to think in terms of the 
distribution
 of data,
and the probability of events occurring.
What are good statistical tools for a new
engineer?
Six Sigma methods
Understanding the common probability distributions and
their mean and variance
Understanding histograms and pareto charts
Excel skills
 
Tear Strength
 
4.1% represents ~ 1640 tons below current spec
~40 tons (0.1%) rejected for below-spec MD tear
 
Current Target:
42
Ppkm: Run to Target and Reduce Variation
 
Historically, papermakers would run “in the
warning” all day, as long as the tests are within
specification limits.
This resulted in running off target, with different
means run to run (Poor Quality).
Needed a way to encourage ($$$) running to
targets, and reducing variation within the specs.
Created a variation on the classical Process
Capability metric “Cpk”.
Result 
 a financial incentive for improving
Quality.
P
pkm
 - Running to Target and Reducing Variation
 
P
p
k
m
 
i
s
 
a
n
 
e
x
a
m
p
l
e
 
o
f
 
a
p
r
o
c
e
s
s
 
p
e
r
f
o
r
m
a
n
c
e
m
e
t
r
i
c
.
P
pkm
 captures both deviation
from target and variation
within specification limits.
I
f
 
P
p
k
m
 
>
=
 
1
,
 
t
h
e
n
 
t
h
e
 
p
r
o
c
e
s
s
i
s
 
C
a
p
a
b
l
e
:
 
r
u
n
n
i
n
g
 
t
o
 
t
a
r
g
e
t
a
n
d
 
v
a
r
i
a
t
i
o
n
 
i
s
 
w
e
l
l
 
w
i
t
h
i
n
s
p
e
c
i
f
i
c
a
t
i
o
n
 
l
i
m
i
t
s
To improve Quality we award
operators for the number for
Key Product Properties that
have a Ppkm value above 1.0.
 
Ppkm = 1.02
 
Here is an example of a process that has a P
pkm
>1,
meaning it is fully capable of meeting customer
expectations.
Process is running to target – i.e. the mean is equal
to the target
Variation is well within the Specification (Red) limits.
 
8
P
p
k
m
 
=
 
.
6
5
 
9
 
Running to Target – Excessive variation
 
Acceptable variation – but off Target
Outliers
 
We spend most of our time working on the
outliers
:
Process problems
Lowering cost
Increasing production
Outliers exist in people too
Top 20
Middle 70
Bottom 10
The future leaders of tomorrow will be from the
top 20.
Thoughts for a New Engineer
 
Understand what is expected
Focus on independent learning
Always speak from facts and data
Effective communication creates results
Focus on business results and
accomplishments
You don’t have to be at the top to be a
leader
 
Questions:
 
What has been your biggest challenge?
If you could go back to school what
additional classes would you take?
What's a typical day like?
What kind of products does Boise make?
How long do projects last?
What ethical issues have you encountered?
 
Slide Note
Embed
Share

In the realm of engineering, data plays a crucial role in problem-solving and decision-making processes. Statistics offer a language through which engineers can interpret data, identify outliers, and make informed decisions to enhance process optimization and quality control. Explore the significance of statistical tools like Six Sigma methods, probability distributions, histograms, and pareto charts in empowering engineers to think more holistically about data analysis.

  • Statistics
  • Outliers
  • Data Analysis
  • Engineering
  • Process Optimization

Uploaded on Feb 19, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. STATISTICS AND OUTLIERS Aaron Saks Process Advancement Leader Boise, Inc 10/26/2011

  2. Bio Graduated in 2007 ChemE and PSE Started career with Boise Inc, Wallula WA as a process engineer October 2008 went to work for Envoy Development April 2010 returned to Boise as Process Advancement Leader, Wallula WA November 1st2011 move to Boise, ID to become Project Manager for Boise Packaging

  3. Statistics and Data Why is it important? Everything we really know, we know because of data. As a new engineer, we know how to apply math and logic to solve problems. We don t really know how anything works. By focusing on the data we can learn, solve problems, and teach others. There is a lot of data out there. Statistics = The language of data.

  4. Thinking Statistically We can t just think in terms of the average . Need to think in terms of the distribution of data, and the probability of events occurring. What are good statistical tools for a new engineer? Six Sigma methods Understanding the common probability distributions and their mean and variance Understanding histograms and pareto charts Excel skills

  5. Tear Strength 4.1% represents ~ 1640 tons below current spec Freq% Normdist% ~40 tons (0.1%) rejected for below-spec MD tear 30.0% Current Capability: 30.3 Current Spec: 32 Current Target: 42 25.0% 20.0% 15.0% % 4.1% 10.0% 0.34% 5.0% 0.0% 26.7 27.8 28.9 30.0 31.2 32.3 33.4 34.5 35.6 36.7 37.8 38.9 40.0 41.1 42.2 43.3

  6. Ppkm: Run to Target and Reduce Variation Historically, papermakers would run in the warning all day, as long as the tests are within specification limits. This resulted in running off target, with different means run to run (Poor Quality). Needed a way to encourage ($$$) running to targets, and reducing variation within the specs. Created a variation on the classical Process Capability metric Cpk . Result a financial incentive for improving Quality.

  7. Ppkm - Running to Target and Reducing Variation Ppkm is an example of a process performance metric. Ppkm captures both deviation from target and variation within specification limits. If Ppkm >= 1, then the process is Capable: running to target and variation is well within specification limits To improve Quality we award operators for the number for Key Product Properties that have a Ppkm value above 1.0. 30% 30% Deviation Deviation from Target from Target 25% 25% 20% 20% Upper Spec Limit Upper Spec Limit Lower Spec Limit Lower Spec Limit 15% 15% Average Average Target Target 10% 10% Out of Spec Out of Spec Paper Paper 5% 5% Variation Variation 0% 0% 30% 30% 30% 30% 30% 25% 25% 25% 25% 25% 20% 20% 20% 20% 20% Upper Spec Limit Upper Spec Limit Lower Spec Limit Lower Spec Limit 15% 15% 15% 15% 15% Average Average Target Target 10% 10% 10% 10% 10% 5% 5% 5% 5% 5% Variation Variation 0% 0% 0% 0% 0%

  8. 8 2.6 2.55 2.5 Caliper 2.45 TGT 2.4 LRL URL 2.35 2.3 Ppkm = 1.02 Here is an example of a process that has a Ppkm>1, meaning it is fully capable of meeting customer expectations. Process is running to target i.e. the mean is equal to the target Variation is well within the Specification (Red) limits.

  9. 9 2.6 2.55 2.5 Caliper 2.45 TGT 2.4 LRL URL 2.35 Running to Target Excessive variation 2.3 Ppkm = .65 2.6 2.55 2.5 Caliper 2.45 TGT 2.4 LRL URL 2.35 Acceptable variation but off Target 2.3

  10. Outliers We spend most of our time working on the outliers: Process problems Lowering cost Increasing production Outliers exist in people too Top 20 Middle 70 Bottom 10 The future leaders of tomorrow will be from the top 20.

  11. Thoughts for a New Engineer Understand what is expected Focus on independent learning Always speak from facts and data Effective communication creates results Focus on business results and accomplishments You don t have to be at the top to be a leader

  12. Questions: What has been your biggest challenge? If you could go back to school what additional classes would you take? What's a typical day like? What kind of products does Boise make? How long do projects last? What ethical issues have you encountered?

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#