Earthquake Prediction Using Acoustic Data

QuakeML
Stanley Dillon Hicks, Meher Akhil
Birlangi, Jun Hao (Simon) Hu
Background
Earthquakes can cause billions of dollars in damage and cause large losses of life
Earthquakes also devastate local infrastructure and in rare and unfortunate cases
can cause land to be uninhabitable (Fukushima)
Thus, it is desirable to be able to predict when Earthquakes occur, and prepare for
them.
Background
Predicting earthquakes from seismographic acoustic data can allow for just-in-time
systems to react and mitigate damage in the crucial seconds before an earthquake
strikes
The LANL Dataset provides acoustic data that can be used to predict earthquakes
Main Reference: Leduc, Hulbert, et al.
Los Alamos National Laboratory (LANL)
Simulated earthquakes in the lab and measured acoustic signal and time-to-
failure.
Used a Random Forests model to predict time-to-failure.
Features used:
Variance of acoustic signal
Kurtosis of acoustic signal (skewness of the distribution)
Mean of acoustic signal, rolling window (100, 1000, 10000).
How Machine Learning Can Be Applied
The real world physical model for earthquakes is incomplete
Modern machine learning methods show lots of promise on time series data
Models with fast inference times may provide actionable data faster
Dataset Details
Dataset: LANL Earthquake Prediction (Kaggle) 
A
c
o
u
s
t
i
c
 
D
a
t
a
A
c
o
u
s
t
i
c
 
D
a
t
a
 
o
f
 
l
a
b
o
r
a
t
o
r
y
e
a
r
t
h
q
u
a
k
e
s
 
r
e
c
o
r
d
e
d
 
f
r
o
m
 
a
p
i
e
z
o
c
e
r
a
m
i
c
 
s
e
n
s
o
r
4
1
9
4
 
S
e
g
m
e
n
t
s
 
c
o
n
t
a
i
n
i
n
g
 
1
5
0
,
0
0
0
s
a
m
p
l
e
s
 
r
e
c
o
r
d
e
d
 
a
t
 
4
M
H
Z
T
i
m
e
 
t
o
 
F
a
i
l
u
r
e
T
i
m
e
 
u
n
t
i
l
 
n
e
x
t
 
l
a
b
o
r
a
t
o
r
y
e
a
r
t
h
q
u
a
k
e
 
f
o
r
 
e
a
c
h
 
s
e
g
m
e
n
t
,
 
b
a
s
e
d
o
n
 
a
 
m
e
a
s
u
r
e
 
o
f
 
f
a
u
l
t
 
s
t
r
e
n
g
t
h
Denoising
From reviewing background on Earthquakes, signals should resemble a series
of oscillations, followed by a large spike indicative of a shake.
Also from Leduc, Hulbert, et al. the collected data was collected using a
piezoceramic sensor and not generated using a physical model (e.g. second-
order hyperbolic partial differential equation) so it will be subject to noise.
Also just qualitatively listening to the acoustic signal, it sounds noisy!
We are interested if denoising helps get better results!
Denoising
Classical
High-pass, low-pass, band-pass filters.
Statistical methods
Kalman Filtering (but you need to know the noise model)
MAP Estimation (you need some sort of model for the data)
Sparse Recovery (relatively hard to implement and could be non-convex)
Deep Learning Methods
Linear autoencoders
Treat acoustic signal as one long vector.
Convolutional autoencoders
Compute the spectrogram and feed that into a convolutional autoencoder.
Wavelet Methods
Recent and very popular and perform well against Kalman filtering when dealing with non-
Gaussian noise.
Denoising
Four approaches.
For each approach, the large acoustic data is broken into smaller
segments of length 784.
Classical high-pass filter
Use a 5th or 7th order butterworth filter to remove the low
frequency components.
Wavelet
Using a Daubechies wavelet.
Wavelet + high-pass filter
First the signal is filtered through a high-pass filter, then a wavelet
filter using Haar wavelets.
Linear Autoencoder
300 -> 200 -> 100 -> 200 -> 300 -> 784 Dense layers.
30% Dropout at each hidden layer.
Leaky ReLU activation for each hidden layer.
Feature Extraction
Features were extracted on a
segment by segment basis, general
segment statistics and statistics
based on a rolling window for each
segment ex. 
Across windows of 10,100, and 1000
timestep windows, similar to paper [ref]
Statistics for acoustic signals at the start
and end of each segment
Linear Detrending of signals 
Individual features were then
standardized after being generated
54 features in total generated
Model Details
Initially based off of prior work results we began by fitting a Random Forest
model to the data.
Estimator hyperparameter was selected using Randomized Search
Final Parameters
N_estimators = 1600
Model Details
As an improvement to Random Forest, we will use XGBoost, which improves on Random
forest through Gradient Boosting and allowing for very efficient training and inference 
Hyperparameters were selected through a Random Hyperparameter search with a stratified 5-
Fold Cross Validation 
Final Parameters:
Max Depth = 9
Min Child Weight = 6
Subsampling = 0.9 
Subsampling By Tree = 0.8
Results/Observations - Random Forest
Mean MAE: 3.8460411489774327
Results/Observations - XGBoost
With a 5-Fold Cross Validation: Mean Validation MAE of 2.0866
Results/Observations - XGBoost
Future Work
Implement a convolutional autoencoder for
denoising using the spectrogram as the input
to the network and compare it to linear
autoencoder.
See the difference between noisy data and
denoised data.
References
[1]
https://public.lanl.gov/geophysics/geophysics
/nonlinear/2017/Rouet-LeducGRL17.pdf
[2]
https://onlinelibrary.wiley.com/doi/full/10.1111/j.
1365-246X.2007.03650.x
[3]
https://web.njit.edu/~akansu/PAPERS/ANA-
IWS-WAS-
ELSEVIER%20PHYSCOM%202010.pdf
Slide Note
Embed
Share

Earthquakes can have devastating impacts, both in terms of financial losses and loss of life. Predicting earthquakes using seismographic acoustic data can help mitigate these impacts by enabling timely response systems. This research explores utilizing machine learning models to predict time-to-failure based on features extracted from acoustic signals. The dataset used includes acoustic data from laboratory earthquakes recorded with a piezoceramic sensor. Denoising techniques are also considered to improve the accuracy of earthquake predictions.

  • Earthquake Prediction
  • Acoustic Data
  • Machine Learning
  • Seismographic Data
  • Denoising

Uploaded on Feb 24, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. QuakeML Stanley Dillon Hicks, Meher Akhil Birlangi, Jun Hao (Simon) Hu

  2. Background Earthquakes can cause billions of dollars in damage and cause large losses of life Earthquakes also devastate local infrastructure and in rare and unfortunate cases can cause land to be uninhabitable (Fukushima) Thus, it is desirable to be able to predict when Earthquakes occur, and prepare for them.

  3. Background Predicting earthquakes from seismographic acoustic data can allow for just-in-time systems to react and mitigate damage in the crucial seconds before an earthquake strikes The LANL Dataset provides acoustic data that can be used to predict earthquakes

  4. Main Reference: Leduc, Hulbert, et al. Los Alamos National Laboratory (LANL) Simulated earthquakes in the lab and measured acoustic signal and time-to- failure. Used a Random Forests model to predict time-to-failure. Features used: Variance of acoustic signal Kurtosis of acoustic signal (skewness of the distribution) Mean of acoustic signal, rolling window (100, 1000, 10000).

  5. How Machine Learning Can Be Applied The real world physical model for earthquakes is incomplete Modern machine learning methods show lots of promise on time series data Models with fast inference times may provide actionable data faster

  6. Dataset Details Dataset: LANL Earthquake Prediction (Kaggle) Acoustic Data Time to Failure Acoustic Data of laboratory earthquakes recorded from a piezoceramic sensor 4194 Segments containing 150,000 samples recorded at 4MHZ Time until next laboratory earthquake for each segment, based on a measure of fault strength

  7. Denoising From reviewing background on Earthquakes, signals should resemble a series of oscillations, followed by a large spike indicative of a shake. Also from Leduc, Hulbert, et al. the collected data was collected using a piezoceramic sensor and not generated using a physical model (e.g. second- order hyperbolic partial differential equation) so it will be subject to noise. Also just qualitatively listening to the acoustic signal, it sounds noisy! We are interested if denoising helps get better results!

  8. Denoising Classical Statistical methods Kalman Filtering (but you need to know the noise model) MAP Estimation (you need some sort of model for the data) Sparse Recovery (relatively hard to implement and could be non-convex) Deep Learning Methods Linear autoencoders Treat acoustic signal as one long vector. Convolutional autoencoders Compute the spectrogram and feed that into a convolutional autoencoder. Wavelet Methods Recent and very popular and perform well against Kalman filtering when dealing with non- Gaussian noise. High-pass, low-pass, band-pass filters.

  9. Denoising Four approaches. For each approach, the large acoustic data is broken into smaller segments of length 784. Classical high-pass filter Use a 5th or 7th order butterworth filter to remove the low frequency components. Wavelet Using a Daubechies wavelet. Wavelet + high-pass filter First the signal is filtered through a high-pass filter, then a wavelet filter using Haar wavelets. Linear Autoencoder 300 -> 200 -> 100 -> 200 -> 300 -> 784 Dense layers. 30% Dropout at each hidden layer. Leaky ReLU activation for each hidden layer.

  10. Feature Extraction Features were extracted on a segment by segment basis, general segment statistics and statistics based on a rolling window for each segment ex. Across windows of 10,100, and 1000 timestep windows, similar to paper [ref] Statistics for acoustic signals at the start and end of each segment Linear Detrending of signals Individual features were then standardized after being generated 54 features in total generated

  11. Model Details Initially based off of prior work results we began by fitting a Random Forest model to the data. Estimator hyperparameter was selected using Randomized Search Final Parameters N_estimators = 1600

  12. Model Details As an improvement to Random Forest, we will use XGBoost, which improves on Random forest through Gradient Boosting and allowing for very efficient training and inference Hyperparameters were selected through a Random Hyperparameter search with a stratified 5- Fold Cross Validation Final Parameters: Max Depth = 9 Min Child Weight = 6 Subsampling = 0.9 Subsampling By Tree = 0.8

  13. Results/Observations - Random Forest Mean MAE: 3.8460411489774327

  14. Results/Observations - XGBoost With a 5-Fold Cross Validation: Mean Validation MAE of 2.0866

  15. Results/Observations - XGBoost

  16. Implement a convolutional autoencoder for denoising using the spectrogram as the input to the network and compare it to linear autoencoder. See the difference between noisy data and denoised data. Future Work

  17. References [1] https://public.lanl.gov/geophysics/geophysics /nonlinear/2017/Rouet-LeducGRL17.pdf [2] https://onlinelibrary.wiley.com/doi/full/10.1111/j. 1365-246X.2007.03650.x [3] https://web.njit.edu/~akansu/PAPERS/ANA- IWS-WAS- ELSEVIER%20PHYSCOM%202010.pdf

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#