GitHub - senli1073/LNRL: Label Noise-Robust Learning for Microseismic Arrival Time Picking

Architecture
Introduction
Usage
License

Architecture

Introduction

Label Noise-Robust Learning (LNRL) approach was designed for handling label noise in microseismic tasks with small-scale datasets. LNRL aligns feature representation and label representation distribution in multiple feature spaces, learns the correlation between instances and label noise, and mitigates the impact of label noise.

The code of this project is developed based on SeisT.

Usage

Data Preparation

For training and evaluation

Create a new file named mydata.py in the directory dataset/ to read the metadata and seismograms of the dataset. And the @register_dataset decorator needs to be used to register the custom dataset.

(Please refer to the code example datasets/sos.py)

Training

Model
Before starting training, please make sure that the model code is in the directory models/ and register it using the @register_model decorator. All available models in the project can be inspected by using the following method:
```
>>> from models import get_model_list
>>> get_model_list()
['lnrl','seist']
```
Model Configuration
The configurations of the loss functions, labels, and the corresponding models are in config.py which also provides a detailed explanation of all the fields.

Start training
To start training with a CPU or a single GPU, please use the following command to start training:

python main.py \
  --seed 0 \
  --mode "train_test" \
  --model-name "lnrl" \
  --log-base "./logs" \
  --device "cuda:0" \
  --data "/root/data/Datasets/SOS" \
  --dataset-name "sos" \
  --sigma 600 \
  --data-split true \
  --train-size 0.8 \
  --val-size 0.1 \
  --shuffle true \
  --workers 8 \
  --in-samples 6000 \
  --augmentation true \
  --epochs 200 \
  --patience 30 \
  --batch-size 300

Use torchrun if training with multiple GPUs.

There are also a variety of other custom arguments which are not mentioned above. Use the command python main.py --help to see more details.

Testing

Use the following command to start testing:

python main.py \
  --seed 0 \
  --mode "test" \
  --model-name "lnrl" \
  --log-base "./logs" \
  --device "cuda:0" \
  --data "/root/data/Datasets/SOS" \
  --dataset-name "sos" \
  --data-split true \
  --train-size 0.8 \
  --val-size 0.1 \
  --workers 8 \
  --in-samples 6000 \
  --batch-size 300

It should be noted that the train_size, val_size, and seed in the test phase must be consistent with that training phase. Otherwise, the test results may be distorted.

Acknowledgement

This project refers to some excellent open source projects: PhaseNet, EQTransformer

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
datasets		datasets
images		images
models		models
training		training
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Architecture

Introduction

Usage

Data Preparation

Training

Testing

Acknowledgement

License

About

Packages

Languages

License

senli1073/LNRL

Folders and files

Latest commit

History

Repository files navigation

Architecture

Introduction

Usage

Data Preparation

Training

Testing

Acknowledgement

License

About

Topics

Resources

License

Stars

Watchers

Forks

Packages 0

Languages

Packages