PaddlePaddle · EricKing19 · Aug 19, 2024 · Oct 15, 2024
diff --git a/docs/zh/examples/preformer.md b/docs/zh/examples/preformer.md
@@ -0,0 +1,239 @@
+# Preformer
+
+Before training or evaluation, download the dataset files.
+
+Before evaluation, download a pretrained model or train one yourself.
+
+=== "Training command"
+
+    ``` sh
+    # Train the model
+    python train.py
+    ```
+
+=== "Evaluation command"
+
+    ``` sh
+    # Evaluate the model
+    python train.py mode=eval
+    ```
+
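+## 1. Background
+
+Precipitation is a weather phenomenon closely tied to human production and daily life. Accurate precipitation nowcasting provides key technical support for public services such as agricultural management, transportation planning, and disaster prevention, and it is also a challenging academic research task. In recent years, deep learning has achieved major breakthroughs in weather forecasting. Studying deep-learning-based precipitation nowcasting on multimodal three-dimensional (altitude, longitude, and latitude) meteorological data therefore has both theoretical value and broad application prospects.
+
+Preformer is a spatiotemporal Transformer network for precipitation nowcasting that consists of an encoder, an evolver, and a decoder. The encoder encodes spatial features by exploring the dependencies among embeddings. The evolver learns global temporal dynamics from the rearranged embeddings. Finally, the decoder decodes the spatiotemporal representation into future precipitation.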
+
+
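+## 2. Model principles
+
+This section gives a brief introduction to how Preformer works.
+
+### 2.1 Encoder
+
+This module uses two Transformer layers to extract spatial features and update the node features: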
+
+``` py linenums="8" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:194:217
+--8<--
+```
+
+### 2.2 Evolver
+
+This module uses two Transformer layers to learn global temporal dynamics:
+
+``` py linenums="29" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:220:254
+--8<--
+```
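
The "rearranged embeddings" that the evolver works on can be pictured as a transpose between two token layouts. The sketch below illustrates only that general idea and is not the code in `ppsci/arch/preformer.py`. It assumes the encoder sees one sequence of spatial tokens per time step, while the evolver sees one sequence of time steps per spatial location. The function name `rearrange_for_evolver` is hypothetical.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def rearrange_for_evolver(per_step: Sequence[Sequence[T]]) -> List[List[T]]:
    """Turn [time][location] embeddings into [location][time] embeddings."""
    # zip(*rows) transposes the two leading axes of a nested sequence.
    return [list(track) for track in zip(*per_step)]


# Toy embeddings labelled "t<time>n<location>" so the layout is easy to read.
per_step = [[f"t{t}n{n}" for n in range(3)] for t in range(2)]
per_location = rearrange_for_evolver(per_step)
print(per_location[0])  # embeddings of location 0 across all time steps
```

With this layout, attention applied along the inner axis mixes information across time for a fixed location, which is one way to read "global temporal dynamics".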
+
+### 2.3 Decoder
+
+This module uses two convolution layers to decode the spatiotemporal representation into future precipitation:
+
+``` py linenums="29" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:257:273
+--8<--
+```
+
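+### 2.4 Preformer model structure
+
+Preformer first uses a feature embedding layer to spatially encode the input signal (meteorological variables from the past few hours):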
+
+``` py linenums="73" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:293:293
+--8<--
+```
+
+``` py linenums="94" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:194:217
+--8<--
+```
+
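+The model then uses the evolver to learn the dynamics of the spatial features and predict the meteorological features for the next few hours: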
+
+``` py linenums="75" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:310:313
+--8<--
+```
+
+``` py linenums="96" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:220:254
+--8<--
+```
+
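+Finally, the model combines the spatiotemporal dynamics with the initial low-level meteorological features and uses two convolution layers to predict the future short-term precipitation intensity: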
+
+``` py linenums="112" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:83:86
+--8<--
+```
+
+``` py linenums="35" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py:112:116
+--8<--
+```
+
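+## 3. Model training
+
+### 3.1 Dataset
+
+This example uses the preprocessed ERA5SQ dataset, a subset of the ERA5 reanalysis data. ERA5SQ contains a variety of global atmospheric, land, and ocean variables at a resolution of 31 km. The dataset spans 1979 to 2018 and provides an hourly estimate of weather conditions, which makes it well suited to tasks such as precipitation prediction and total water vapor analysis.
+
+The dataset is stored as a T x H x W array that records rainfall at each location and time, where T is the length of the time series and H and W are the height and width of the grid obtained by partitioning along latitude and longitude. The data are split by year into training, validation, and test sets at a ratio of 7:2:1. The mean and standard deviation of the rainfall data are precomputed for the normalization step that follows.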
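
As a rough illustration of the 7:2:1 split by year and of normalization with precomputed statistics, here is a minimal sketch. It assumes the split is chronological, which the text does not state, and it uses toy rainfall values rather than ERA5SQ data. The helper names `split_years` and `normalize` are hypothetical and are not part of the dataset interface.

```python
from statistics import fmean, pstdev
from typing import Dict, List, Sequence


def split_years(years: Sequence[int], ratios=(7, 2, 1)) -> Dict[str, List[int]]:
    """Split a chronologically sorted sequence of years by the given ratios."""
    total = sum(ratios)
    n_train = len(years) * ratios[0] // total
    n_valid = len(years) * ratios[1] // total
    return {
        "train": list(years[:n_train]),
        "valid": list(years[n_train:n_train + n_valid]),
        "test": list(years[n_train + n_valid:]),
    }


def normalize(values: Sequence[float], mean: float, std: float) -> List[float]:
    """Standardize values with precomputed statistics: (x - mean) / std."""
    return [(v - mean) / std for v in values]


splits = split_years(range(1979, 2019))  # 1979..2018 inclusive, 40 years
print({name: (ys[0], ys[-1], len(ys)) for name, ys in splits.items()})

rain = [0.0, 2.0, 4.0, 6.0]  # toy rainfall values, not real data
mean, std = fmean(rain), pstdev(rain)
normalized = normalize(rain, mean, std)
print(normalized)
```

Computing the statistics once and reusing them for every split keeps the validation and test data on the same scale as the training data.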
+
+### 3.2 Model training
+
+#### 3.2.1 Model construction
+
+This example is built on the Preformer model, expressed in PaddleScience code as follows:
+
+``` py linenums="79" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:133:133
+--8<--
+```
+
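+#### 3.2.2 Constraint construction
+
+This example solves the problem with a data-driven approach, so it uses PaddleScience's built-in `SupervisedConstraint` to build a supervised constraint. Before defining the constraint, first specify the parameters it uses for data loading.
+
+The training set is loaded as follows: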
+
+``` py linenums="20" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:44:79
+--8<--
+```
+
+The supervised constraint is defined as follows:
+
+``` py linenums="40" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:81:86
+--8<--
+```
+
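+#### 3.2.3 Validator construction
+
+During training, the example evaluates the current model on the validation set at a fixed interval of epochs, which requires building a validator with `SupervisedValidator`.
+
+The validation set is loaded as follows: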
+
+``` py linenums="44" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:177:191
+--8<--
+```
+
+The supervised validator is defined as follows:
+
+``` py linenums="65" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:195:203
+--8<--
+```
+
+#### 3.2.4 Learning rate and optimizer
+
+This example sets the learning rate to `1e-3` and uses the `Adam` optimizer, expressed in PaddleScience code as follows:
+
+``` py linenums="83" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:136:140
+--8<--
+```
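
To make the effect of the `1e-3` learning rate concrete, the sketch below applies a single Adam update to one scalar parameter in plain Python. It is a textbook form of the update rule and not the PaddleScience or Paddle implementation. The `beta1`, `beta2`, and `eps` values are the commonly used defaults and are assumptions here, since the text only specifies the learning rate.

```python
import math
from typing import Tuple


def adam_step(
    param: float,
    grad: float,
    m: float,
    v: float,
    step: int,
    lr: float = 1e-3,
    beta1: float = 0.9,
    beta2: float = 0.999,
    eps: float = 1e-8,
) -> Tuple[float, float, float]:
    """Apply one Adam update to a scalar parameter; return (param, m, v)."""
    m = beta1 * m + (1 - beta1) * grad         # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad * grad  # second-moment estimate
    m_hat = m / (1 - beta1 ** step)            # bias correction
    v_hat = v / (1 - beta2 ** step)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


param, m, v = adam_step(param=1.0, grad=0.5, m=0.0, v=0.0, step=1)
print(param)
```

On the first step the bias-corrected ratio is close to the sign of the gradient, so the parameter moves by roughly one learning rate regardless of the gradient's magnitude.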
+
+#### 3.2.5 Model training
+
+With the setup above complete, pass the instantiated objects to `ppsci.solver.Solver` in order and start training.
+
+``` py linenums="88" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:143:156
+--8<--
+```
+
+#### 3.2.6 Model export
+
+Setting the `eval_during_train` and `eval_freq` parameters of `ppsci.solver.Solver` automatically saves the model parameters that perform best on the validation set.
+
+``` py linenums="100" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:158:158
+--8<--
+```
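
The interaction between `eval_during_train`, `eval_freq`, and best-model saving can be sketched as a plain loop. This shows the general pattern only and is not how `ppsci.solver.Solver` is implemented. It assumes a lower metric is better and that evaluation happens only on epochs divisible by `eval_freq`. The function `train_with_periodic_eval` and the fake metric are hypothetical.

```python
from typing import Callable, List, Optional, Tuple


def train_with_periodic_eval(
    epochs: int,
    eval_freq: int,
    evaluate: Callable[[int], float],
) -> Tuple[Optional[int], float, List[int]]:
    """Evaluate every `eval_freq` epochs and remember the best (lowest) metric."""
    best_epoch: Optional[int] = None
    best_metric = float("inf")
    evaluated: List[int] = []
    for epoch in range(1, epochs + 1):
        # ... one epoch of training would run here ...
        if epoch % eval_freq == 0:
            metric = evaluate(epoch)
            evaluated.append(epoch)
            if metric < best_metric:
                # A real solver would write a "best" checkpoint at this point.
                best_epoch, best_metric = epoch, metric
    return best_epoch, best_metric, evaluated


def fake_error(epoch: int) -> float:
    """Hypothetical validation error that is lowest at epoch 100."""
    return abs(epoch - 100) / 100


best_epoch, best_metric, evaluated = train_with_periodic_eval(150, 20, fake_error)
print(evaluated, best_epoch)
```

The call uses `epochs=150` and `eval_freq=20`, the values set in `examples/preformer/conf/train.yaml`.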
+
+#### 3.2.7 Evaluating the model on the test set
+
+After training finishes, run the evaluation process to evaluate the model on the test set.
+
+``` py linenums="112" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py:160:160
+--8<--
+```
+
+
+## 4. Complete code
+
+Dataset interface:
+
+``` py linenums="1" title="ppsci/data/dataset/era5sq_dataset.py"
+--8<--
+ppsci/data/dataset/era5sq_dataset.py
+--8<--
+```
+
+Model architecture:
+
+``` py linenums="1" title="ppsci/arch/preformer.py"
+--8<--
+ppsci/arch/preformer.py
+--8<--
+```
+
+Model training:
+
+``` py linenums="1" title="examples/preformer/train.py"
+--8<--
+examples/preformer/train.py
+--8<--
+```
+
+Configuration file:
+
+``` yaml linenums="1" title="examples/preformer/conf/train.yaml"
+--8<--
+examples/preformer/conf/train.yaml
+--8<--
+```
diff --git a/examples/preformer/conf/train.yaml b/examples/preformer/conf/train.yaml
@@ -0,0 +1,72 @@
+defaults:
+  - ppsci_default
+  - TRAIN: train_default
+  - TRAIN/ema: ema_default
+  - TRAIN/swa: swa_default
+  - EVAL: eval_default
+  - INFER: infer_default
+  - hydra/job/config/override_dirname/exclude_keys: exclude_keys_default
+  - _self_
+
+hydra:
+  run:
+    # fixed output directory for this example
+    dir: outputs_preformer
+  job:
+    name: ${mode} # name of logfile
+    chdir: false # keep current working directory unchanged
+  sweep:
+    # output directory for multirun
+    dir: ${hydra.run.dir}
+    subdir: ./
+
+# general settings
+mode: train # running mode: train/eval
+seed: 1024
+output_dir: ${hydra:run.dir}
+log_freq: 20
+
+# set training hyper-parameters
+SQ_LEN: 6
+IMG_H: 192
+IMG_W: 256
+USE_SAMPLED_DATA: false
+
+# set train data path
+TRAIN_FILE_PATH: /path/to/ERA5/
+DATA_MEAN_PATH: examples/preformer/datasets/era5/stat/mean.nc
+DATA_STD_PATH: examples/preformer/datasets/era5/stat/std.nc
+
+# set evaluate data path
+VALID_FILE_PATH: /path/to/ERA5/
+
+# model settings
+MODEL:
+  input_keys: ["input"]
+  output_keys: ["output"]
+  shape_in:
+    - 6
+    - 12
+    - ${IMG_H}
+    - ${IMG_W}
+
+# training settings
+TRAIN:
+  epochs: 150
+  save_freq: 20
+  eval_during_train: true
+  eval_freq: 20
+  lr_scheduler:
+    epochs: ${TRAIN.epochs}
+    learning_rate: 0.001
+    by_epoch: true
+  batch_size: 16
+  pretrained_model_path: null
+  checkpoint_path: null
+
+# evaluation settings
+EVAL:
+  pretrained_model_path: null
+  compute_metric_by_batch: true
+  eval_with_no_grad: true
+  batch_size: 16
diff --git a/examples/preformer/datasets/era5/stat/mean.nc b/examples/preformer/datasets/era5/stat/mean.nc
diff --git a/examples/preformer/datasets/era5/stat/std.nc b/examples/preformer/datasets/era5/stat/std.nc