
AnalytiCup-2022

This is the source code for the CIKM 2022 AnalytiCup Competition [link]. The code is also available in another repository [link].

Team name: GTML-GEAR, Final Score: 55.2703

  • To run the code:
cd data
python process.py
cd ..
python setup.py install
python federatedscope/main.py --cfg federatedscope/gfl/baseline/myconf_on_cikmcup.yaml --client_cfg federatedscope/gfl/baseline/myconf_on_cikmcup_per_client.yaml
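
The first two commands run data/process.py, which extends each graph's node attributes with aggregated edge attributes (see point 8 of the algorithm description below). A minimal sketch of that preprocessing step, assuming torch_geometric Data objects; the function name is illustrative and not the exact code in process.py:

import torch
from torch_geometric.data import Data

def extend_node_features(data: Data) -> Data:
    # Sum the attributes of the edges incident to each node (as target),
    # then concatenate the aggregated vector to the original node features.
    num_nodes = data.num_nodes
    edge_dim = data.edge_attr.size(-1)
    agg = torch.zeros(num_nodes, edge_dim, dtype=data.edge_attr.dtype)
    agg.index_add_(0, data.edge_index[1], data.edge_attr)  # scatter-add per target node
    data.x = torch.cat([data.x, agg], dim=-1)
    return data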

Figure 1. Algorithm framework of GNN model.

  • Brief introduction of the developed algorithm:
  1. The algorithm framework of the adopted GNN model is shown in Figure 1; a code sketch of the model follows this list.
  2. The hidden-layer dimension is set to 128, and the number of local updates is 10 epochs (except for clients 9, 10, and 13, which use 1 epoch).
  3. We add a BatchNorm layer in GNN_Net_Graph before the classification layer.
  4. We embed the edge_attr of the graph data and adopt the GINEConv layer to encode the graphs.
  5. We use sum pooling (global_add_pool) as the readout layer so the model better distinguishes graph structures.
  6. We use Jumping Knowledge attention in the GNN model to adaptively capture information from multi-hop neighbors.
  7. We set the dropout rate to 0 to improve performance on the regression tasks.
  8. We also extend the node attributes of the graph data: the attributes of the edges incident to each node are summed and then concatenated to the node attributes (implemented in data/process.py; see the preprocessing sketch above).
  9. Finally, we incorporate the validation set during training to improve our score.
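
A minimal sketch of the model described above, assuming PyTorch Geometric. The class name, number of message-passing layers, and output dimension are illustrative; JumpingKnowledge(mode='lstm') stands in for the attention-style JK aggregation, and the exact wiring inside FederatedScope's GNN_Net_Graph differs:

import torch
import torch.nn as nn
from torch_geometric.nn import GINEConv, JumpingKnowledge, global_add_pool

class GINEJKNet(nn.Module):  # hypothetical name, not the class in this repo
    def __init__(self, in_dim, edge_dim, hidden=128, num_layers=2, out_dim=1):
        super().__init__()
        self.node_enc = nn.Linear(in_dim, hidden)    # node feature embedding
        self.edge_enc = nn.Linear(edge_dim, hidden)  # edge_attr embedding (point 4)
        self.convs = nn.ModuleList([
            GINEConv(nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU(),
                                   nn.Linear(hidden, hidden)))
            for _ in range(num_layers)
        ])
        self.jk = JumpingKnowledge(mode='lstm', channels=hidden, num_layers=num_layers)
        self.bn = nn.BatchNorm1d(hidden)             # BatchNorm before the head (point 3)
        self.dropout = nn.Dropout(p=0.0)             # dropout disabled (point 7)
        self.head = nn.Linear(hidden, out_dim)

    def forward(self, x, edge_index, edge_attr, batch):
        x = self.node_enc(x)
        e = self.edge_enc(edge_attr)
        xs = []
        for conv in self.convs:
            x = torch.relu(conv(x, edge_index, e))
            xs.append(x)
        x = self.jk(xs)                              # multi-hop aggregation (point 6)
        x = global_add_pool(x, batch)                # sum pooling readout (point 5)
        x = self.bn(self.dropout(x))
        return self.head(x)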

Figure 2. Algorithm framework of Federated Learning.

  • If it is a federated learning method:

Our solution is an algorithm based on federated learning.

  1. We use "FedBN+ft" as the main framework of heterogeneous task federated learning, which is shown in Figure 2.
  2. First, the type of information transmitted between client and server is "model_para", which includes "sample_size" and "model_para_all".
  3. Second, our federated learning algorithm is the baseline "FedBN".
  4. Furthermore, we adopt a "finetune" step before evaluation, which uses the training, validation, and test data (without labels) to re-estimate the "running_mean" and "running_var" of the BatchNorm layers; see the sketch after this list.
  5. Finally, the large variation in the size of the clients' local data leads to a large variation in the number of local updates each client performs per communication round. To address this, we differentially adjust the number of local updates on each client, which improves the convergence speed and performance of the global model (similar in spirit to FedNova).
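
A minimal sketch of the BatchNorm-statistics finetune step described in point 4, assuming a PyTorch model with the forward signature sketched earlier and a PyTorch Geometric DataLoader over the client's training, validation, and test graphs. No labels or gradient updates are involved; only running_mean and running_var change:

import torch

@torch.no_grad()
def refresh_bn_stats(model, loader, device='cpu'):
    # Reset accumulated statistics, then run forward passes in train mode so
    # the BatchNorm layers re-estimate running_mean / running_var locally.
    for m in model.modules():
        if isinstance(m, torch.nn.BatchNorm1d):
            m.reset_running_stats()
            m.momentum = None  # use a cumulative average over the finetune passes
    model.train()
    for batch in loader:
        batch = batch.to(device)
        model(batch.x, batch.edge_index, batch.edge_attr, batch.batch)
    model.eval()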