Replies: 2 comments 1 reply
-
Glad that the code is useful! Today, I would go for the implementation in rtdl. P.S. FYI, this NeurIPS 2022 paper may also be relevant.
-
Awesome! Thanks for the quick reply - and I have seen the paper, great work!
-
Thanks for the great repos and for making this code open-source!
I noticed the disclaimer in the README. It is clear that there are differences in implementation between the Transformer class in this repo and the FTTransformer class in rtdl. I am wondering: what is the practical consequence of these differences?
If we are not interested in reproducing the exact results of the NeurIPS 2021 paper (but instead want to use FT-Transformer as a baseline for other tasks, where its architecture and hyperparameters will be tuned anyway), is there any reason not to use the (more mature) implementation in rtdl?