high-performance {tabnet} profvis on CPU #79

cregouby · 2022-01-31T17:52:22Z

This issue aims to improve {tabnet} performance by using common profiling tools to understand where optimization effort should be focused.

Proposed performance script

The goal here is to have the largest batch available to run on a CPU, in order to favor time spent in compute over time spent in data movement.
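To make the effect of these settings concrete, here is a rough arithmetic sketch using the `batch_size` and `virtual_batch_size` values from the script that follows. The row count is an assumption inferred from the file name `train-0.1m.csv` (about 100,000 rows), and the chunking rule is a simplification of how ghost batch normalization splits a batch, so treat the numbers as indicative only.

```r
# Values taken from the tabnet_fit() call in the proposed script
batch_size         <- 1024^2   # 1,048,576 rows per batch
virtual_batch_size <- 262144   # rows per virtual (ghost) batch

# Assumption: row count inferred from the file name "train-0.1m.csv"
n_train <- 1e5

# Number of batches the dataloader yields per epoch
batches_per_epoch <- ceiling(n_train / batch_size)

# Rows actually present in a batch, and how many virtual batches they form
rows_in_batch   <- min(n_train, batch_size)
virtual_batches <- ceiling(rows_in_batch / virtual_batch_size)

cat("batches per epoch:          ", batches_per_epoch, "\n")
cat("virtual batches per batch:  ", virtual_batches, "\n")
cat("virtual batches if batch is full:", batch_size / virtual_batch_size, "\n")
```

Under that assumption the whole training set fits in a single batch, so each epoch is one forward and backward pass and data movement is kept to a minimum.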

library(tabnet)

# use local caching
d_train <- data.table::fread(pins::pin("https://s3.amazonaws.com/benchm-ml--main/train-0.1m.csv"), stringsAsFactors=TRUE)
d_test <- data.table::fread(pins::pin("https://s3.amazonaws.com/benchm-ml--main/test.csv"))

## align cat. values (factors)
d_train_test <- rbind(d_train, d_test)
n1 <- nrow(d_train)
n2 <- nrow(d_test)
d_train <- d_train_test[1:n1,]
d_test <- d_train_test[(n1+1):(n1+n2),]


system.time({
  md <- tabnet_fit(dep_delayed_15min ~ ., d_train, device = "cpu",
                   epochs = 5, batch_size = 1024^2,
                   virtual_batch_size = 262144, verbose = TRUE)
})
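The script above only reports wall-clock time. To collect the flame graphs requested in the result table, the same call could be wrapped in `profvis::profvis()`. The following is a minimal sketch, not the author's method: `profile_tabnet_fit()` and the output file name are hypothetical, and it assumes the {profvis} and {htmlwidgets} packages are installed and that `d_train` was prepared as above.

```r
# Hypothetical helper (not part of {tabnet}): profile one tabnet_fit() run
# with the same settings as the timing script and save the flame graph.
profile_tabnet_fit <- function(data, out_file = "tabnet-cpu-profvis.html") {
  # profvis() samples the call stack while the expression runs
  p <- profvis::profvis({
    tabnet::tabnet_fit(dep_delayed_15min ~ ., data, device = "cpu",
                       epochs = 5, batch_size = 1024^2,
                       virtual_batch_size = 262144, verbose = TRUE)
  })
  # Save the interactive widget as an HTML file that can be attached or shared
  htmlwidgets::saveWidget(p, out_file)
  invisible(p)
}

# Usage, once d_train exists:
# profile_tabnet_fit(d_train)
```

Saving the widget to HTML makes it possible to compare profiles collected on Linux, Windows, and macOS side by side.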

Proposed result table

CPU Linux

Actual CPU profile	Expected CPU profile	Actual profvis flame graph

Profvis data

CPU Windows

Actual CPU profile	Expected CPU profile	Actual profvis flame graph

Profvis data

CPU MacOS

Actual CPU profile	Expected CPU profile	Actual profvis flame graph

Profvis data

This was referenced Feb 1, 2022

Dataloader single worker and default batch_size makes R tabnet 4-15x slower than pytorch tabnet #37

Open

GPU not in use with tidymodels and tabnet #80

Closed

Feature/better defaults improves performance through better torch dataloader usage #83

Merged

cregouby added the wontfix label Apr 1, 2022
