Hi, I am trying to understand the image serialization parameters used to generate the dataset files behind the accuracies reported for rn18 and rn50 here (https://github.com/libffcv/ffcv-imagenet). The given example uses the following notation:
The reported dataset setup parameters are listed as:
ResNet-50 training: 50% JPEG 500px side length
ResNet-18 training: 10% JPEG 400px side length
Does this mean, for example, that for ResNet-50, 50% of the dataset images are JPEG encoded and the rest are stored raw (i.e., write_mode=proportion)? If so, what JPEG quality was used for encoding (i.e., the last input parameter passed to write_imagenet.sh)? Is it 90, as shown in the example?
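To make my reading concrete, here is a minimal sketch of what I assume write_mode=proportion does: each image is independently stored as JPEG with some probability and raw otherwise. The names choose_encoding and compress_probability are hypothetical and only illustrate my interpretation; this is not taken from the FFCV source.

```python
import random


def choose_encoding(compress_probability, rng):
    """Pick a storage format for one image.

    Assumption (my reading of "proportion"): each image is JPEG encoded
    with probability `compress_probability`, otherwise stored raw.
    """
    return "jpg" if rng.random() < compress_probability else "raw"


# Simulate the ResNet-50 setting ("50% JPEG") over 10,000 images.
rng = random.Random(0)
choices = [choose_encoding(0.5, rng) for _ in range(10_000)]
jpeg_fraction = choices.count("jpg") / len(choices)
print(f"JPEG fraction: {jpeg_fraction:.3f}")
```

Under this interpretation, the percentage controls only how many images are compressed; the JPEG quality would be a separate setting, which is the value I am asking about.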