Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add avx2 optimization for encoder and decoder #153

Merged
merged 3 commits into from
Sep 9, 2024

Conversation

tszumski
Copy link
Contributor

@tszumski tszumski commented Sep 3, 2024

  1. Fix bitstream mismatch between ojph_encode_codeblock() and ojph_encode_codeblock_avx512()
  2. ojph_encode_codeblock_avx2() implementation
  3. ojph_decode_codeblock_avx2() implementation

Based on my tests:
Encoder: performance gain ~37% vs C-code, (~7.5% performance drop vs AVX512-code​)
Decoder: performance gain ~7%

@aous72
Copy link
Owner

aous72 commented Sep 4, 2024

Hi Tomasz,

Thank you for this highly-needed PR.
I will try to merge it this week; I just need to run a few tests/checks.

Kind regards,
Aous.

@aous72 aous72 merged commit 58ed06d into aous72:master Sep 9, 2024
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants