-
Notifications
You must be signed in to change notification settings - Fork 318
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Math: Optimize 16-bit matrix multiplication function
Implemented optimizations in the 16-bit matrix multiplication function by changing accumulator data type from int64_t to int32_t. This reduces the instruction cycle count i.e. by ~8.18% for matrix multiplication. Enhance pointer arithmetic within loops for better readability and compiler optimization. Eliminate unnecessary conditionals by directly handling Q0 data in the algorithm core logic. Performance gains from these optimisation include a 36.31% reduction in memory usage for matrix multiplication function Signed-off-by: Shriram Shastry <malladi.sastry@intel.com>
- Loading branch information
1 parent
40c3503
commit 8502790
Showing
1 changed file
with
22 additions
and
37 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters