Porting/ncnn

From RCS Wiki
Revision as of 05:11, 18 December 2023 by JeremyRand (talk | contribs) (VSX Targets)

Finished

In progress

  • CI missing for POWER9/Clang
  • Replace SSE with native VSX

VSX Targets

When running Real-ESRGAN in ncnn on POWER9, most of the CPU time (over 81%) is spent inside gemm_transB_packed_tile in convolution_3x3_winograd.h, which is written with SSE2 intrinsics. This function may be a good target for a native VSX rewrite.
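As a rough illustration of what such a rewrite involves, the sketch below shows a 4-wide multiply-accumulate step of the kind a GEMM tile kernel is built from, written once with SSE2 intrinsics and once with native VSX intrinsics. It is not taken from ncnn: the function name madd_tile4, its parameters, and the data layout are hypothetical, and the real gemm_transB_packed_tile is considerably more involved.

```cpp
#include <cassert>
#include <cstddef>

#if defined(__VSX__)
#include <altivec.h>
// altivec.h may define these as macros, which clashes with C++ code.
#undef vector
#undef pixel
#undef bool
#elif defined(__SSE2__)
#include <emmintrin.h>
#endif

// Hypothetical helper (not ncnn code):
//   acc[i] += sum over j in [0, k) of a[4 * j + i] * b[j],  for i = 0..3
void madd_tile4(float* acc, const float* a, const float* b, std::size_t k)
{
    assert(acc != nullptr && a != nullptr && b != nullptr);
#if defined(__VSX__)
    // Native VSX path: one fused multiply-add per step.
    __vector float vacc = vec_xl(0, acc);
    for (std::size_t j = 0; j < k; j++)
    {
        __vector float va = vec_xl(0, a + 4 * j); // load 4 floats
        __vector float vb = vec_splats(b[j]);     // broadcast one float
        vacc = vec_madd(va, vb, vacc);            // va * vb + vacc
    }
    vec_xst(vacc, 0, acc);
#elif defined(__SSE2__)
    // SSE2 path: separate multiply and add.
    __m128 vacc = _mm_loadu_ps(acc);
    for (std::size_t j = 0; j < k; j++)
    {
        __m128 va = _mm_loadu_ps(a + 4 * j);
        __m128 vb = _mm_set1_ps(b[j]);
        vacc = _mm_add_ps(vacc, _mm_mul_ps(va, vb));
    }
    _mm_storeu_ps(acc, vacc);
#else
    // Scalar fallback for other targets.
    for (std::size_t j = 0; j < k; j++)
        for (std::size_t i = 0; i < 4; i++)
            acc[i] += a[4 * j + i] * b[j];
#endif
}
```

In this sketch the VSX branch replaces the SSE2 multiply-then-add pair with a single vec_madd. Because vec_madd is fused, its results can differ in the last bits from the SSE2 path, so a port would probably need to compare outputs with a tolerance rather than bit-for-bit.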

See Also