
Clean up / speed up various SIMD functions

Merged: Frank Bossen requested to merge bossen/VVCSoftware_VTM:simd1 into master
  • In InterPrediction::rightShiftMSB, use existing floorLog2 function

  • In addAvg_SSE, avoid unnecessary use of 32-bit path

  • In copyBufferSimd, reduce number of loops

  • In paddingSimd, take advantage of padding extent being either 1 or 2

  • In addBIOAvg4_SSE, reduce number of operations and avoid the costly XMM register -> memory -> integer register path
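
Two of the items above can be sketched in portable scalar C++. This is an illustration under stated assumptions, not the code from this merge request: `floorLog2Sketch`, `rightShiftMsbLoop`, `rightShiftMsbLog2` and `padBlockSketch` are hypothetical names, the loop variant is only a guess at what a hand-rolled MSB search looks like, and the actual paddingSimd uses SIMD loads and stores rather than scalar writes. The sketch assumes `denom >= 1` and a buffer with at least `ext` spare samples around the block.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for a floorLog2 helper: index of the highest set bit.
// x must be nonzero.
static inline int floorLog2Sketch(uint32_t x)
{
  int result = 0;
  if (x & 0xffff0000u) { x >>= 16; result += 16; }
  if (x & 0xff00u)     { x >>= 8;  result += 8;  }
  if (x & 0xf0u)       { x >>= 4;  result += 4;  }
  if (x & 0xcu)        { x >>= 2;  result += 2;  }
  if (x & 0x2u)        {           result += 1;  }
  return result;
}

// Hand-rolled variant: search for the MSB of denom bit by bit, then shift.
// Assumes denom >= 1.
static inline int rightShiftMsbLoop(int numer, int denom)
{
  int msbIdx = 0;
  while (msbIdx < 31 && denom >= (1 << msbIdx))
  {
    ++msbIdx;
  }
  return numer >> (msbIdx - 1);
}

// Same result, reusing the log2 helper instead of the loop. Assumes denom >= 1.
static inline int rightShiftMsbLog2(int numer, int denom)
{
  return numer >> floorLog2Sketch(static_cast<uint32_t>(denom));
}

// Pad a width x height block by ext samples on every side, where ext is
// assumed to be 1 or 2. blk points at the top-left sample of the block inside
// a larger buffer with the given stride (stride >= width + 2 * ext).
static void padBlockSketch(int16_t* blk, int stride, int width, int height, int ext)
{
  assert(ext == 1 || ext == 2);

  // Left and right columns: with ext limited to 1 or 2 no inner loop is needed.
  for (int y = 0; y < height; ++y)
  {
    int16_t*      row   = blk + y * stride;
    const int16_t left  = row[0];
    const int16_t right = row[width - 1];
    row[-1]    = left;
    row[width] = right;
    if (ext == 2)
    {
      row[-2]        = left;
      row[width + 1] = right;
    }
  }

  // Top and bottom rows: replicate the already padded first and last rows.
  const int rowLen = width + 2 * ext;
  int16_t*  top    = blk - ext;
  int16_t*  bot    = blk + (height - 1) * stride - ext;
  for (int i = 1; i <= ext; ++i)
  {
    std::memcpy(top - i * stride, top, rowLen * sizeof(int16_t));
    std::memcpy(bot + i * stride, bot, rowLen * sizeof(int16_t));
  }
}
```

Both shift variants return the same value for any `denom >= 1`; the second simply drops the loop. In the padding sketch, knowing that the extent is 1 or 2 lets the column fill be written as at most two stores per side instead of a generic loop over the extent.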

Overall, a decoder runtime reduction of about 3% is expected.
