Performance numbers for 1920x1080p luma-only interpolation:
GPU:
- AMD Radeon HD 7750 - 200 FPS (8 cores)
- Intel HD Graphics 4000 - 60 FPS (16 cores)
- NVidia ION GPU - 27 FPS (2 cores)
- Intel CPU driver - 90 FPS (one core, 3570K@3400MHz)
- AMD CPU driver -20 FPS (one core, 3570K@3400MHz)
- Non-OpenCL implementation (reference SSE4.2) - 175 FPS, one core(synthetic test).
Probably I missed something. So, I will continue my attempts to optimize these filters, because it should be significantly faster. For now filter was implemented using vector instructions (16x). And only 3 vector multiplications performed per pixel.
So, theoretical peak is for various devices should be:
UPD: some improvements was done and now results for AMD GPU is about twice better:
Updated results for 4k motion compensation
So, theoretical peak is for various devices should be:
- For AMD it should be ~ 200 FPS.
- For NVidia ION ~50 FPS.
- For Intel CPU & AMD CPU driver - as we already noticed ~175 FPS, one core.
- And for Intel HD 4000 it should be ~125 FPS.
UPD: some improvements was done and now results for AMD GPU is about twice better:
Updated results for 4k motion compensation
Hi Martin,
ReplyDeleteI am interested to know about how you got 200 fps on AMD's GPU. more information on how u achieved will be helpful for me.
The casino with roulette machines | Vannienailor4166 Blog
ReplyDeleteCasino roulette game is one https://vannienailor4166blog.blogspot.com/ of the most deccasino popular casino games in https://jancasino.com/review/merit-casino/ Malaysia. It offers the latest games casinosites.one with the wooricasinos.info best odds, with big payouts and easy