Re: Another graphics card question
Posted: Mon Sep 26, 2022 12:29 am
This is very interesting. I did not really expect LHR cards to be impacted this much and I am a admittedly a little skeptical that LHR would hobble OpenCL performance in this way...
Multi-core performance between a 5600X and 5900X should be definitely be noticeable in general usage.
The GPU version of StarTools still uses the CPU in many parts of algorithms where GPU acceleration is not possible or desirable (e.g. cases where GPU performance would be inferior to using the CPU). There are many such cases, usually involving complex branching and execution paths, or where it is not possible to efficiently parallelize calculations.
Modules in StarTools almost always consist of multiple steps, filters, convolutions, etc. It wholly depends on the constituent steps and algorithms how such a step is executed. For example, a convolution with a custom kernel (lots of brute-force multiplies, easy to parallelize) is much faster on the GPU, whereas a median filter (lots of branching, sorting, harder to parallelize efficiently) is much faster on the CPU.
Decon is indeed the most CPU and GPU demanding module, and you should see both CPU and GPU getting a workout, taking turns during every iteration.
Hope this helps,
Multi-core performance between a 5600X and 5900X should be definitely be noticeable in general usage.
The GPU version of StarTools still uses the CPU in many parts of algorithms where GPU acceleration is not possible or desirable (e.g. cases where GPU performance would be inferior to using the CPU). There are many such cases, usually involving complex branching and execution paths, or where it is not possible to efficiently parallelize calculations.
Modules in StarTools almost always consist of multiple steps, filters, convolutions, etc. It wholly depends on the constituent steps and algorithms how such a step is executed. For example, a convolution with a custom kernel (lots of brute-force multiplies, easy to parallelize) is much faster on the GPU, whereas a median filter (lots of branching, sorting, harder to parallelize efficiently) is much faster on the CPU.
Decon is indeed the most CPU and GPU demanding module, and you should see both CPU and GPU getting a workout, taking turns during every iteration.
Hope this helps,