Skip to content
Snippets Groups Projects

algo: Optimize host code in GPU Hitfinder.

Merged Felix Weiglhofer requested to merge fweig/cbmroot:gpu-optimization into master

Introduces three optimizations to the host code of the STS Hitfinder:

  • Allocate all buffers beforehand on GPU, do dynamic allocation per TS on CPU (allocating the pinned memory for GPU is very slow)
  • Speedup sorting Digis by module by using a lookup table to map the sensor address to an index and parallelizing the copy
  • Copy hits to pinned memory to speed up copy from the GPU
Edited by Felix Weiglhofer

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
Please register or sign in to reply
Loading