Function Baseline (ms) Optimized (ms) Speedup matrix_multiply 342.12 189.44 1.81x 5.1 Targeted tuning via annotation Add to your C++ code:
Apply with:
[[xbestpp::hot(iterations=1000000)]] void compute() ... Then run: xbestpp
[output] format = "html" threshold_speedup = 1.10 # only show improvements >10% xbestpp