Appendix B. Benchmark Usage

Table of Contents

B.1. CELL Benchmark Usage
B.2. conv
B.3. conv2d
B.4. copy
B.5. corr
B.6. CUDA Benchmark Usage
B.7. CVSIP Benchmark Usage
B.8. dist_vmul
B.9. dot
B.10. fastconv
B.11. fft
B.12. fftm
B.13. FFTW3 Benchmark Usage
B.14. fir
B.15. HPEC_KERNEL Benchmark Usage
B.16. IPP Benchmark Usage
B.17. LAPACK Benchmark Usage
B.18. maxval
B.19. mcopy
B.20. meansqval
B.21. meanval
B.22. memwrite
B.23. memwrite_simd
B.24. MPI Benchmark Usage
B.25. prod
B.26. prod_var
B.27. pwarp
B.28. SAL Benchmark Usage
B.29. sfilt
B.30. sort
B.31. stream
B.32. sumsqval
B.33. sumval
B.34. sumval_simd
B.35. svmul
B.36. task_switch
B.37. vdiv
B.38. vma
B.39. vmagsq
B.40. vmmul
B.41. vmul
B.42. vmul_c
B.43. vmul_par

This appendix presents the usage display from each benchmark program. A benchmark prints the display when invoked with no parameters or with the single parameter -0.

B.1. CELL Benchmark Usage

This appendix presents the displays from the CELL benchmarks.

B.1.1. fastconv

fastconv -- fast convolution benchmark for Cell BE
 Sweeping pulse size:
    -1 -- IP, native complex, distributed, single-expr
    -2 -- IP, native complex, distributed, Fastconv object

   -11 -- OP, inter complex,  non-dist
   -12 -- IP, inter complex,  non-dist, single FC
   -13 -- IP, inter complex,  non-dist, multi FC

   -21 -- OP, split complex,  non-dist
   -22 -- IP, split complex,  non-dist
   -23 -- IP, split complex,  non-dist, multi FC

   -32 -- Multiple coeff vectors in time domain, IP, native complex, non-dist, single FC
   -42 -- Multiple coeff vectors in freq domain, IP, native complex, non-dist, single FC