B.23. memwrite_simd

memwrite_simd -- SIMD memory write bandwidth
  -1 -- write a float scalar into all elements of a view
        using an explicit SIMD loop
  -2 -- same using a loop unrolled 4 times