Optimizing (zext A + zext B) * C to (VMULL A, C) + (VMULL B, C) during