Add instruction selection for 256-bit VPSHUFD and 128-bit VPERMILPS/VPERMILPD.