I think the following will work but perhaps you should confirm that the IP is doing the right thing:
For the splitter:
din0 is C Y, please make sure that this is the correct order in time (or correct endianess if in parallel)
dout0 should be C and you should halve the width of the control packets
dout1 should be Y
A slightly different way of doing the same thing is given in the VIP user guide with the 2 pixels per port option
For the joiner:
din0 should be Y
din1 should be C
dout0 should be C Y and the source of non-image packet should be din0