A video from B station is parsed out into two M4S formatted media streams, one for video and another for audio. To play the video with normal controls in the frontend, how do we handle these individual M4S streams? The video can only load one M4S at a time, not allowing for merged playback. Is this a job for pre-processing or is there a mature playback component that can achieve this?