Open source vision language model JoyAI-VL-Interaction from JD.com watches live video streams and speaks without being ...