Abstract: Pre-trained vision-language models (VLMs) and language models (LMs) have recently garnered significant attention due to their remarkable ability to represent textual concepts, opening up new ...
Abstract: Global warming has significantly increased the frequency of forest fires. Unmanned aerial vehicles (UAVs) provide rapid response and real-time monitoring, offering unique advantages over ...
* Equal contribution. †Co-corresponding author. Each image is paired with one or more text instances with polygon-level annotations. The dataset follows a consistent annotation format, detailed in ...