Examples of Object Query Language

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Abstract: Large Vision-Language Models (LVLMs) suffer from severe object hallucinations, leading them to frequently generate outputs that do not correspond to the image content, significantly reducing ...

IEEE

Optimizing Query-by-Example Spoken Term Detection with Audio-to-Token Sequence Clustering and Query-Guided Retrieval

Abstract: Query-by-Example Spoken Term Detection (QbE-STD) retrieves relevant audio files corresponding to a spoken query, without relying on explicit word-level textual transcriptions. In ...

New memory system helps robots interact and work side-by-side with humans

A robot on a factory floor can carry parts, scan shelves, and move around people with growing skill. What it still struggles ...

How to use Google query expansion to improve content visibility

Your content may already be surfacing for searches you never planned for. Here's how to identify those opportunities and act ...

ascopubs.org

Evaluating reliability of large language models for patient queries: Concordance with NCCN invasive breast cancer guideline using ChatGPT, DeepSeek, and Gemini.

Impact of real-time artificial intelligence ultrasound system based on breast density in C4 breast lesions.

14d

Show inaccessible results

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Optimizing Query-by-Example Spoken Term Detection with Audio-to-Token Sequence Clustering and Query-Guided Retrieval

New memory system helps robots interact and work side-by-side with humans

How to use Google query expansion to improve content visibility

Evaluating reliability of large language models for patient queries: Concordance with NCCN invasive breast cancer guideline using ChatGPT, DeepSeek, and Gemini.

MIT develops spatial long-term memory framework for AI robots

No more sidecar files: AWS introduces S3 Annotations

WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation