Search

Results: 2
Visual-and-Language Multimodal Fusion for Sweeping Robot Navigation Based on CNN and GRU
Effectively fusing information between the visual and language modalities remains a significant challenge. To achieve deep integration of natural language and visual information, this research introduces a multimodal fusion...