Visiobot: Conversational Image Recognition Chatbot
Main Article Content
Abstract
As artificial intelligence advances, new possibilities for conversational agents have been made possible by the combination of computer vision and natural language processing. A thorough analysis of the VisioBot, a conversational chatbot made for image recognition tasks, is provided in this survey. In order to comprehend how VisioBot processes and interprets images in real-time conversations, we look at the different approaches, frameworks, and architectures used in the system. The survey examines the main obstacles to integrating language and vision models, such as multimodal data processing, captioning images, and visual input user interaction. Furthermore, we evaluate the present state of image recognition chatbots and pinpoint research gaps in areas like scalability, accuracy, and contextual awareness. This survey intends to give researchers and developers working in the domains of artificial intelligence and human-computer interaction a thorough grasp of VisioBot's potential uses and future development by examining recent developments in this area.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.