Wang Xiao (王啸)Ph.D. Student
Department of Computer Science and Engineering |
![]() |
This project unifies three core capabilities—(1) the planning, reasoning, and automatic code-generation power of large language models; (2) the environmental understanding afforded by vision–language models; and (3) real-time intent recognition from speech and pose—and deploys them on the Misty social robot. Guided by the MistyCoPilot scheduler, which automatically invokes Automisty-generated code, the system can flexibly compose functional modules on demand, enabling highly abstract and complex human–robot interaction tasks in speech-language pathology (SLP) scenarios.
Last Updated on March, 2024 |