Abstract: This work proposes TimeChat, a time-sensitive multi-modal large language model specifically designed for long video understanding. Our model incorporates two key architectural contributions: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results