Pegasus-v1 Technical Report

Jung, Raehyuk; Go, Hyojun; Yi, Jaehyuk; Jang, Jiho; Kim, Daniel; Suh, Jay; Lee, Aiden; Han, Cooper; Lee, Jae; Kim, Jeff; Kim, Jin-Young; Kim, Junwan; Park, Kyle; Lee, Lucas; Ha, Mars; Seo, Minjoon; Jo, Abraham; Park, Ed; Kianinejad, Hassan; Kim, SJ; Moon, Tony; Jeong, Wade; Popescu, Andrei; Kim, Esther; Yoon, EK; Heo, Genie; Choi, Henry; Kang, Jenna; Han, Kevin; Seo, Noah; Nguyen, Sunny; Won, Ryan; Park, Yeonhoo; Giuliani, Anthony; Chung, Dave; Yoon, Hans; Le, James; Ahn, Jenny; Lee, June; Saini, Maninder; Sanders, Meredith; Lee, Soyoung; Kim, Sue; Couture, Travis

Full-text links:

Download:

Current browse context:

cs.MM

< prev | next >

new | recent | 2404

Computer Science > Multimedia

Title: Pegasus-v1 Technical Report

(Submitted on 23 Apr 2024)

Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's architecture, training strategies, and its performance in benchmarks on video conversation, zero-shot video question answering, and video summarization. We also explore qualitative characteristics of Pegasus-1 , demonstrating its capabilities as well as its limitations, in order to provide readers a balanced view of its current state and its future direction.

Subjects:	Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.14687 [cs.MM]
	(or arXiv:2404.14687v1 [cs.MM] for this version)

Submission history

From: Hyojun Go [view email]
[v1] Tue, 23 Apr 2024 02:32:57 GMT (39507kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2404.14687

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Multimedia

Title: Pegasus-v1 Technical Report

Submission history