Automatic Generation of a Coherent Story from a Set of Images

Aljawy, Zainy

Automatic Generation of a Coherent Story from a Set of Images

dc.contributor.advisor	Mian, Ajaml
dc.contributor.advisor	Hassan, Ghulam Mubashar
dc.contributor.author	Aljawy, Zainy
dc.date.accessioned	2024-01-11T12:01:19Z
dc.date.available	2024-01-11T12:01:19Z
dc.date.issued	2023-12
dc.description.abstract	This dissertation explores vision and language (V&L) algorithms. While (V&L) succeeds in image and video captioning tasks, the dynamic Visual Storytelling Task (VST) remains challenging. VST demands coherent stories from a set of images, requiring grammatical accuracy, flow, and style. The dissertation addresses these challenges. Chapter 2 presents a framework utilizing an advanced language model. Chapters 3 and 4 introduce novel techniques that integrate rich visual representation to enhance generated stories. Chapter 5 introduces a new storytelling dataset with a comprehensive analysis. Chapter 6 proposes a state-of-the-art Transformer-based model for generating coherent and informative story descriptions from image sets.
dc.format.extent	138
dc.identifier.citation	Zainy M. Malakan. (2023). Automatic Generation of a Coherent Story from a Set of Images. [Doctoral Thesis, The University of Western Australia].
dc.identifier.uri	https://hdl.handle.net/20.500.14154/71156
dc.language.iso	en_US
dc.publisher	Saudi Digital Library
dc.subject	Storytelling
dc.subject	Sequential Vision Understanding
dc.subject	Computer Vision
dc.subject	image and video captioning
dc.subject	Deep Learning
dc.subject	Transformer
dc.subject	Advanced Language Model
dc.title	Automatic Generation of a Coherent Story from a Set of Images
dc.type	Thesis
sdl.degree.department	Computer Science and Software Engineering
sdl.degree.discipline	Computer Vision and Artificial Intelligence
sdl.degree.grantor	The University of Western Australia
sdl.degree.name	Doctor of Philosophy
sdl.thesis.source	SACM - Australia

Collections

SACM - Australia

Automatic Generation of a Coherent Story from a Set of Images

Files

Collections