Automatic Multi-Document Text Summarization
dc.contributor.advisor | Mark, Hepple | |
dc.contributor.author | Samah, Alatawi | |
dc.date.accessioned | 2023-11-19T06:53:53Z | |
dc.date.available | 2023-11-19T06:53:53Z | |
dc.date.issued | 2023-11-03 | |
dc.description.abstract | This research explores automatic multi-document text summarization as a solution to automate the summarisstation of substantial amounts of online data. The goal is to produce succinct summaries that effectively communicate the primary idea of published works without using redundant language.This study aims to thoroughly explore the strengths and weaknesses of the Flan-T5 language model for text summarization, using the samsum dataset as a benchmark for dialogue summarization tasks. Through systematic experimentation varying hyperparameters, fine-tuning approaches, and training conditions, we will provide comprehensive insight into the model’s capabilities and limitations. By testing Flan-T5 under diverse scenarios, we will uncover how factors like batch size, learning rate, and dataset size affect its ability to extract key information from source texts. Our research will elucidate the complex interplay between hyperparameters and summary quality, underscoring their importance in optimizing summarization performance. Overall, this investigation will deliver a nuanced understanding of how the Flan-T5 model performs on dialogue summarization, highlighting avenues for improvement. | |
dc.format.extent | 48 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14154/69708 | |
dc.language.iso | en | |
dc.publisher | Saudi Digital Library | |
dc.subject | text summarization | |
dc.subject | Flan T5 | |
dc.title | Automatic Multi-Document Text Summarization | |
dc.type | Thesis | |
sdl.degree.department | Computer Science | |
sdl.degree.discipline | Advanced Computer Science | |
sdl.degree.grantor | University of Sheffield | |
sdl.degree.name | Master's Degree |