LSTM Encoder Decoder Based Text Highlight Abstraction Method Using Summaries Extracted by PageRank

dc.contributor.authorAltundogan T.G.
dc.contributor.authorKarakose M.
dc.date.accessioned2024-07-22T08:03:35Z
dc.date.available2024-07-22T08:03:35Z
dc.date.issued2023
dc.description.abstractAutomatic highlighting from texts is an abstractive summarization problem that is frequently focused on in natural language processing. In encoder-decoder architectures, developed for abstractive summarization, as the size of the input array increases, the learning ability of the architecture becomes difficult. To solve this problem, the focus is on minimizing this disadvantage of encoder - decoder architectures by using the Attention mechanism. In this study, we used an LSTM encoder - decoder with an attention mechanism to perform the highlight abstraction process. In addition, we used an extractive summarization step as a preprocess to increase the learning ability of the encoder - decoder architecture and reduce the input text size. We preferred the PageRank method in the extractive summarization process here. In the PageRank method, sentence vectors were extracted by using Glove embeddings to calculate similarities of text sentences. The proposed approach achived the extractive summarization by 67.6% and abstractive summarization by 59.6% in ROUGE-1 score. © 2023 IEEE.
dc.identifier.DOI-ID10.1109/IT57431.2023.10078652
dc.identifier.urihttp://akademikarsiv.cbu.edu.tr:4000/handle/123456789/12357
dc.language.isoEnglish
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectAbstracting
dc.subjectArchitecture
dc.subjectDecoding
dc.subjectLong short-term memory
dc.subjectSignal encoding
dc.subjectAbstraction methods
dc.subjectAbstractive summarization
dc.subjectAttention
dc.subjectAttention mechanisms
dc.subjectEncoder-decoder
dc.subjectEncoder-decoder architecture
dc.subjectExtractive summarizations
dc.subjectLearning abilities
dc.subjectPage ranks
dc.subjectPageRank methods
dc.subjectNatural language processing systems
dc.titleLSTM Encoder Decoder Based Text Highlight Abstraction Method Using Summaries Extracted by PageRank
dc.typeConference paper

Files