Text Summarization using NLP


Authors : Kesanapalli Lakshmi Priyanka; Dr. Vinay V Hedge

Volume/Issue : Volume 8 - 2023, Issue 9 - September

Google Scholar : https://bit.ly/3TmGbDi

Scribd : https://tinyurl.com/yc2jyf6b

DOI : https://doi.org/10.5281/zenodo.8355268

Abstract : Text summarization is an area within natural language processing (NLP) that revolves around producing brief and condensed summaries from extended passages of text. The exponential growth of digital content has given rise to a vast quantity of textual information, creating a challenge for individuals to stay abreast of this information overload. While previous advancements in text summarization have marked significant achievements, there remains an existing void in adequately addressing the specific requirements for summarizing general textual content. The project's goal is to create a summarization system that generates concise summaries by using creative methods in natural language processing and sophisticated machine learning algorithms. This system will help fill the informational divide between lengthy texts and condensed summaries.The primary objective is to create an efficient and effective summarization model that enables text summarization and speech synthesis integrating the gTTS library, enabling the transformation of summaries into speech. We strived to empower users by developing customization options that grant them the ability to define summary attributes such as length and style, culminating in personalized and precisely tailored summarization outputs. This project seamlessly integrates web scraping, frequency-based text summarization, and a user-friendly Flask interface, enhancing content consumption and accessibility. Users input URLs, initiating efficient processes of extracting essential text, generating concise summaries, and estimating reading time. Web scraping extracts data for text summarization, using frequency- based scoring for succinct summaries. The Flask interface empowers users to input URLs, triggering content extraction and summarization. The project finds applications in content understanding, gTTS-enabled accessibility, and efficient information management. Beneficial for education, it aids in quick comprehension of complex subjects, supported by estimated reading time. Merging technology with user-centric design, it enriches learning, research, and content assimilation across domains. An empowering tool for academia, professionals, and personal exploration, it navigates the digital realm effectively. The project's integrated approach of web scraping, frequency-based text summarization, and Flask interface yields efficient content extraction, concise summaries, and estimated reading time. Quantitative analysis involves comparing the generated summaries' quality, coherence, and accuracy with existing literature.

Keywords : NLP, gTTS library, Flask, TextRank algorithm, URLs

Text summarization is an area within natural language processing (NLP) that revolves around producing brief and condensed summaries from extended passages of text. The exponential growth of digital content has given rise to a vast quantity of textual information, creating a challenge for individuals to stay abreast of this information overload. While previous advancements in text summarization have marked significant achievements, there remains an existing void in adequately addressing the specific requirements for summarizing general textual content. The project's goal is to create a summarization system that generates concise summaries by using creative methods in natural language processing and sophisticated machine learning algorithms. This system will help fill the informational divide between lengthy texts and condensed summaries.The primary objective is to create an efficient and effective summarization model that enables text summarization and speech synthesis integrating the gTTS library, enabling the transformation of summaries into speech. We strived to empower users by developing customization options that grant them the ability to define summary attributes such as length and style, culminating in personalized and precisely tailored summarization outputs. This project seamlessly integrates web scraping, frequency-based text summarization, and a user-friendly Flask interface, enhancing content consumption and accessibility. Users input URLs, initiating efficient processes of extracting essential text, generating concise summaries, and estimating reading time. Web scraping extracts data for text summarization, using frequency- based scoring for succinct summaries. The Flask interface empowers users to input URLs, triggering content extraction and summarization. The project finds applications in content understanding, gTTS-enabled accessibility, and efficient information management. Beneficial for education, it aids in quick comprehension of complex subjects, supported by estimated reading time. Merging technology with user-centric design, it enriches learning, research, and content assimilation across domains. An empowering tool for academia, professionals, and personal exploration, it navigates the digital realm effectively. The project's integrated approach of web scraping, frequency-based text summarization, and Flask interface yields efficient content extraction, concise summaries, and estimated reading time. Quantitative analysis involves comparing the generated summaries' quality, coherence, and accuracy with existing literature.

Keywords : NLP, gTTS library, Flask, TextRank algorithm, URLs

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe