Keyword_extraction_using_NLP

This project demonstrates how to extract keywords from raw text using three popular NLP techniques:

🔢 TF-IDF (Term Frequency-Inverse Document Frequency)
🧠 RAKE (Rapid Automatic Keyword Extraction)
🔗 TextRank (Graph-based ranking algorithm)

Developed entirely in Google Colab using Python and nltk, rake_nltk, sklearn, and networkx.

🚀 Project Objective

To compare different keyword extraction methods and understand how each performs on a sample corpus of text. This is useful for applications in:

Search engine optimization
Summarization tools
Content classification
Information retrieval

📊 Methods & Workflow

Data Input: Raw text string (manually input).
Preprocessing: Tokenization, stopword removal.
TF-IDF: Extracts top words based on statistical frequency.
RAKE: Extracts keyword phrases based on word co-occurrence and frequency.
TextRank: Builds a graph of words and uses PageRank to find the most relevant ones.

🛠️ Libraries Used

nltk
rake_nltk
sklearn
networkx
matplotlib

📓 Notebook

✨ Output

Each method extracts a ranked list of keywords from the same input text.
This helps visually and practically compare how different techniques interpret "importance."

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Nlp_project.ipynb		Nlp_project.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Keyword_extraction_using_NLP

🚀 Project Objective

📊 Methods & Workflow

🛠️ Libraries Used

📓 Notebook

✨ Output

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Keyword_extraction_using_NLP

🚀 Project Objective

📊 Methods & Workflow

🛠️ Libraries Used

📓 Notebook

✨ Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages