Information Retrieval (IR) Models Overview

This project focuses on building and understanding various Information Retrieval (IR) models to retrieve relevant documents based on keyword search. It covers basic and advanced techniques to enhance document ranking and retrieval accuracy.

Objectives

Develop a basic document search engine.
Implement ranking mechanisms for relevance-based document retrieval.

Key Features

Keyword Matching:
- Matches query terms with document terms.
- Limitation: Only considers exact matches, ignoring synonyms and context.
TF-IDF Scoring:
- Term Frequency (TF): Measures the frequency of terms in a document.
- Inverse Document Frequency (IDF): Evaluates the uniqueness of terms across all documents.
- Combined TF-IDF: Ranks documents by their importance based on query terms.
Cosine Similarity:
- Measures the cosine of the angle between two vectors (query vs. document) to determine their similarity.

Information Retrieval Models

Structured Models:
- Organizes documents using structured data for easier navigation.
Non-Overlapped List Model:
- Matches terms from queries with non-overlapping document lists.
Proximal Nodes Model:
- Represents documents as nodes in a graph, focusing on term proximity.
Set-Theoretic Models:
- Boolean Model: Matches documents based on Boolean logic.
- Extended Boolean Model: Incorporates fuzzy logic for partial matching.
- Fuzzy IR Model: Handles imprecise queries using fuzzy logic.
Hypertext Model:
- Links documents via hypertext for web-like navigation.
Probabilistic Models:
- Belief Network Model: Uses probabilistic reasoning to determine relevance.
Neural Network Models:
- Leverages deep learning (e.g., CNNs, RNNs, transformers) for semantic understanding and contextual relevance.

Evolution of IR Models

The field of IR has evolved from simple keyword-based methods to advanced neural network techniques, enabling improved accuracy and relevance in document retrieval.

Contributors

Usama Mehboob (2021-CS-10)
Hamza Rasheed (2021-CS-26)
Bilal Baig (2021-CS-36)
Usman Asghar (2021-CS-46)

Supervisor

Prof. Dr. Khaldoon Khurshid

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Assignment 1		Assignment 1
Assignment 2		Assignment 2
Assignment 3		Assignment 3
Assignment 4		Assignment 4
Assignment 5		Assignment 5
Assignment 6		Assignment 6
Assignment 7		Assignment 7
Final Integration		Final Integration
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Information Retrieval (IR) Models Overview

Objectives

Key Features

Information Retrieval Models

Evolution of IR Models

Contributors

Supervisor

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Information Retrieval (IR) Models Overview

Objectives

Key Features

Information Retrieval Models

Evolution of IR Models

Contributors

Supervisor

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages