KshBlitz/JobMate

Cybersecurity Job Aggregation and Analysis Tool

A Python-based automation tool that discovers, filters, and organizes cybersecurity job postings from multiple platforms using API integration and data processing.

The system automates the job discovery process by querying a job search API, filtering relevant cybersecurity roles, extracting experience requirements from job descriptions, and exporting the results into a structured Excel dataset.


Project Overview

Searching for cybersecurity jobs manually across platforms such as LinkedIn, Indeed, or company career portals can be time-consuming and repetitive. Each platform requires repeated searches, manual filtering, and reading long job descriptions.

This project automates the process by building a data pipeline that:

  • Retrieves job listings from an external job search API
  • Filters results to retain only cybersecurity-related roles
  • Extracts experience requirements from job descriptions
  • Organizes job data into a structured dataset
  • Exports results to Excel for quick review and application

Key Features

  • Automated job discovery using API integration
  • Multi-role cybersecurity job search
  • Multi-location job aggregation
  • Duplicate job removal
  • Cybersecurity keyword filtering
  • Experience extraction from job descriptions
  • Structured dataset generation using Pandas
  • Excel export with timestamp-based file naming

Technology Stack

| Technology | Purpose |
| --- | --- |
| Python | Core programming language |
| Requests | API communication |
| Pandas | Data processing and dataset generation |
| Regex | Experience extraction from job descriptions |
| RapidAPI (JSearch API) | Job listing data source |
| Excel (.xlsx) | Structured dataset output |


System Architecture

Job Search API (RapidAPI - JSearch)
            │
            ▼
      Python Script
            │
            ▼
     Duplicate Removal
            │
            ▼
     Keyword Filtering
            │
            ▼
 Experience Extraction
            │
            ▼
    Pandas DataFrame
            │
            ▼
     Excel Dataset
            │
            ▼
      User Review

Job Search Strategy

Roles Queried

  • SOC Analyst
  • Cyber Security Analyst
  • SIEM Engineer
  • Threat Analyst
  • Detection Engineer
  • Incident Response Analyst

Locations Queried

  • Pune
  • Mumbai
  • Navi Mumbai
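
As a rough illustration of how the roles and locations above could be combined into search queries, the sketch below pairs every role with every location. The `build_queries` helper and the "<role> in <location>" phrasing are assumptions made for this example, not taken from the script.

```python
from itertools import product

ROLES = [
    "SOC Analyst",
    "Cyber Security Analyst",
    "SIEM Engineer",
    "Threat Analyst",
    "Detection Engineer",
    "Incident Response Analyst",
]
LOCATIONS = ["Pune", "Mumbai", "Navi Mumbai"]


def build_queries(roles, locations):
    """Return one search string per (role, location) pair."""
    # Assumption: the query text is phrased as "<role> in <location>".
    return [f"{role} in {location}" for role, location in product(roles, locations)]


queries = build_queries(ROLES, LOCATIONS)
print(len(queries))  # 6 roles x 3 locations = 18 queries
print(queries[0])    # SOC Analyst in Pune
```

With six roles and three locations, one run issues 18 queries, which is worth keeping in mind when an API plan has a request quota.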

Filtering Logic

The system filters jobs using cybersecurity-specific keywords to ensure role relevance.

Keywords:

soc, security, cyber, siem, threat, incident, detection, blue team, sentinel, defender

If neither the job title nor the description contains at least one of these keywords, the job is discarded.
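
A minimal sketch of this filter, assuming a case-insensitive substring check over the title and description (the function name `is_cybersecurity_job` is hypothetical):

```python
KEYWORDS = (
    "soc", "security", "cyber", "siem", "threat",
    "incident", "detection", "blue team", "sentinel", "defender",
)


def is_cybersecurity_job(title, description):
    """Return True if any keyword appears in the title or description."""
    # Treat missing fields as empty strings and compare in lowercase.
    text = f"{title or ''} {description or ''}".lower()
    return any(keyword in text for keyword in KEYWORDS)


print(is_cybersecurity_job("SOC Analyst", "Monitor alerts in the SIEM"))  # True
print(is_cybersecurity_job("Chef", "Cooking meals"))                      # False
```

A plain substring check is simple but loose: "soc" also occurs inside words such as "associate", so matching on word boundaries may be preferable if false positives become a problem.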


Experience Extraction Logic

The system extracts experience requirements from job descriptions using regular expressions.

Example patterns detected:

  • 1 year
  • 2+ years
  • 1-3 years
  • minimum 2 years
  • 3 yrs

Regex used:

\d+\+?\s*(?:-|to)?\s*\d*\s*(year|years|yr|yrs)

If no experience requirement is detected, the value is recorded as Not specified.
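
The sketch below shows one way the extraction could be wired up. It uses a slightly adjusted variant of the regex above, not the exact pattern: the unit alternatives are listed longest-first and the group is non-capturing, so that the full text (for example "2+ years" instead of "2+ year") is returned. The function name `extract_experience` is an assumption.

```python
import re

# Variant of the documented pattern: longest unit first, non-capturing group.
EXPERIENCE_PATTERN = re.compile(
    r"\d+\+?\s*(?:-|to)?\s*\d*\s*(?:years|year|yrs|yr)",
    re.IGNORECASE,
)


def extract_experience(description):
    """Return the first experience phrase found, or 'Not specified'."""
    match = EXPERIENCE_PATTERN.search(description or "")
    return match.group(0) if match else "Not specified"


print(extract_experience("Requires 1-3 years of SOC experience"))  # 1-3 years
print(extract_experience("Minimum 2 years in SIEM operations"))    # 2 years
print(extract_experience("Freshers welcome"))                      # Not specified
```

Note that for wording such as "minimum 2 years", only the numeric part ("2 years") is captured, and only the first match in a description is returned.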


Data Processing Workflow

  1. Query the API for each job role
  2. Repeat each role query for each location
  3. Receive JSON job data
  4. Remove duplicate job links
  5. Filter jobs using cybersecurity keywords
  6. Extract experience requirement from job descriptions
  7. Structure data using pandas
  8. Sort jobs by posting date
  9. Export results to Excel
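
Step 4 of the workflow could look roughly like the sketch below, which keeps the first job seen for each apply link. The `job_apply_link` key is an assumption about the JSON field name, and `remove_duplicates` is a hypothetical helper.

```python
def remove_duplicates(jobs, key="job_apply_link"):
    """Keep the first job for each link, preserving the original order."""
    seen = set()
    unique = []
    for job in jobs:
        link = job.get(key)
        if link is None:
            # Jobs without a link cannot be compared, so they are kept.
            unique.append(job)
            continue
        if link in seen:
            continue
        seen.add(link)
        unique.append(job)
    return unique


jobs = [
    {"job_apply_link": "https://example.com/a"},
    {"job_apply_link": "https://example.com/b"},
    {"job_apply_link": "https://example.com/a"},  # duplicate of the first
]
print(len(remove_duplicates(jobs)))  # 2
```

Deduplicating before filtering and extraction avoids processing the same posting several times when it is returned for more than one role or location.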

Output Dataset Structure

| Column | Description |
| --- | --- |
| Title | Job role |
| Company | Hiring organization |
| Location | Job location |
| Posted | Time since posting |
| Experience Needed | Extracted from job description |
| Remote | Remote availability |
| Publisher | Job source |
| Job Description | Truncated job description |
| Apply Link | Direct job application URL |
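
To make the column layout concrete, here is a minimal sketch of building the dataset with pandas and exporting it under a timestamp-based file name. The helper names, the `cybersecurity_jobs` prefix, and the timestamp format are assumptions; only the column names come from the description above.

```python
from datetime import datetime

import pandas as pd

COLUMNS = [
    "Title", "Company", "Location", "Posted", "Experience Needed",
    "Remote", "Publisher", "Job Description", "Apply Link",
]


def build_dataframe(rows):
    """Arrange row dictionaries into the documented column order."""
    # Columns missing from a row are filled with NaN by reindex.
    return pd.DataFrame(rows).reindex(columns=COLUMNS)


def timestamped_filename(prefix="cybersecurity_jobs", now=None):
    """Return a file name such as cybersecurity_jobs_20240102_030405.xlsx."""
    now = now or datetime.now()
    return f"{prefix}_{now:%Y%m%d_%H%M%S}.xlsx"


def export_jobs(df, path=None):
    """Write the DataFrame to an Excel file and return the path used."""
    path = path or timestamped_filename()
    df.to_excel(path, index=False)  # writing .xlsx requires openpyxl
    return path
```

Calling `export_jobs(build_dataframe(rows))` would write a new file on each run, so earlier exports are not overwritten.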


Example Output

| Title | Company | Location | Experience |
| --- | --- | --- | --- |
| SOC Analyst | Deloitte | Pune | 1-3 years |
| Threat Analyst | Accenture | Mumbai | 2+ years |
| SIEM Engineer | Capgemini | Pune | Not specified |


How to Run the Project

Install dependencies

pip install requests pandas openpyxl

Add RapidAPI Key

Add your RapidAPI key in the script:

headers = {
    "X-RapidAPI-Key": "YOUR_API_KEY",
    "X-RapidAPI-Host": "jsearch.p.rapidapi.com"
}
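
As an alternative to pasting the key into the script, the headers could be built from an environment variable so the key never ends up in version control. This is a sketch only: the variable name `RAPIDAPI_KEY` and the `build_headers` helper are assumptions, not part of the project.

```python
import os


def build_headers(env_var="RAPIDAPI_KEY"):
    """Build the RapidAPI request headers from an environment variable."""
    api_key = os.environ.get(env_var)
    if not api_key:
        # Fail early with a clear message instead of sending an empty key.
        raise RuntimeError(f"Set the {env_var} environment variable first")
    return {
        "X-RapidAPI-Key": api_key,
        "X-RapidAPI-Host": "jsearch.p.rapidapi.com",
    }
```

The variable would be set in the shell before running the script, for example with `export RAPIDAPI_KEY=...` on Linux or macOS.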

Run the script

python job_fetcher.py


Limitations

  • Experience extraction depends on job description wording
  • Salary extraction not implemented
  • No automated job scoring yet
  • API rate limits possible

Future Improvements

  • Job relevance scoring using CV skills
  • Automatic filtering for ≤3 years experience
  • Daily scheduled job scraping
  • Telegram job alerts
  • Database storage instead of Excel
  • Dashboard visualization

Conclusion

This project demonstrates how API integration, text processing, and automation can simplify cybersecurity job discovery.

By aggregating job postings into a structured dataset, the system reduces manual job search effort and allows users to quickly identify relevant SOC and cybersecurity job opportunities.

About

A job collection engine that gathers and organizes relevant opportunities so you spend less time searching and more time preparing.
