Skip to content

Latest commit

 

History

History
31 lines (25 loc) · 1.46 KB

File metadata and controls

31 lines (25 loc) · 1.46 KB

Lexicographic Context Mapper

Lexigraph is a Python and C-based tool designed to generate keyword-in-context concordances from text files, providing users with powerful text indexing and contextual analysis. This tool facilitates the exploration of large text datasets by indexing words and offering a detailed view of their contextual usage.

Key Features (Completed Tasks):

  • Dynamic Memory Management: Implemented efficient memory handling in C, with Valgrind confirming no memory leaks.
  • Flexible Input/Output: Added support for processing text from standard input or files, with results outputted to the console or files.
  • Exclusion Lists: Customizable exclusion lists (e.g., common conjunctions) are implemented for words to be ignored during indexing.
  • Regular Expressions: Integrated Python’s regex capabilities for precise pattern matching and keyword identification.
  • Text Exploration: Enhanced readability by capitalizing indexed words in their original context for easy navigation.
  • Platform Compatibility: Fully compatible with Linux environments, ensuring stable performance across different systems.

How to Use:

Clone the repository:

git clone https://github.com/username/lexigraph.git

Running the Program:

For Python:

python3 lexigraph.py --in input.txt --out output.txt

For C:

gcc -o lexigraph lexigraph.c
./lexigraph < input.txt > output.txt

Contributions are welcome!