A lightweight pipeline to download, parse, and analyze protein structures using their PDB IDs. Built with Biopython, it enables inspection of chain organization and amino acid composition from 3D protein data fetched from RCSB PDB.
- Downloads
.pdbfiles directly from RCSB PDB - Parses protein structures using Biopython
- Identifies chains and residues in the model
- Computes amino acid composition (e.g., GLY, ALA, LEU counts)
- Works in both command-line and interactive mode
Interactive mode:
python protein_parser.pyCommand-line mode:
python protein_parser.py 8WIXprotein-structure-prediction/
├── sample_data/
│ └── 8WIX.pdb Sample structure for testing
├── protein_parser.py Main Python script
└── README.md
Downloaded 8WIX.pdb successfully!
Structure Summary for 8WIX
Chains Found: 2
Total Residues: 284
Amino Acid Composition:
GLY: 35
ALA: 20
LEU: 40
SER: 18
...
- Python 3.7+
- Biopython
- requests
pip install biopython requestsMIT License. See LICENSE for details.