🏆 Developed for the Spectral Bridge Challenge at Cognizance, IIT Roorkee
🥉 Achieved 3rd Place in the competition
This project focuses on reconstructing missing portions of audio signals using sparse observed waveform points.
The challenge involves predicting missing target points from limited context observations while preserving the underlying spectral structure of the signal.
The proposed solution combines:
- Transformer-based neural architectures
- Gaussian Process interpolation priors
- Neural Process representations
- Context-aware normalization
- In-context learning strategies
The framework is designed to reconstruct sparse waveforms accurately under limited observation settings.
Each audio sample consists of:
- 100 time steps
- 20 observed context points
- 80 missing target points
The model must infer the missing waveform values using only the provided context points.
Important constraints:
- Each sample is independent
- No information can be transferred between samples
- Prediction must rely only on the sample-specific context observations
Dataset used:
Spectral Graffiti Dataset
https://www.kaggle.com/datasets/fda137/spectral-graffiti/data
Dataset properties:
- ~80,000 samples
- 100 time points per sample
- 20 context points
- 80 target points
- Sampling rate: 1kHz
The solution follows a hybrid reconstruction pipeline combining classical signal interpolation and deep neural learning.
Multiple Gaussian Process predictions with different kernel scales are computed to capture signal behavior across multiple frequencies.
These GP priors help the model estimate waveform continuity and smoothness.
A transformer-based architecture processes the context points and predicts missing targets.
The model dynamically learns latent spectral representations for each waveform sample.
Signals are normalized using context statistics:
value_norm = (value - mean_context) / std_contextThis stabilizes training across samples with varying amplitudes and signal distributions.
Training pipeline includes:
- Huber Loss
- AdamW Optimizer
- Learning Rate Warmup
- Cosine Learning Rate Decay
- Gradient Clipping
- Early Stopping
These strategies improve convergence stability and generalization performance.
Competition evaluation uses:
computed only on hidden target points where:
Is_Context = 0| Metric | Score |
|---|---|
| MSE | ~0.0018 |
| RMSE | ~0.043 |
The inference process follows:
- Extract context points
- Compute Gaussian Process interpolation features
- Pass features through the transformer model
- Predict missing waveform targets
- Generate submission CSV
Sample_ID, Time_ms, Predicted_ValueOnly rows with:
Is_Context = 0are included in final submissions.
- Python
- PyTorch
- Transformers
- Gaussian Processes
- Neural Processes
- NumPy
- Pandas
- Matplotlib
- Signal Processing
.
├── spectral_bridge_solution.ipynb
├── preprocessing/
├── models/
├── training/
├── evaluation/
├── utils/
└── submissions/spectral_bridge_solution.ipynbThe notebook will:
- preprocess the dataset
- train the reconstruction model
- perform validation
- generate predictions
- export final submission files
- Audio signal reconstruction
- Sparse waveform prediction
- Neural signal interpolation
- In-context learning research
- Time-series forecasting
- Spectral analysis systems
- Hybrid GP + Transformer reconstruction framework
- Context-aware signal representation learning
- Sparse waveform interpolation under low-observation settings
- Robust signal reconstruction using neural attention mechanisms
- Competition-grade inference pipeline