Urban Scene Understanding for Artistic Video Stylization

A deep learning project for real-time artistic stylization of dashcam videos. It uses a semantic segmentation model, trained on the CityScape dataset, to understand urban scenes and transform them into an oil painting-like style.

Project Structure

Urban-Scene-Understanding/
├── notebooks/
│   ├── inference.ipynb          # Main notebook for running video stylization
│   └── training.ipynb           # Notebook for model training and experimentation
├── sample_data/
│   ├── videos/                  # Sample video files for testing
│   │   ├── test-1.mp4
│   │   └── test-2.mp4
│   ├── images/                  # Sample images for testing
│   │   ├── test_video-1.png
│   │   ├── test_video-2.png
│   │   ├── frame0.jpg
│   │   └── ...
│   └── custom_dataset/          # Custom annotated data
│       ├── images/              # Original frames
│       └── masks/               # Custom segmentation masks
├── models/
│   ├── stylization_model_v1/    # Pre-trained model version 1
│   └── stylization_model_v2/    # Pre-trained model version 2 (latest)
├── archive/                     # Archived experimental models and data
└── README.md

Key Features

Real-time Video Processing: Applies semantic segmentation and artistic stylization to video streams.
Artistic Stylization: Converts segmented scenes into an "oil painting" style with custom color mapping.
Pre-trained Models: Includes multiple pre-trained models ready for inference.
Deep Learning Backbone: Built with TensorFlow and Keras, utilizing U-Net-like architectures for segmentation.

How It Works

The processing pipeline is straightforward:

A frame is captured from the input video.
The frame is preprocessed (denoising, brightness/contrast adjustment) and resized.
The preprocessed frame is passed to a pre-trained semantic segmentation model.
The model outputs a segmentation mask, classifying each pixel into categories like road, vehicle, building, etc.
A custom function maps these categories to a specific color palette, creating the oil painting effect.
The final stylized frame is displayed alongside the original.

Technologies Used

TensorFlow / Keras - Deep learning framework
OpenCV - Computer vision and video processing
NumPy - Numerical computations
Matplotlib - Visualization and plotting
Jupyter Notebook - Interactive development environment

How to Run

Prerequisites

Ensure you have the required Python libraries installed:

pip install tensorflow opencv-python matplotlib numpy

Running the Inference

Clone this repository:

git clone https://github.com/jayan110105/Urban-Scene-Understanding.git
cd Urban-Scene-Understanding

Open the inference notebook:

jupyter notebook notebooks/inference.ipynb

In the notebook, you can:
- Change the video_path variable to point to your test video
- Change the img_path variable to test on individual images
- Run the cells to see the stylized output

Running Training (Optional)

If you want to train your own models:

Prepare the CityScape dataset in the appropriate directory structure
Open notebooks/training.ipynb
Follow the training pipeline in the notebook

Models

This repository contains pre-trained models in the models/ directory:

stylization_model_v1: First version of the stylization model
stylization_model_v2: Latest and most refined version (recommended)

The models are trained on the CityScape dataset and optimized for urban scene understanding with artistic output.

Sample Data

The sample_data/ directory contains:

Videos: Sample dashcam footage for testing the real-time stylization
Images: Individual frames for quick testing and debugging
Custom Dataset: Your own annotated data for further training or evaluation

Notes

The project uses legacy TensorFlow SavedModel format. For newer Keras versions, you may need to adapt the model loading code.
Video processing works best with dashcam footage containing urban scenes similar to the CityScape dataset.
The "oil painting" effect is achieved through custom color mapping of segmentation classes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Urban Scene Understanding for Artistic Video Stylization

Project Structure

Key Features

How It Works

Technologies Used

How to Run

Prerequisites

Running the Inference

Running Training (Optional)

Models

Sample Data

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.ipynb_checkpoints		.ipynb_checkpoints
archive		archive
models		models
notebooks		notebooks
sample_data		sample_data
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Urban Scene Understanding for Artistic Video Stylization

Project Structure

Key Features

How It Works

Technologies Used

How to Run

Prerequisites

Running the Inference

Running Training (Optional)

Models

Sample Data

Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages