MathVisionNet is a machine learning model that converts handwritten math expressions into LaTeX. It is a personal project of mine designed to teach me how these systems work. I plan to build several different architectures to compare them, and to try different data augmentation techniques to see how they affect training.
After training, I want to explore ways to optimize the models so they run efficiently on limited hardware.
- CNN + LSTM Encoder-Decoder model (currently in progress)
- CNN + LSTM with Attention model
- CNN + Transformer Decoder model
- Vision Transformer + Transformer Decoder model
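The first architecture on the list can be sketched roughly as follows. This is a minimal illustration assuming PyTorch; the class names, layer sizes, and the choice of conditioning the decoder's initial hidden state on a pooled image feature are all my own placeholder choices, not a fixed design for the project.

```python
import torch
import torch.nn as nn

class CNNEncoder(nn.Module):
    """Small CNN that maps a grayscale expression image to a feature vector."""
    def __init__(self, feat_dim=256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.AdaptiveAvgPool2d(1),  # global pool to (B, 64, 1, 1)
        )
        self.fc = nn.Linear(64, feat_dim)

    def forward(self, x):
        h = self.conv(x).flatten(1)  # (B, 64)
        return self.fc(h)            # (B, feat_dim)

class LSTMDecoder(nn.Module):
    """LSTM that predicts LaTeX tokens, conditioned on the image feature
    through its initial hidden and cell states (no attention yet)."""
    def __init__(self, vocab_size, feat_dim=256, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.init_h = nn.Linear(feat_dim, hid_dim)
        self.init_c = nn.Linear(feat_dim, hid_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim, vocab_size)

    def forward(self, feat, tokens):
        h0 = torch.tanh(self.init_h(feat)).unsqueeze(0)  # (1, B, hid_dim)
        c0 = torch.tanh(self.init_c(feat)).unsqueeze(0)
        emb = self.embed(tokens)                         # (B, T, emb_dim)
        out, _ = self.lstm(emb, (h0, c0))
        return self.out(out)                             # (B, T, vocab_size)

# Toy forward pass: batch of 2 images, target length 5, vocab of 100 tokens.
enc, dec = CNNEncoder(), LSTMDecoder(vocab_size=100)
imgs = torch.randn(2, 1, 64, 256)
toks = torch.randint(0, 100, (2, 5))
logits = dec(enc(imgs), toks)
print(logits.shape)  # torch.Size([2, 5, 100])
```

The attention variant would differ mainly in keeping the encoder's spatial feature map instead of pooling it away, so the decoder can look at different image regions per token.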
To start, I am going to train these models on the MathWriting 2024 dataset. All of them will ultimately need much more data than that, so I plan to combine it with other handwritten-math datasets and see what I can do.