Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
20 changes: 20 additions & 0 deletions Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
# Use an official Python runtime as a parent image
FROM python:3.8-slim

# Set the working directory to /app
WORKDIR /app

# Copy only the requirements.txt file into the container
COPY requirements.txt /app/

# Install the required dependencies. This layer will be cached as long as requirements.txt doesn't change.
RUN pip install -r requirements.txt

# Now copy the rest of your application files (this happens after dependencies are installed)
COPY . /app/

# Make port 8888 available to the outside world
EXPOSE 8888

# Run Jupyter Notebook when the container launches
CMD ["jupyter", "notebook", "--ip=0.0.0.0", "--port=8888", "--no-browser", "--allow-root", "--NotebookApp.token=''"]
842 changes: 842 additions & 0 deletions pdfextract.ipynb

Large diffs are not rendered by default.

9 changes: 9 additions & 0 deletions requirements.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
torch==2.0.0
transformers==4.24.0
tensorboardX==2.0
lxml==4.9.1
seqeval==0.0.12
Pillow==9.3.0
pdf2image==1.17.0
pytesseract==0.3.13
jupyter==1.1.1
Binary file added traildata/.DS_Store
Binary file not shown.
Empty file.
Loading