DocTr去扭曲

2024-08-21 16:03:39 +08:00
parent 0e17f2b9aa
commit add29504e2
26 changed files with 2212 additions and 469 deletions
--- a/doc_dewarp/README.md
+++ b/doc_dewarp/README.md
@@ -0,0 +1,73 @@
+# DocTrPP: DocTr++ in PaddlePaddle
+
+## Introduction
+
+This is a PaddlePaddle implementation of DocTr++. The original paper is [DocTr++: Deep Unrestricted Document Image Rectification](https://arxiv.org/abs/2304.08796). The original code is [here](https://github.com/fh2019ustc/DocTr-Plus).
+
+![demo](https://github.com/GreatV/DocTrPP/assets/17264618/4e491512-bfc4-4e69-a833-fd1c6e17158c)
+
+## Requirements
+
+You need to install the latest version of PaddlePaddle, which is done through this [link](https://www.paddlepaddle.org.cn/).
+
+## Training
+
+1. Data Preparation
+
+To prepare datasets, refer to [doc3D](https://github.com/cvlab-stonybrook/doc3D-dataset).
+
+2. Training
+
+```shell
+sh train.sh
+```
+
+or
+
+```shell
+export OPENCV_IO_ENABLE_OPENEXR=1
+export CUDA_VISIBLE_DEVICES=0
+
+python train.py --img-size 288 \
+    --name "DocTr++" \
+    --batch-size 12 \
+    --lr 2.5e-5 \
+    --exist-ok \
+    --use-vdl
+```
+
+3. Load Trained Model and Continue Training
+
+```shell
+export OPENCV_IO_ENABLE_OPENEXR=1
+export CUDA_VISIBLE_DEVICES=0
+
+python train.py --img-size 288 \
+    --name "DocTr++" \
+    --batch-size 12 \
+    --lr 2.5e-5 \
+    --resume "runs/train/DocTr++/weights/last.ckpt" \
+    --exist-ok \
+    --use-vdl
+```
+
+## Test and Inference
+
+Test the dewarp result on a single image:
+
+```shell
+python predict.py -i "crop/12_2 copy.png" -m runs/train/DocTr++/weights/best.ckpt -o 12.2.png
+```
+![document image rectification](https://raw.githubusercontent.com/greatv/DocTrPP/main/doc/imgs/document_image_rectification.jpg)
+
+## Export to onnx
+
+```
+pip install paddle2onnx
+
+python export.py -m ./best.ckpt --format onnx
+```
+
+## Model Download
+
+The trained model can be downloaded from [here](https://github.com/GreatV/DocTrPP/releases/download/v0.0.2/best.ckpt).