Skip to content

Commit 49407c1

Browse files
author
EC2 Default User
committed
[ADD] Tesseract for Korean
1 parent 7fd3812 commit 49407c1

File tree

2 files changed

+333
-4
lines changed

2 files changed

+333
-4
lines changed

genai/aws-gen-ai-kr/00_setup/install_tesseract.sh

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -29,6 +29,9 @@ cd /usr/local/share/tessdata
2929
wget https://github.com/tesseract-ocr/tessdata/raw/main/osd.traineddata
3030
wget https://github.com/tesseract-ocr/tessdata/raw/main/eng.traineddata
3131
wget https://github.com/tesseract-ocr/tessdata/raw/main/hin.traineddata
32+
wget https://github.com/tesseract-ocr/tessdata_best/blob/main/kor.traineddata
33+
wget https://github.com/tesseract-ocr/tessdata_best/blob/main/kor_vert.traineddata
34+
wget https://github.com/tesseract-knu2019/tesseract-knu2019.github.io/raw/master/tessdata/koreng2.traineddata.gz
3235

3336
echo "## Step 4"
3437
echo "export TESSDATA_PREFIX=/usr/local/share/tessdata" >> ~/.bash_profile

0 commit comments

Comments
 (0)