command line tesseract with makebox option gives incorrect and overlapping bounding boxes

### Current Behavior

Using tesseract to OCR simple English text.  Scanned document at 600 dpi.  Cleaned up to be pure black and white pixels.  Invoking tesseract with command:

	tesseract --psm 6 page.tiff page.box makebox

I've attached images to show the types of box errors that I get.  Mainly it seems to be a set of three boxes, two seem okay, but then a third overlaps half of the other two boxes, often with a box which is clearly too tall -- the image has an "e" followed by an "n", and both have reasonable boxes, but then there is a third that is full-height, and overlaps the right half of the "e" and the left half of the "n".  Same sort of thing for the trailing "he" in "the".  Even does this for a text input that "looks" almost identical to another which it gets right.

<img width="1106" height="452" alt="Image" src="https://github.com/user-attachments/assets/1dc988d1-5035-4e54-ab0f-cb1161563dc0" />
<img width="904" height="218" alt="Image" src="https://github.com/user-attachments/assets/818b8111-c0fe-4a43-bda2-5b5f777d5066" />
<img width="1108" height="248" alt="Image" src="https://github.com/user-attachments/assets/9bda06d3-83d1-4c33-baa6-7fb75d5beac4" />

### Expected Behavior

I would expect that each box actually fits the character it is associated with, and other than some kerning situations, boxes don't overlap.


### Suggested Fix

_No response_

### tesseract -v

tesseract 5.5.0
 leptonica-1.85.0
  libgif 5.2.2 : libjpeg 6b (libjpeg-turbo 3.1.0) : libpng 1.6.44 : libtiff 4.7.0 : zlib 1.3.1.zlib-ng : libwebp 1.5.0
 Found AVX2
 Found AVX
 Found FMA
 Found SSE4.1
 Found libcurl/8.11.1 OpenSSL/3.2.6 zlib/1.3.1.zlib-ng brotli/1.1.0 libidn2/2.3.8 libpsl/0.21.5 libssh/0.11.3/openssl/zlib nghttp2/1.64.0 OpenLDAP/2.6.10


### Operating System

_No response_

### Other Operating System

Fedora Core 42 Linux

### uname -a

Linux jkl 6.16.8-200.fc42.x86_64 #1 SMP PREEMPT_DYNAMIC Fri Sep 19 17:47:18 UTC 2025 x86_64 GNU/Linux


### Compiler

_No response_

### CPU

_No response_

### Virtualization / Containers

_No response_

### Other Information

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

command line tesseract with makebox option gives incorrect and overlapping bounding boxes #4472

Current Behavior

Expected Behavior

Suggested Fix

tesseract -v

Operating System

Other Operating System

uname -a

Compiler

CPU

Virtualization / Containers

Other Information

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

command line tesseract with makebox option gives incorrect and overlapping bounding boxes #4472

Description

Current Behavior

Expected Behavior

Suggested Fix

tesseract -v

Operating System

Other Operating System

uname -a

Compiler

CPU

Virtualization / Containers

Other Information

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions