
Conversation


@przepeck przepeck commented Dec 8, 2025

🛠 Summary

CVS-177520
Adds a tokenize endpoint, same as the one for embeddings: https://github.com/openvinotoolkit/model_server/blob/main/demos/embeddings/README.md#usage-of-tokenize-endpoint-release-20254-or-weekly

🧪 Checklist

  • Unit tests added.
  • The documentation updated.
  • Change follows security best practices.

```diff
 for (size_t i = 0; i < size; ++i) {
     if (attentionMaskPtr[i] == 0 && !pad_to_max_length) {
-        break;
+        continue;
```
Collaborator

Why do we change that? Do we expect that after we see the first 0 in the attention mask, later elements can still be non-zero?

Collaborator Author

I encountered an issue where, with batched data, the attention mask started with 0; with this change the output was correct.

@przepeck przepeck requested a review from mzegla December 9, 2025 11:36
```cpp
    ASSERT_EQ(tokenArray.Size(), 25);
}
}
```

Collaborator

Looks like there is no test for `add_special_tokens=true`.

Collaborator Author

This is problematic for the vision model used here, since special tokens are reserved for images 🤔 I can implement this method only for LM models.
