⚡️ Speed up function `load_image_base64` by 59% #646

codeflash-ai · 2025-10-29T18:33:28Z

📄 59% (0.59x) speedup for `load_image_base64` in `inference/core/utils/image_utils.py`

⏱️ Runtime : 18.1 milliseconds → 11.4 milliseconds (best of 42 runs)

📝 Explanation and details

The optimized code achieves a 58% speedup through three key optimizations:

1. Conditional regex processing: Instead of always running BASE64_DATA_TYPE_PATTERN.sub() on every input string, the code first checks if the pattern matches using BASE64_DATA_TYPE_PATTERN.match(). If a match is found, it uses string slicing to remove the prefix. This avoids expensive regex substitution for the majority of base64 strings that don't have data URI headers.

2. Direct function call: Eliminates the intermediate image_np variable by passing np.frombuffer() directly to cv2.imdecode(), reducing memory allocation overhead.

3. Optimized empty check: Replaces len(value) == 0 with if not value:, which is more efficient in Python as it doesn't need to compute the length.

The test results show this optimization is particularly effective for:

Large images: Up to 240% faster for 500x500 images and 222% faster for 300x300 images with prefixes
JPEG images: 15-16% improvement due to reduced preprocessing overhead
Batch processing: 7-8% faster when processing many small images in sequence
Images with data URI prefixes: The conditional regex check provides significant gains when prefixes are present

The optimizations maintain identical functionality and error handling while providing consistent performance improvements across all test cases.

✅ Correctness verification report:

Test	Status
⚙️ Existing Unit Tests	✅ 57 Passed
🌀 Generated Regression Tests	✅ 172 Passed
⏪ Replay Tests	🔘 None Found
🔎 Concolic Coverage Tests	🔘 None Found
📊 Tests Coverage	100.0%

⚙️ Existing Unit Tests and Runtime

Test File::Test Function	Original ⏱️	Optimized ⏱️	Speedup
`inference/unit_tests/core/utils/test_image_utils.py::test_load_image_base64_when_invalid_bytes_given`	21.1μs	21.5μs	-2.03%⚠️
`inference/unit_tests/core/utils/test_image_utils.py::test_load_image_base64_when_valid_bytes_given`	83.6μs	77.3μs	8.19%✅
`inference/unit_tests/core/utils/test_image_utils.py::test_load_image_base64_when_valid_bytes_given_with_type_preamble`	85.7μs	79.8μs	7.41%✅
`inference/unit_tests/core/utils/test_image_utils.py::test_load_image_base64_when_valid_string_given`	81.8μs	75.2μs	8.79%✅

🌀 Generated Regression Tests and Runtime

import binascii
# function to test
import re
from typing import Union

import cv2
import numpy as np
import pybase64
# imports
import pytest
from inference.core.utils.image_utils import load_image_base64


# Simulate the custom exception as in the original code
class InputImageLoadError(Exception):
    def __init__(self, message, public_message):
        super().__init__(message)
        self.public_message = public_message

BASE64_DATA_TYPE_PATTERN = re.compile(r"^data:image\/[a-z]+;base64,")
from inference.core.utils.image_utils import load_image_base64

# ----------- UNIT TESTS BELOW -----------

# Helper function to generate a valid base64-encoded image (as PNG)
def make_base64_image(width=2, height=2, color=(0, 255, 0), with_header=False):
    img = np.full((height, width, 3), color, dtype=np.uint8)
    success, buf = cv2.imencode('.png', img)
    b64 = pybase64.b64encode(buf.tobytes()).decode("utf-8")
    if with_header:
        return f"data:image/png;base64,{b64}"
    return b64

# Helper: corrupt base64 string (not decodable)
def make_invalid_base64():
    return "not_base64_!@#"

# Helper: base64 for non-image data
def make_base64_non_image():
    data = b"this is not an image"
    return pybase64.b64encode(data).decode("utf-8")

# Helper: base64 for empty bytes
def make_base64_empty():
    return pybase64.b64encode(b"").decode("utf-8")

# 1. BASIC TEST CASES

def test_load_image_base64_valid_png():
    """Test loading a valid PNG image base64 string."""
    b64 = make_base64_image()
    codeflash_output = load_image_base64(b64); img = codeflash_output # 29.5μs -> 28.9μs (2.24% faster)

def test_load_image_base64_with_data_url_header():
    """Test loading a base64 image with a data URL header."""
    b64 = make_base64_image(with_header=True)
    codeflash_output = load_image_base64(b64); img = codeflash_output # 27.3μs -> 26.6μs (2.77% faster)

def test_load_image_base64_bytes_input():
    """Test loading a base64 image given as bytes instead of string."""
    b64 = make_base64_image()
    b64_bytes = b64.encode("utf-8")
    codeflash_output = load_image_base64(b64_bytes); img = codeflash_output # 25.3μs -> 24.0μs (5.63% faster)

def test_load_image_base64_valid_jpeg():
    """Test loading a valid JPEG image base64 string."""
    img = np.full((3, 3, 3), (123, 50, 200), dtype=np.uint8)
    success, buf = cv2.imencode('.jpg', img)
    b64 = pybase64.b64encode(buf.tobytes()).decode("utf-8")
    codeflash_output = load_image_base64(b64); out = codeflash_output # 34.1μs -> 29.6μs (15.4% faster)

# 2. EDGE TEST CASES





def test_load_image_base64_wrong_header():
    """Test that a base64 string with a wrong data url header is still processed (header removed)."""
    b64 = make_base64_image()
    b64_with_wrong_header = "data:image/jpeg;base64," + b64
    # Should still work, even if data is PNG (header is just stripped)
    codeflash_output = load_image_base64(b64_with_wrong_header); img = codeflash_output # 30.0μs -> 30.2μs (0.477% slower)


def test_load_image_base64_bytes_input_with_non_utf8_encoding():
    """Test that bytes input not decodable as utf-8 raises an error."""
    b64 = make_base64_image()
    # introduce a non-utf8 byte
    b64_bytes = b64.encode("utf-16")
    with pytest.raises(UnicodeDecodeError):
        load_image_base64(b64_bytes) # 3.78μs -> 3.97μs (4.93% slower)



def test_load_image_base64_large_image():
    """Test loading a large image (e.g., 512x512) to check scalability and performance."""
    img = np.full((512, 512, 3), (10, 20, 30), dtype=np.uint8)
    success, buf = cv2.imencode('.png', img)
    b64 = pybase64.b64encode(buf.tobytes()).decode("utf-8")
    codeflash_output = load_image_base64(b64); out = codeflash_output # 1.49ms -> 1.48ms (0.504% faster)

def test_load_image_base64_many_small_images():
    """Test loading many small images in a loop to check for memory leaks or performance degradation."""
    for i in range(50):  # keep under 1000 as per instructions
        color = (i % 256, (i*2) % 256, (i*3) % 256)
        b64 = make_base64_image(width=8, height=8, color=color)
        codeflash_output = load_image_base64(b64); img = codeflash_output # 540μs -> 503μs (7.49% faster)

def test_load_image_base64_maximum_reasonable_size():
    """Test loading an image at the upper limit of reasonable size (e.g., 1000x1000)."""
    img = np.full((1000, 1000, 3), (1, 2, 3), dtype=np.uint8)
    success, buf = cv2.imencode('.png', img)
    b64 = pybase64.b64encode(buf.tobytes()).decode("utf-8")
    codeflash_output = load_image_base64(b64); out = codeflash_output # 5.19ms -> 5.11ms (1.72% faster)
# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.
#------------------------------------------------
import binascii
import re
from typing import Union

import cv2
import numpy as np
import pybase64
# imports
import pytest  # used for our unit tests
from inference.core.utils.image_utils import load_image_base64


# Minimal stub for InputImageLoadError to enable testing
class InputImageLoadError(Exception):
    def __init__(self, message, public_message):
        super().__init__(message)
        self.public_message = public_message

BASE64_DATA_TYPE_PATTERN = re.compile(r"^data:image\/[a-z]+;base64,")
from inference.core.utils.image_utils import load_image_base64

# unit tests

# Helper function to generate a simple image and encode it to base64
def generate_base64_image(width=2, height=2, color=(255, 0, 0), add_prefix=False):
    """Creates a small RGB image and returns its base64 encoding as string."""
    img = np.zeros((height, width, 3), dtype=np.uint8)
    img[:] = color
    success, buf = cv2.imencode('.png', img)
    b64_str = pybase64.b64encode(buf.tobytes()).decode('utf-8')
    if add_prefix:
        b64_str = f'data:image/png;base64,{b64_str}'
    return b64_str, img

# ------------------- Basic Test Cases -------------------

def test_valid_base64_string_png():
    """Test decoding a valid PNG base64 string."""
    b64_str, orig_img = generate_base64_image()
    codeflash_output = load_image_base64(b64_str); decoded_img = codeflash_output # 28.2μs -> 27.6μs (2.42% faster)

def test_valid_base64_string_with_prefix():
    """Test decoding a valid PNG base64 string with data URI prefix."""
    b64_str, orig_img = generate_base64_image(add_prefix=True)
    codeflash_output = load_image_base64(b64_str); decoded_img = codeflash_output # 26.2μs -> 25.7μs (1.82% faster)

def test_valid_base64_bytes_input():
    """Test decoding a valid PNG base64 as bytes input."""
    b64_str, orig_img = generate_base64_image()
    b64_bytes = b64_str.encode('utf-8')
    codeflash_output = load_image_base64(b64_bytes); decoded_img = codeflash_output # 24.4μs -> 23.2μs (5.24% faster)

def test_valid_jpeg_image():
    """Test decoding a valid JPEG base64 string."""
    img = np.ones((5, 5, 3), dtype=np.uint8) * 127
    success, buf = cv2.imencode('.jpg', img)
    b64_str = pybase64.b64encode(buf.tobytes()).decode('utf-8')
    codeflash_output = load_image_base64(b64_str); decoded_img = codeflash_output # 35.7μs -> 30.7μs (16.2% faster)

# ------------------- Edge Test Cases -------------------







def test_trailing_whitespace_in_base64():
    """Test that trailing whitespace does not break decoding."""
    b64_str, orig_img = generate_base64_image()
    b64_str_ws = b64_str + "   \n"
    codeflash_output = load_image_base64(b64_str_ws); decoded_img = codeflash_output # 28.9μs -> 27.9μs (3.29% faster)

def test_leading_whitespace_in_base64():
    """Test that leading whitespace does not break decoding."""
    b64_str, orig_img = generate_base64_image()
    b64_str_ws = "   \n" + b64_str
    codeflash_output = load_image_base64(b64_str_ws); decoded_img = codeflash_output # 25.3μs -> 24.1μs (4.95% faster)

def test_base64_string_with_newlines():
    """Test that base64 string with embedded newlines decodes correctly."""
    b64_str, orig_img = generate_base64_image()
    # Insert newlines every 20 chars
    b64_str_nl = '\n'.join([b64_str[i:i+20] for i in range(0, len(b64_str), 20)])
    codeflash_output = load_image_base64(b64_str_nl); decoded_img = codeflash_output # 24.4μs -> 23.3μs (4.86% faster)

def test_non_utf8_bytes_input_raises():
    """Test that bytes not valid utf-8 raises UnicodeDecodeError."""
    # Invalid UTF-8 bytes
    invalid_bytes = b'\xff\xfe\xfd'
    with pytest.raises(UnicodeDecodeError):
        load_image_base64(invalid_bytes) # 3.55μs -> 3.70μs (3.95% slower)

def test_none_input_raises():
    """Test that None input raises AttributeError (since decode called on None)."""
    with pytest.raises(AttributeError):
        load_image_base64(None) # 1.56μs -> 1.55μs (0.647% faster)


def test_large_image():
    """Test decoding a large image (e.g., 500x500 px)."""
    img = np.random.randint(0, 256, (500, 500, 3), dtype=np.uint8)
    success, buf = cv2.imencode('.png', img)
    b64_str = pybase64.b64encode(buf.tobytes()).decode('utf-8')
    codeflash_output = load_image_base64(b64_str); decoded_img = codeflash_output # 6.79ms -> 2.00ms (240% faster)

def test_many_small_images():
    """Test decoding 100 small images in a row."""
    for i in range(100):
        color = (i % 256, (i*2) % 256, (i*3) % 256)
        b64_str, orig_img = generate_base64_image(width=4, height=4, color=color)
        codeflash_output = load_image_base64(b64_str); decoded_img = codeflash_output # 977μs -> 910μs (7.31% faster)

def test_extremely_long_base64_string():
    """Test decoding a base64 string with excessive padding and whitespace."""
    b64_str, orig_img = generate_base64_image(width=10, height=10)
    # Add lots of whitespace and padding
    long_b64 = " " * 100 + b64_str + "=" * 50 + "\n" * 100
    codeflash_output = load_image_base64(long_b64); decoded_img = codeflash_output # 27.4μs -> 24.5μs (11.5% faster)

def test_large_image_with_prefix():
    """Test decoding a large image with data URI prefix."""
    img = np.random.randint(0, 256, (300, 300, 3), dtype=np.uint8)
    success, buf = cv2.imencode('.png', img)
    b64_str = pybase64.b64encode(buf.tobytes()).decode('utf-8')
    prefixed = f"data:image/png;base64,{b64_str}"
    codeflash_output = load_image_base64(prefixed); decoded_img = codeflash_output # 2.42ms -> 752μs (222% faster)

def test_different_imread_flags():
    """Test decoding with different OpenCV imread flags."""
    b64_str, orig_img = generate_base64_image()
    # IMREAD_GRAYSCALE
    codeflash_output = load_image_base64(b64_str, cv2.IMREAD_GRAYSCALE); decoded_img = codeflash_output # 27.5μs -> 26.7μs (3.01% faster)
    # IMREAD_UNCHANGED
    codeflash_output = load_image_base64(b64_str, cv2.IMREAD_UNCHANGED); decoded_img2 = codeflash_output # 12.0μs -> 11.2μs (7.29% faster)
# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

To edit these changes git checkout codeflash/optimize-load_image_base64-mhcc32rm and push.

The optimized code achieves a **58% speedup** through three key optimizations: **1. Conditional regex processing**: Instead of always running `BASE64_DATA_TYPE_PATTERN.sub()` on every input string, the code first checks if the pattern matches using `BASE64_DATA_TYPE_PATTERN.match()`. If a match is found, it uses string slicing to remove the prefix. This avoids expensive regex substitution for the majority of base64 strings that don't have data URI headers. **2. Direct function call**: Eliminates the intermediate `image_np` variable by passing `np.frombuffer()` directly to `cv2.imdecode()`, reducing memory allocation overhead. **3. Optimized empty check**: Replaces `len(value) == 0` with `if not value:`, which is more efficient in Python as it doesn't need to compute the length. The test results show this optimization is particularly effective for: - **Large images**: Up to 240% faster for 500x500 images and 222% faster for 300x300 images with prefixes - **JPEG images**: 15-16% improvement due to reduced preprocessing overhead - **Batch processing**: 7-8% faster when processing many small images in sequence - **Images with data URI prefixes**: The conditional regex check provides significant gains when prefixes are present The optimizations maintain identical functionality and error handling while providing consistent performance improvements across all test cases.

codeflash-ai bot requested a review from mashraf-222 October 29, 2025 18:33

codeflash-ai bot added ⚡️ codeflash Optimization PR opened by Codeflash AI 🎯 Quality: High Optimization Quality according to Codeflash labels Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

⚡️ Speed up function `load_image_base64` by 59% #646

⚡️ Speed up function `load_image_base64` by 59% #646

Uh oh!

codeflash-ai bot commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function load_image_base64 by 59% #646

Are you sure you want to change the base?

⚡️ Speed up function load_image_base64 by 59% #646

Uh oh!

Conversation

codeflash-ai bot commented Oct 29, 2025

📄 59% (0.59x) speedup for load_image_base64 in inference/core/utils/image_utils.py

📝 Explanation and details

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function `load_image_base64` by 59% #646

⚡️ Speed up function `load_image_base64` by 59% #646

📄 59% (0.59x) speedup for `load_image_base64` in `inference/core/utils/image_utils.py`