⚡️ Speed up function `cartesian_to_axial` by 23% #81

codeflash-ai · 2025-10-29T05:30:38Z

📄 23% (0.23x) speedup for `cartesian_to_axial` in `src/bokeh/util/hex.py`

⏱️ Runtime : 1.68 milliseconds → 1.37 milliseconds (best of 116 runs)

📝 Explanation and details

The optimized code achieves a 22% speedup through several key performance improvements:

1. Eliminated Redundant Math Calculations

Precomputes math.sqrt(3.0) once and reuses it in both HEX_FLAT and HEX_POINTY arrays, avoiding duplicate square root calculations on every function call
Line profiler shows the constant creation overhead reduced from 139ms to 128ms total

2. Avoided Unnecessary Arithmetic Operations

Guards against aspect_scale=1 cases to skip multiplication/division when the scale factor has no effect
When aspect_scale=1, uses simpler x/size instead of x/size * 1 or y/size / 1
This optimization is particularly effective for the common case where aspect scaling isn't needed

3. Improved NumPy Rounding Performance

Replaced np.round() with np.rint(), which is often faster for large arrays
Line profiler shows rounding operations reduced from 883ms to 241ms total - a 73% improvement

4. Optimized Memory Usage

Uses np.asarray(..., dtype=int) instead of .astype(int), which avoids unnecessary copying when the array is already the correct dtype
Reduces memory allocation overhead in the return statement

Test Results Analysis:
The optimizations show consistent performance gains across all test scenarios:

Scalar inputs: 40-45% faster (most benefit from reduced overhead)
Small arrays: 15-25% faster
Large arrays (1000+ elements): 14-17% faster (np.rint improvement shines here)
aspect_scale != 1 cases: 7-12% faster (conditional logic reduces unnecessary operations)

The performance gains are most pronounced for single-point calculations and cases with default aspect scaling, making this optimization particularly valuable for common usage patterns.

✅ Correctness verification report:

Test	Status
⚙️ Existing Unit Tests	✅ 6 Passed
🌀 Generated Regression Tests	✅ 53 Passed
⏪ Replay Tests	🔘 None Found
🔎 Concolic Coverage Tests	🔘 None Found
📊 Tests Coverage	100.0%

⚙️ Existing Unit Tests and Runtime

Test File::Test Function	Original ⏱️	Optimized ⏱️	Speedup
`unit/bokeh/util/test_hex.py::Test_cartesian_to_axial.test_default_aspect_flattop`	44.7μs	38.9μs	14.9%✅
`unit/bokeh/util/test_hex.py::Test_cartesian_to_axial.test_default_aspect_pointytop`	36.5μs	32.0μs	13.9%✅

🌀 Generated Regression Tests and Runtime

from __future__ import annotations

import math
from typing import Any

import numpy as np
# imports
import pytest  # used for our unit tests
from bokeh.util.hex import cartesian_to_axial

# unit tests

# ------------------------
# Basic Test Cases
# ------------------------

def test_single_point_flattop_origin():
    # Test origin for flattop orientation, size=1
    q, r = cartesian_to_axial(0, 0, 1, 'flattop') # 30.1μs -> 21.3μs (41.3% faster)

def test_single_point_pointytop_origin():
    # Test origin for pointytop orientation, size=1
    q, r = cartesian_to_axial(0, 0, 1, 'pointytop') # 25.3μs -> 17.8μs (42.3% faster)

def test_single_point_flattop_positive():
    # Test a simple positive coordinate for flattop
    q, r = cartesian_to_axial(1, 0, 1, 'flattop') # 24.3μs -> 16.8μs (44.8% faster)

def test_single_point_pointytop_positive():
    # Test a simple positive coordinate for pointytop
    q, r = cartesian_to_axial(0, 1, 1, 'pointytop') # 24.7μs -> 17.2μs (43.3% faster)

def test_array_input_flattop():
    # Test array input for flattop orientation
    x = np.array([0, 1, 2])
    y = np.array([0, 1, 2])
    q, r = cartesian_to_axial(x, y, 1, 'flattop') # 37.1μs -> 31.0μs (19.6% faster)

def test_array_input_pointytop():
    # Test array input for pointytop orientation
    x = np.array([0, 1, 2])
    y = np.array([0, 1, 2])
    q, r = cartesian_to_axial(x, y, 1, 'pointytop') # 31.3μs -> 26.2μs (19.7% faster)

def test_aspect_scale_pointytop():
    # Test aspect_scale for pointytop
    x = np.array([0, 1])
    y = np.array([0, 1])
    q1, r1 = cartesian_to_axial(x, y, 1, 'pointytop', aspect_scale=1) # 31.4μs -> 26.1μs (20.4% faster)
    q2, r2 = cartesian_to_axial(x, y, 1, 'pointytop', aspect_scale=2) # 18.0μs -> 16.3μs (10.5% faster)

def test_aspect_scale_flattop():
    # Test aspect_scale for flattop
    x = np.array([0, 1])
    y = np.array([0, 1])
    q1, r1 = cartesian_to_axial(x, y, 1, 'flattop', aspect_scale=1) # 28.8μs -> 24.4μs (17.8% faster)
    q2, r2 = cartesian_to_axial(x, y, 1, 'flattop', aspect_scale=2) # 17.6μs -> 15.6μs (12.7% faster)

# ------------------------
# Edge Test Cases
# ------------------------

def test_negative_coordinates():
    # Test negative coordinates
    x = np.array([-1, -2, -3])
    y = np.array([-1, -2, -3])
    q, r = cartesian_to_axial(x, y, 1, 'flattop') # 28.0μs -> 24.7μs (13.5% faster)

def test_large_size_parameter():
    # Test with a very large hex size
    x = np.array([1000, 2000])
    y = np.array([1000, 2000])
    q, r = cartesian_to_axial(x, y, 1000, 'pointytop') # 28.6μs -> 24.1μs (18.5% faster)

def test_small_size_parameter():
    # Test with a very small hex size
    x = np.array([0.001, 0.002])
    y = np.array([0.001, 0.002])
    q, r = cartesian_to_axial(x, y, 0.001, 'flattop') # 28.8μs -> 23.2μs (24.4% faster)


def test_invalid_orientation():
    # Test invalid orientation string
    x = np.array([1])
    y = np.array([1])
    # Should default to pointytop if not flattop, but test explicit bad input
    # Should not raise, but should behave as pointytop
    q1, r1 = cartesian_to_axial(x, y, 1, 'not_a_valid_orientation') # 46.0μs -> 39.6μs (16.3% faster)
    q2, r2 = cartesian_to_axial(x, y, 1, 'pointytop') # 18.2μs -> 15.3μs (18.9% faster)

def test_non_array_inputs():
    # Test with Python scalars instead of arrays
    q, r = cartesian_to_axial(1, 1, 1, 'flattop') # 29.8μs -> 20.8μs (43.2% faster)

def test_mismatched_array_lengths():
    # Test with mismatched array lengths for x and y
    x = np.array([0, 1, 2])
    y = np.array([0, 1])
    with pytest.raises(ValueError):
        cartesian_to_axial(x, y, 1, 'flattop') # 15.7μs -> 14.5μs (8.50% faster)



def test_aspect_scale_zero():
    # Test aspect_scale=0 (should collapse all to zero or raise)
    x = np.array([1, 2])
    y = np.array([1, 2])
    q, r = cartesian_to_axial(x, y, 1, 'pointytop', aspect_scale=0) # 46.3μs -> 41.2μs (12.5% faster)

def test_aspect_scale_negative():
    # Test aspect_scale negative
    x = np.array([1, 2])
    y = np.array([1, 2])
    q, r = cartesian_to_axial(x, y, 1, 'pointytop', aspect_scale=-1) # 33.6μs -> 28.8μs (16.9% faster)

# ------------------------
# Large Scale Test Cases
# ------------------------

def test_large_array_flattop():
    # Test with a large array of coordinates for flattop
    N = 1000
    x = np.linspace(-100, 100, N)
    y = np.linspace(-100, 100, N)
    q, r = cartesian_to_axial(x, y, 10, 'flattop') # 45.6μs -> 39.3μs (16.1% faster)

def test_large_array_pointytop():
    # Test with a large array of coordinates for pointytop
    N = 1000
    x = np.linspace(-100, 100, N)
    y = np.linspace(-100, 100, N)
    q, r = cartesian_to_axial(x, y, 10, 'pointytop') # 42.5μs -> 36.6μs (16.0% faster)

def test_large_array_mixed_values():
    # Test with a large array of mixed values
    N = 1000
    x = np.random.uniform(-100, 100, N)
    y = np.random.uniform(-100, 100, N)
    q, r = cartesian_to_axial(x, y, 10, 'flattop') # 48.6μs -> 43.0μs (12.8% faster)

def test_large_array_aspect_scale():
    # Test with a large array and non-default aspect_scale
    N = 1000
    x = np.random.uniform(-100, 100, N)
    y = np.random.uniform(-100, 100, N)
    q1, r1 = cartesian_to_axial(x, y, 10, 'pointytop', aspect_scale=1) # 48.4μs -> 42.3μs (14.4% faster)
    q2, r2 = cartesian_to_axial(x, y, 10, 'pointytop', aspect_scale=0.5) # 34.5μs -> 32.1μs (7.43% faster)

def test_large_array_extreme_values():
    # Test with a large array of extreme values
    N = 1000
    x = np.linspace(-1e6, 1e6, N)
    y = np.linspace(-1e6, 1e6, N)
    q, r = cartesian_to_axial(x, y, 1e3, 'flattop') # 43.5μs -> 37.0μs (17.5% faster)
# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.
#------------------------------------------------
from __future__ import annotations

import math
from typing import Any

import numpy as np  # used for array manipulation
# imports
import pytest  # used for our unit tests
from bokeh.util.hex import cartesian_to_axial

# unit tests

# ---------------------------
# BASIC TEST CASES
# ---------------------------

def test_single_origin_pointytop():
    # Origin should map to (0, 0) in pointytop orientation
    q, r = cartesian_to_axial(0, 0, 1, 'pointytop') # 29.0μs -> 21.0μs (38.2% faster)

def test_single_origin_flattop():
    # Origin should map to (0, 0) in flattop orientation
    q, r = cartesian_to_axial(0, 0, 1, 'flattop') # 25.3μs -> 17.5μs (44.7% faster)

def test_simple_point_pointytop():
    # (size, 0) should map to (1, 0) in pointytop orientation
    q, r = cartesian_to_axial(1, 0, 1, 'pointytop') # 24.5μs -> 17.3μs (42.1% faster)

def test_simple_point_flattop():
    # (0, -size) should map to (0, 1) in flattop orientation
    q, r = cartesian_to_axial(0, -1, 1, 'flattop') # 24.1μs -> 16.6μs (45.2% faster)

def test_array_input_pointytop():
    # Array input for pointytop orientation
    xs = np.array([0, 1, 2])
    ys = np.array([0, 0, 0])
    q, r = cartesian_to_axial(xs, ys, 1, 'pointytop') # 35.8μs -> 31.0μs (15.6% faster)

def test_array_input_flattop():
    # Array input for flattop orientation
    xs = np.array([0, 0, 0])
    ys = np.array([0, -1, -2])
    q, r = cartesian_to_axial(xs, ys, 1, 'flattop') # 31.2μs -> 25.9μs (20.6% faster)

def test_non_integer_size_pointytop():
    # Non-integer size should still work
    q, r = cartesian_to_axial(2.5, 0, 0.5, 'pointytop') # 27.5μs -> 19.1μs (43.8% faster)

def test_non_integer_size_flattop():
    # Non-integer size should still work
    q, r = cartesian_to_axial(0, -2.5, 0.5, 'flattop') # 25.1μs -> 17.5μs (43.3% faster)

# ---------------------------
# EDGE TEST CASES
# ---------------------------

def test_negative_coordinates_pointytop():
    # Negative coordinates should be handled correctly
    q, r = cartesian_to_axial(-1, -1, 1, 'pointytop') # 24.2μs -> 17.1μs (41.8% faster)

def test_negative_coordinates_flattop():
    # Negative coordinates should be handled correctly
    q, r = cartesian_to_axial(-1, -1, 1, 'flattop') # 22.2μs -> 17.1μs (29.8% faster)

def test_large_coordinates_pointytop():
    # Large coordinates should not overflow
    q, r = cartesian_to_axial(1e6, 1e6, 1, 'pointytop') # 24.0μs -> 17.0μs (40.8% faster)

def test_large_coordinates_flattop():
    # Large coordinates should not overflow
    q, r = cartesian_to_axial(1e6, 1e6, 1, 'flattop') # 23.4μs -> 17.0μs (37.4% faster)

def test_small_size_pointytop():
    # Very small size should not cause division errors
    q, r = cartesian_to_axial(1, 1, 1e-6, 'pointytop') # 23.8μs -> 16.8μs (41.9% faster)

def test_small_size_flattop():
    # Very small size should not cause division errors
    q, r = cartesian_to_axial(1, 1, 1e-6, 'flattop') # 24.0μs -> 16.6μs (44.2% faster)

def test_zero_size_pointytop():
    # Zero size should raise ZeroDivisionError
    with pytest.raises(ZeroDivisionError):
        cartesian_to_axial(1, 1, 0, 'pointytop') # 1.59μs -> 1.45μs (9.44% faster)

def test_zero_size_flattop():
    # Zero size should raise ZeroDivisionError
    with pytest.raises(ZeroDivisionError):
        cartesian_to_axial(1, 1, 0, 'flattop') # 1.58μs -> 1.43μs (10.3% faster)

def test_invalid_orientation():
    # Invalid orientation should default to pointytop
    q, r = cartesian_to_axial(1, 0, 1, 'invalid') # 34.4μs -> 25.0μs (37.8% faster)

def test_aspect_scale_pointytop():
    # Aspect scale should affect pointytop orientation horizontally
    q1, r1 = cartesian_to_axial(1, 0, 1, 'pointytop', aspect_scale=1) # 26.2μs -> 18.3μs (43.0% faster)
    q2, r2 = cartesian_to_axial(1, 0, 1, 'pointytop', aspect_scale=2) # 11.9μs -> 9.09μs (30.7% faster)

def test_aspect_scale_flattop():
    # Aspect scale should affect flattop orientation vertically
    q1, r1 = cartesian_to_axial(0, -1, 1, 'flattop', aspect_scale=1) # 23.5μs -> 17.0μs (38.2% faster)
    q2, r2 = cartesian_to_axial(0, -1, 1, 'flattop', aspect_scale=2) # 12.1μs -> 8.80μs (37.6% faster)

def test_array_mixed_types():
    # Mixed types in arrays should work if convertible to float
    xs = np.array([0, 1, 2.0])
    ys = np.array([0, 0, 0])
    q, r = cartesian_to_axial(xs, ys, 1, 'pointytop') # 36.6μs -> 31.1μs (17.6% faster)



def test_empty_array_input():
    # Empty arrays should return empty arrays
    xs = np.array([])
    ys = np.array([])
    q, r = cartesian_to_axial(xs, ys, 1, 'pointytop') # 40.1μs -> 34.8μs (15.4% faster)

# ---------------------------
# LARGE SCALE TEST CASES
# ---------------------------

def test_large_array_input_pointytop():
    # Large array input for pointytop orientation
    xs = np.arange(1000)
    ys = np.zeros(1000)
    q, r = cartesian_to_axial(xs, ys, 1, 'pointytop') # 50.9μs -> 43.4μs (17.3% faster)

def test_large_array_input_flattop():
    # Large array input for flattop orientation
    xs = np.zeros(1000)
    ys = -np.arange(1000)
    q, r = cartesian_to_axial(xs, ys, 1, 'flattop') # 44.8μs -> 38.5μs (16.4% faster)

def test_large_random_array_pointytop():
    # Large random array input for pointytop orientation
    rng = np.random.default_rng(123)
    xs = rng.uniform(-1000, 1000, 1000)
    ys = rng.uniform(-1000, 1000, 1000)
    q, r = cartesian_to_axial(xs, ys, 1, 'pointytop') # 49.1μs -> 42.9μs (14.5% faster)

def test_large_random_array_flattop():
    # Large random array input for flattop orientation
    rng = np.random.default_rng(456)
    xs = rng.uniform(-1000, 1000, 1000)
    ys = rng.uniform(-1000, 1000, 1000)
    q, r = cartesian_to_axial(xs, ys, 1, 'flattop') # 48.7μs -> 42.3μs (15.2% faster)

def test_large_array_non_integer_size():
    # Large array input with non-integer size
    xs = np.arange(1000)
    ys = np.arange(1000)
    q, r = cartesian_to_axial(xs, ys, 0.5, 'pointytop') # 46.8μs -> 40.1μs (16.8% faster)
# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.
#------------------------------------------------
from bokeh.util.hex import cartesian_to_axial

def test_cartesian_to_axial():
    cartesian_to_axial(0, 0, float('nan'), 'flattop', aspect_scale=0.5)

def test_cartesian_to_axial_2():
    cartesian_to_axial(0, 1, float('nan'), 'pointytop', aspect_scale=0.5)

To edit these changes git checkout codeflash/optimize-cartesian_to_axial-mhbk4cgn and push.

The optimized code achieves a **22% speedup** through several key performance improvements: **1. Eliminated Redundant Math Calculations** - Precomputes `math.sqrt(3.0)` once and reuses it in both `HEX_FLAT` and `HEX_POINTY` arrays, avoiding duplicate square root calculations on every function call - Line profiler shows the constant creation overhead reduced from 139ms to 128ms total **2. Avoided Unnecessary Arithmetic Operations** - Guards against `aspect_scale=1` cases to skip multiplication/division when the scale factor has no effect - When `aspect_scale=1`, uses simpler `x/size` instead of `x/size * 1` or `y/size / 1` - This optimization is particularly effective for the common case where aspect scaling isn't needed **3. Improved NumPy Rounding Performance** - Replaced `np.round()` with `np.rint()`, which is often faster for large arrays - Line profiler shows rounding operations reduced from 883ms to 241ms total - a 73% improvement **4. Optimized Memory Usage** - Uses `np.asarray(..., dtype=int)` instead of `.astype(int)`, which avoids unnecessary copying when the array is already the correct dtype - Reduces memory allocation overhead in the return statement **Test Results Analysis:** The optimizations show consistent performance gains across all test scenarios: - **Scalar inputs**: 40-45% faster (most benefit from reduced overhead) - **Small arrays**: 15-25% faster - **Large arrays (1000+ elements)**: 14-17% faster (np.rint improvement shines here) - **aspect_scale != 1 cases**: 7-12% faster (conditional logic reduces unnecessary operations) The performance gains are most pronounced for single-point calculations and cases with default aspect scaling, making this optimization particularly valuable for common usage patterns.

codeflash-ai bot requested a review from mashraf-222 October 29, 2025 05:30

codeflash-ai bot added ⚡️ codeflash Optimization PR opened by Codeflash AI 🎯 Quality: High Optimization Quality according to Codeflash labels Oct 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

⚡️ Speed up function `cartesian_to_axial` by 23% #81

⚡️ Speed up function `cartesian_to_axial` by 23% #81

Uh oh!

codeflash-ai bot commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function cartesian_to_axial by 23% #81

Are you sure you want to change the base?

⚡️ Speed up function cartesian_to_axial by 23% #81

Uh oh!

Conversation

codeflash-ai bot commented Oct 29, 2025

📄 23% (0.23x) speedup for cartesian_to_axial in src/bokeh/util/hex.py

📝 Explanation and details

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function `cartesian_to_axial` by 23% #81

⚡️ Speed up function `cartesian_to_axial` by 23% #81

📄 23% (0.23x) speedup for `cartesian_to_axial` in `src/bokeh/util/hex.py`