2 Efficient Search

Now that we have started considering ordered sets, we can introduce what is arguably the most beautiful algorithm in the history of Computer Science: binary search. A quintessential algorithm that shows how a well-structured search space is exponentially easier to search than an arbitrary one.

To build some intuition for binary search, let’s consider we have an ordered sequence of items; that is, we always have that if i < j, then l[i] <= l[j]. This simple constraint introduces a very powerful condition in our search problem: if we are looking for x, and x < y, then we know no item after y in the sequence can be x.

Convince yourself of this simple truth before moving on.

This fundamentally changes how fast we can search. Why? Because now every test that we perform–every time we ask whether x < y for some y–we gain a lot of information, not only about y, but about every other item greater or equal than y.

This is the magical leap we always need in order to write a fast algorithm–fast as in, it doesn’t need to check every single thing. We need a way to gather more information from every operation, so we have to do less operations. Let’s see how we can leverage this powerful intuition to make search not only faster, but exponentially faster when items are ordered.

Consider the set of items \(x_1, ..., y, ..., x_n\). We are searching for item \(x\), and we choose to test whether \(x \leq y\). We have two choices, either \(x \leq y\), or, on the contrary, \(x > y\). We want to gain the maximum amount of information in either case. The question is, how should we pick \(y\)?

If we pick \(y\) too close to either end, we can get lucky and cross off a large number of items. For example if \(y\) is in the last 5% of the sequence, and it turns out \(x > y\), we have just removed the first 95% of the sequence without looking at it! But of course, we won’t get that lucky too often. In fact, if \(x\) is a random input, it could potentially be anywhere in the sequence. Under the fairly mild assumption that \(x\) should be uniformly distributed among all indices in the sequences, we will get this lucky exactly 5% of the time. The other 95! we have almost as much work to do as in the beginning.

It should be obvious by now that the best way to pick \(y\) either case is to choose the middle of the sequence. In that way I always cross off 50% of the items, regardless of luck. This is good, we just removed a huge chunk. But it gets even better.

Now, instead of looking for \(x\) linearly in the remaining 50% of the items, we do the exact same thing again! We take the middle point of the current half, and now we can cross off another 25% of the items. If we keep repeating this over and over, how fast will we be left with just one item? Keep that thought in mind.

Binary Search

Before doing the math, here is the most straightforward implemenation of binary search. We will use two indices, l(eft) and r(ight) to keep track of the current sub-sequence we are analyzing. As long as l <= r there is at least one item left to test. Once l > r, we must conclude x is not in the sequence.

Here goes the code.

from typing import Sequence
from codex.types import Ordering, default_order

def binary_search[T](
    x: T, items: Sequence[T], f: Ordering[T] = None
) -> int | None:
    if f is None:
        f = default_order

    l, r = 0, len(items)-1

    while l <= r:
        m = (l + r) // 2
        res = f(x, items[m])

        if res == 0:
            return m
        elif res < 0:
            r = m - 1
        else:
            l = m + 1

Here is a minimal test.

from codex.search.binary import binary_search

def test_binary_search():
    items = [0,1,2,3,4,5,6,7,8,9]

    assert binary_search(3, items) == 3
    assert binary_search(10, items) is None

Bisection

Standard binary search is excellent for determining if an element exists, but it provides no guarantees about which index is returned if the sequence contains duplicates. In many applications—such as range queries or maintaining a sorted list—we need to find the specific boundaries where an element resides or where it should be inserted to maintain order.

This is the problem of bisection. We define two variants: bisect_left and bisect_right.

The bisect_left function finds the first index where an element x could be inserted while maintaining the sorted order of the sequence. If x is already present, the insertion point will be before (to the left of) any existing entries. Effectively, it returns the index of the first element that is not “less than” x.

def bisect_left[T](
    x: T, items: Sequence[T], f: Ordering[T] = None
) -> int:
    if f is None:
        f = default_order

    l, r = 0, len(items)

    while l < r:
        m = (l + r) // 2
        if f(items[m], x) < 0:
            l = m + 1
        else:
            r = m

    return l

The logic here is subtle: instead of returning immediately when an element matches, we keep narrowing the window until l and r meet. By setting r = m when items[m] >= x, we ensure the right boundary eventually settles on the first occurrence.

Conversely, bisect_right (sometimes called bisect_upper) finds the last possible insertion point. If x is present, the index returned will be after (to the right of) all existing entries. This is useful for finding the index of the first element that is strictly “greater than” x.

def bisect_right[T](
    x: T, items: Sequence[T], f: Ordering[T] = None
) -> int:
    if f is None:
        f = default_order

    l, r = 0, len(items)

    while l < r:
        m = (l + r) // 2
        if f(x, items[m]) < 0:
            r = m
        else:
            l = m + 1

    return l

In this variant, we only move the left boundary l forward if x >= items[m], which pushes the search toward the end of a block of identical values.

Since both functions follow the same halving principle as standard binary search, their performance characteristics are identical:

Time Complexity: \(O(\log n)\), as we halve the search space in every iteration of the while loop.
Space Complexity: \(O(1)\), as we only maintain two integer indices regardless of the input size.

To ensure these boundaries are calculated correctly, especially with duplicate elements, we use the following test cases:

from codex.search.binary import bisect_left, bisect_right

def test_bisection_boundaries():
    # Sequence with a "block" of 2s
    items = [1, 2, 2, 2, 3]

    # First index where 2 is (or could be)
    assert bisect_left(2, items) == 1

    # Index after the last 2
    assert bisect_right(2, items) == 4

    # If element is missing, both return the same insertion point
    assert bisect_left(1.5, items) == 1
    assert bisect_right(1.5, items) == 1

def test_bisection_extremes():
    items = [1, 2, 3]
    assert bisect_left(0, items) == 0
    assert bisect_right(4, items) == 3

Binary Search on Predicates

The true power of binary search extends far beyond finding a number in a list. We can generalize the algorithm to find the “boundary” of any monotonic predicate.

A predicate \(p\) is monotonic if, once it becomes true for some index \(i\), it remains true for all \(j>i\). We can use binary search to find the smallest index \(i\) such that \(p(i)\) is true. This is often called binary searching on the answer. Instead of searching through a physical collection of items, we are searching through an abstract decision space.

from typing import Callable

def find_first(
    low: int, high: int, p: Callable[[int], bool]
) -> int | None:
    """
    Finds the first index in [low, high] for which p(index) is True.
    Assumes p is monotonic: if p(i) is True, p(i+1) is also True.
    """
    ans = None
    l, r = low, high

    while l <= r:
        m = (l + r) // 2
        if p(m):
            ans = m
            r = m - 1
        else:
            l = m + 1

    return ans

Consider the problem of finding the integer square root of a very large number —that is, the largest integer such that . While we could use math.sqrt, binary search allows us to find this value using only integer arithmetic, which is vital in fields like cryptography or when dealing with arbitrary-precision integers.

Our predicate \(p\) is: “Is \(x^2 > n\)?” This is monotonic: if \(x^2 > n\), then \((x+1)^2\) is certainly greater than \(n\). By finding the first \(x\) where \(x^x > n\), we know that \(x-1\) is our desired integer square root.

def integer_sqrt(n: int) -> int:
    if n < 0:
        raise ValueError("Square root not defined for negative numbers")
    if n < 2:
        return n

    # Find the first x such that x*x > n
    first_too_big = find_first(1, n, lambda x: x * x > n)

    return first_too_big - 1

This approach reveals a deep connection between searching and optimization. Many problems that ask for a “minimum possible \(x\) such that \(p(x)\) is possible” can be solved by binary searching over the value of \(x\), provided that the possibility \(p\) is monotonic relative to \(x\).

Whenever you encounter a problem where a “yes” answer for a value \(x\) implies a “yes” for all values larger than \(x\), you are no longer looking for an item—you are looking for a threshold. Binary search is the most efficient way to discover it.

Verification

We can verify this generalized search and its application to the integer square root problem with the following tests.

from codex.search.binary import find_first, integer_sqrt

def test_find_first():
    # Predicate: is the number >= 7?
    nums = [1, 3, 5, 7, 9, 11]
    # find_first returns the index
    idx = find_first(0, len(nums) - 1, lambda i: nums[i] >= 7)
    assert idx == 3
    assert nums[idx] == 7

def test_integer_sqrt():
    assert integer_sqrt(16) == 4
    assert integer_sqrt(15) == 3
    assert integer_sqrt(17) == 4
    assert integer_sqrt(0) == 0
    assert integer_sqrt(1) == 1
    assert integer_sqrt(10**20) == 10**10

Conclusion

Searching is arguably the most important problem in Computer Science. In this first chapter, we have only scratched the surface of this vast field, but in doing so, we have discovered one of the fundamental truths of computation: structure matters–a lot.

When we know nothing about the structure of our problem or the collection of items we are searching through, we have no choice but to rely on exhaustive methods like linear search. In these cases, we must check every single item to determine if it is the one we care about.

However, as soon as we introduce some structure–specifically, some order–the landscape changes completely. Binary search allows us to exploit this structure to find an element as fast as is theoretically possible, reducing our workload from a linear progression to a logarithmic one.

This realization that we can trade a bit of organizational effort for a massive gain in search efficiency is the perfect segue for our next chapter. If searching is easier when items are ordered, then we must understand the process of establishing that order. We must talk about sorting.