In this section, we explore the historical development of the tree data structure, with a particular focus on three foundational types: the binary search tree (BST), the 2-3 tree, and the red-black tree.

Origin of Binary Search Tree:

Visualization of Binary Search Tree (GeekforGeeks,2025).

The binary search tree (BST) is one of the earliest and most influential data structures in computer science. The core idea behind a binary search tree is to store data in a hierarchical structure where each node has at most two children, such that the left subtree contains values less than the parent, and the right subtree contains values greater than the parent. This ordering enables efficient lookup, insertion, and deletion operations, typically in O(log n) time under ideal conditions.

The binary search concept itself can be traced to the mid-20th century, as computers began to require more sophisticated methods for handling large datasets. The rise of electronic computing machines in the 1940s and 1950s pushed researchers to find data structures that could organize and search through memory efficiently. Early computers had very limited memory and extremely slow access speeds compared to modern machines. As a result, designing algorithms that minimized the number of accesses was a critical problem.

D.H. Lehmer, a pioneer in computational number theory, developed early ideas around decision trees and sorting networks, setting the stage for structured approaches to data access. However, it was in the 1950s and 60s, amid the growing field of algorithm analysis, that formal tree-based search methods took shape.

The binary search tree structure was developed by Conway Berners-Lee and David Wheeler in the 1960s for the problem of efficient storage of labeled data. The BST became a practical tool due to its simplicity and efficiency, particularly when average-case performance mattered more than worst-case guarantees. As researchers realized that BSTs could become highly unbalanced, efforts began to develop balanced variants like the AVL tree which was developed by Adelson-Velsky and Landis in 1962, ensuring that the tree’s height remained logarithmic by enforcing strict balancing conditions during insertions and deletions.

Conway Berners-Lee (1921- 2019) (Wikipedia).

David Wheeler (1927- 2004) (Wikipedia).

Orgin of 2-3 Tree:

Visualization of 2-3 Tree (Joshi, 2017).

The 2-3 Tree was introduced in the early 1970s by John Hopcroft, a pioneering American computer scientist, during a period when algorithmic efficiency and formal data structure design were rapidly advancing. Hopcroft developed the 2-3 Tree as a response to the growing need for balanced search trees that could guarantee consistently fast performance for insertion, deletion, and search operations, even in the worst case.

John Hopcroft - Ramo Founders Award IBM Professor of Engineering and Applied Mathematics in Computer Science Cornell University(National Academy of Engineering, 2017).

At the time, binary search trees—though conceptually elegant—were known to degrade into linear-time structures when given sorted or adversarial input. This made them unreliable for applications requiring guaranteed performance. The 2-3 Tree addressed this by maintaining a strict balance property: every internal node could have either two or three children, and all leaves existed at the same depth. This guaranteed that the tree’s height would grow logarithmically with the number of elements, ensuring O(log n) time complexity for all fundamental operations.

Hopcroft’s innovation was part of a broader trend in the early ’70s, where researchers focused on creating self-balancing tree structures. The 2-3 Tree became a conceptual foundation for later trees like Red-Black Trees and B-Trees, which expanded these ideas for use in memory hierarchies and database indexing.

Orgin of Red-Black Tree:

Visualization of Red and Black Tree (GeekforGeeks,2024).

Red-Black Trees were first introduced in 1972 by Rudolf Bayer, a German computer scientist, under the name symmetric binary B-trees (Bayer, 1972). His goal was to develop a data structure that could maintain balance in a binary search tree while avoiding the complexity of multi-way trees like 2-3 Trees. Bayer’s innovation introduced a way to encode balance information using node colors—red and black—allowing the tree to remain approximately balanced, and guaranteeing O(log n) performance for fundamental operations such as insertion, deletion, and search.

Rudolf Bayer - Professor emeritus of Informatics at the Technical University of Munich (Scale by the Bay, 2020).

The modern form and name “Red-Black Tree” were popularized in 1978 by Leonidas J. Guibas and Robert Sedgewick, who reformulated Bayer’s ideas into a more practical structure suitable for implementation in real-world systems (Guibas & Sedgewick, 1978). Their approach showed how Red-Black Trees could simulate 2-3 Trees using colored binary nodes, making them easier to implement while retaining balanced-tree performance.

Thanks to this balance between theoretical rigor and implementation simplicity, Red-Black Trees became widely adopted in programming libraries and systems software. They are now the standard underlying structure in Java’s TreeMap, C++, and various operating system kernels and memory management systems (Cormen, 2009).

Bayer’s foundational design, together with the practical enhancements by Guibas and Sedgewick, cemented Red-Black Trees as one of the most influential self-balancing search trees in computer science.

Why red black? The laser printers at Xerox PARC had the highest contrast with red and black ink. Left-leaning red black trees were created in 2008 due to frustrations with improper implementation of Red-Black Trees, so much that there have been lawsuits due to their improper implementation (Sedgewick, 2008).