Spenrath, Y., Hassani, M., & Van Dongen, B. F. (2022). BitBooster: Effective Approximation of Distance Metrics via Binary Operations. In H. Va Leong, S. S. Sarvestani, Y. Teranishi, A. Cuzzocrea, H. Kashiwazaki, D. Towey, J-J. Yang, & H. Shahriar (Eds.), Proceedings – 2022 IEEE 46th Annual Computers, Software, and Applications Conference, COMPSAC 2022 (pp. 201-210). Institute of Electrical and Electronics Engineers. https://doi.org/10.1109/COMPSAC54236.2022.00036
The Euclidean distance is one of the most commonly used distance metrics. Several approximations have been pro-posed in the literature to reduce the complexity of this metric for high-dimensional or large datasets. In this paper, we propose BitBooster, an approximation to the Euclidean distance that can be efficiently computed using binary operations and which can also be applied to the Manhattan distance. The introduced approximation error is shown to be negligible when BitBooster is used for both convex- and density-based clustering. While obtaining clusters of almost the same quality as those obtained with the exact computation, we require only a fraction of the computation time. We demonstrate the superiority of our method to alternative approximations on 960 synthetic and 13 real-world datasets of varying sizes, dimensions and clusters.