TY - JOUR
T1 - Uncovering the Euclidean Geometry of Data
AU - Giansiracusa, Noah
PY - 2025
Y1 - 2025
N2 - Suppose you have a collection of objects in some exotic metric space and you’d like to see what they would look like if they were instead in Euclidean space. Or suppose these objects aren’t even points in a metric space, they are just entities for which you have some rough, intuitive notion of distance between each pair of them—one that need not satisfy any official mathematical properties like the triangle inequality. There is a linear algebraic optimization procedure called multidimensional scaling that embeds these objects in Euclidean space in a manner that approximates their original distances. This uncovers the Euclidean geometry hidden in data. This article explores how it works and what it’s useful for, particularly in a data science context.
AB - Suppose you have a collection of objects in some exotic metric space and you’d like to see what they would look like if they were instead in Euclidean space. Or suppose these objects aren’t even points in a metric space, they are just entities for which you have some rough, intuitive notion of distance between each pair of them—one that need not satisfy any official mathematical properties like the triangle inequality. There is a linear algebraic optimization procedure called multidimensional scaling that embeds these objects in Euclidean space in a manner that approximates their original distances. This uncovers the Euclidean geometry hidden in data. This article explores how it works and what it’s useful for, particularly in a data science context.
UR - https://dx.doi.org/10.1080/00029890.2024.2409614
U2 - 10.1080/00029890.2024.2409614
DO - 10.1080/00029890.2024.2409614
M3 - Article
VL - 132
JO - American Mathematical Monthly
JF - American Mathematical Monthly
IS - Issue 1
ER -