Nyström Ncut (Quality)
While the Nyström method is often employed for its computational efficiency, it provides a distinct advantage in segmentation quality by mitigating class imbalance, a common pathology in graph-based spectral clustering.
The Class Imbalance Problem
Standard Normalized Cut (NCut) optimization is sensitive to the density distribution of the data. In typical datasets, such as natural images, class sizes are highly imbalanced (e.g., a large background region versus a small foreground object).
- Density Bias: Since standard graph construction relies on all data points, regions with high point density (majority classes) dominate the graph volume.
- Eigenvector Distortion: The leading eigenvectors of the graph Laplacian tend to minimize the cut cost by isolating these massive, dense regions, often at the expense of smaller, distinct structures.
- Result: Minority classes are frequently merged into dominant clusters or fragmented, as the optimization objective "overlooks" them in favor of the majority signal.
FPS as a Density Equalizer
The use of Farthest Point Sampling (FPS) to select landmarks for the Nyström approximation acts as a crucial regularization step.
Unlike random sampling, which preserves the underlying probability density function of the data (retaining the imbalance), FPS enforces spatial uniformity in the feature space:
- Downsampling Majority Classes: In dense regions (large classes), FPS skips redundant points to satisfy the distance criterion.
- Preserving Minority Classes: In sparse but distinct regions (small classes), FPS retains a higher proportion of points relative to the region's size.
Impact on Segmentation
By constructing the Nyström approximation on this spatially uniform subset, we effectively compute the spectral embedding on a rebalanced representation of the dataset.
The resulting eigenvectors capture the geometry of the feature space rather than the density of the sampling. When these eigenvectors are interpolated back to the full dataset, they provide segmentation boundaries that respect the structural differences between classes, regardless of their relative sizes. This leads to significantly improved recall for small objects and sharper boundaries for under-represented classes.