cd /news/computer-vision/interpolation-between-convolution-an… · home topics computer-vision article
[ARTICLE · art-28912] src=arxiv.org ↗ pub= topic=computer-vision verified=true sentiment=· neutral

Interpolation between Convolution and Attention via K-Nearest Neighbors

Researchers from an undisclosed institution introduced Convolutional Nearest Neighbors (ConvNN), a unified framework that shows convolution and self-attention are special cases of k-nearest neighbor aggregation. ConvNN can replace both convolution and attention layers, enabling exploration of the spectrum between local and global aggregation in computer vision models.

read1 min views1 publishedJun 16, 2026

arXiv:2606.14725v1 Announce Type: new Abstract: The shift from Convolutional Neural Networks to Transformers has reshaped computer vision, yet these two architectural families are typically viewed as fundamentally distinct. Convolutional Neural Networks are defined by spatially local convolution operations, while Transformers rely on global self-attention. We argue that convolution and self-attention, despite their apparent differences, can be unified within a single k-nearest neighbor aggregation framework. The critical insight is that both operations are special cases of neighbor selection and weighted aggregation. Convolution selects neighbors by spatial proximity while self-attention selects by feature similarity, revealing that they lie on a continuous spectrum rather than representing categorically different computations. We introduce Convolutional Nearest Neighbors (ConvNN), a unified framework that formalizes this connection. ConvNN exactly recovers standard and depthwise convolution by restricting neighbor selection to normalized spatial coordinates, and exactly recovers self-attention and its sparse variants, including KVT-attention, by replacing spatial proximity with scaled dot-product similarity. Beyond these special cases, ConvNN serves as a drop-in replacement for both convolution and attention layers, enabling systematic exploration of the intermediate spectrum between local and global aggregation through configurable similarity functions, neighbor selection strategies, positional encodings, and aggregation kernels.

── more in #computer-vision 4 stories · sorted by recency
── more on @convnn 3 stories trending now
sponsored brought to you by zahid.host 4,200+ EU-deployed projects
reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main
Live at https://your-agent.zahid.host
Get free account → Pricing
from €0/mo · no card required
LIVE [news/interpolation-betwee…] indexed:0 read:1min 2026-06-16 ·