Home
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Deformable DETR: Deformable Transformers for End-to-End Object Detection
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
From Pixels to Components: Eigenvector Masking for Visual Representation Learning
Guillotine Regularization: Why removing layers is needed to improve generalization in Self-Supervised Learning
How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning
Learning Representations on the Unit Sphere: Investigating Angular Gaussian and von Mises-Fisher Distributions for Online Continual Learning
Near, far: Patch-ordering enhances vision foundation models' scene understanding
On the duality between contrastive and non-contrastive self-supervised learning
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
PatchRot: A Self-Supervised Technique for Training Vision Transformers
Scaling and Benchmarking Self-Supervised Visual Representation Learning
Self-supervised learning of Split Invariant Equivariant representations
Self-supervised learning of intertwined content and positional features for object detection
Toward a Geometrical Understanding of Self-supervised Contrastive Learning
Variance-Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self-Distillation Networks
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
nGPT: Normalized Transformer with Representation Learning on the Hypersphere
Revealing the Utilized Rank of Subspaces of Learning in Neural Networks
Memorization Through the Lens of Curvature of Loss Function Around Samples
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models
ViDT: An Efficient and Effective Fully Transformer-based Object Detector
LRP-QViT: Mixed-Precision Vision Transformer Quantization via Layer-wise Relevance Propagation
SimPLR: A Simple and Plain Transformer for Scaling-Efficient Object Detection and Segmentation
A survey of quantization methods for efficient neural network inference
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning Experiences
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Using Degeneracy in the Loss Landscape for Mechanistic Interpretability
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network
Parameter-Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Simultaneous linear connectivity of neural networks modulo permutation
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Talaria: Interactively Optimizing Machine Learning Models for Efficient Inference
Block Transformer: Global-to-Local Language Modeling for Fast Inference
Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution
A Hierarchy of Graph Neural Networks Based on Learnable Local Features
G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
Harmonics of Learning: Universal Fourier Features Emerge in Invariant Networks
Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models
Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning Dynamics
On the Symmetries of Deep Learning Models and their Internal Representations
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Relaxed Octahedral Group Convolution for Learning Symmetry Breaking in 3D Physical Systems
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
An Investigation into Neural Net Optimization via Hessian Eigenvalue Density
An image is worth 16x16 words: Transformers for image recognition at scale
Approximation-Generalization Trade-offs under (Approximate) Group Equivariance
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Fast, Expressive SE(n) Equivariant Networks through Weight-Sharing in Position-Orientation Space
MobileViT: light-weight, general-purpose, and mobile-friendly vision transformer
Relaxing Equivariance Constraints with Non-stationary Continuous Filters
Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries
Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups