Hyperspherical Variational Auto Encoders

Published at: 10/10/24, 10:09 AM

nGPT Normalized Transformer with Representation Learning on the Hypersphere

Published at: 10/10/24, 10:09 AM

Ahead of Time (AOT) Compilation

Published at: 9/28/24, 10:31 AM

PyTorch Functionalization

Published at: 9/28/24, 10:31 AM

PyTorch Quantization for TensorRT

Published at: 9/28/24, 10:31 AM

Residual stream

Published at: 8/11/24, 2:26 PM

A Mathematical Framework for Transformer Circuits

Published at: 8/11/24, 2:26 PM

Revealing the Utilized Rank of Subspaces of Learning in Neural Networks

Published at: 8/11/24, 2:26 PM

Apple Intelligence Foundation Language Models

Published at: 8/11/24, 12:21 PM

Memorization Through the Lens of Curvature of Loss Function Around Samples

Published at: 8/11/24, 12:21 PM

EVA 02 A Visual Representation for Neon Genesis

Published at: 7/4/24, 6:39 AM

On Good Practices for Task Specific Distillation of Large Pretrained Visual Models

Published at: 7/4/24, 6:39 AM

ViDT An Efficient and Effective Fully Transformer based Object Detector

Published at: 7/4/24, 6:39 AM

Mean Attention Distance

Published at: 7/3/24, 3:22 PM

What Do Self Supervised Vision Transformers Learn?

Published at: 7/2/24, 12:30 PM

Are less inductive biases better or worse?

Published at: 7/2/24, 11:31 AM

Masked Image Modelling

Published at: 7/2/24, 11:31 AM

Non translationally equivariant convolutions

Published at: 7/2/24, 11:31 AM

CKConv Continuous Kernel Convolution For Sequential Data

Published at: 7/2/24, 11:31 AM

DINOv2 Learning Robust Visual Features without Supervision

Published at: 7/2/24, 11:31 AM

MobileCLIP Fast Image Text Models through Multi Modal Reinforced Training

Published at: 6/15/24, 5:50 PM

Optimal Brain Damage

Published at: 6/15/24, 5:50 PM

Retrospective EIE Efficient Inference Engine on Sparse and Compressed Neural Network

Published at: 6/15/24, 5:50 PM

Bit Palettization

Published at: 6/13/24, 1:28 PM

Block Expansion

Published at: 6/13/24, 1:28 PM

Grokking

Published at: 6/13/24, 1:28 PM

K Means based Quantization

Published at: 6/13/24, 1:28 PM

KV Cache

Published at: 6/13/24, 1:28 PM

LoRA Adapter

Published at: 6/13/24, 1:28 PM

LoRA Low Rank Adaptation of Large Language Models

Published at: 6/13/24, 1:28 PM

In Search of Projectively Equivariant Networks

Published at: 6/6/24, 10:01 AM

Provably Strict Generalisation Benefit for Equivariant Models

Published at: 6/6/24, 10:01 AM

Symmetries in Overparametrized Neural Networks A Mean Field View

Published at: 6/6/24, 10:01 AM

Equivariance Initialization

Published at: 6/5/24, 12:46 PM

Priors over Neural Network weights

Published at: 6/5/24, 12:46 PM

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

Published at: 6/5/24, 12:46 PM

Optimization Dynamics of Equivariant and Augmented Neural Networks

Published at: 6/5/24, 12:46 PM

Understanding Deep Learning Chapter 10

Published at: 6/5/24, 12:46 PM

Representation (Group Theory)

Published at: 6/4/24, 2:11 PM

A ConvNet for the 2020s

Published at: 6/4/24, 2:11 PM

Stand Alone Self Attention in Vision Models

Published at: 4/13/24, 8:57 PM

Convergence rate and Hessian spectra

Published at: 4/11/24, 3:52 PM

Depthwise separable convolutions

Published at: 4/11/24, 3:52 PM

Group Axioms

Published at: 4/11/24, 3:52 PM

Group direct product

Published at: 4/11/24, 3:52 PM

Multiple global minima

Published at: 4/11/24, 3:52 PM

Efficient Equivariant Transfer Learning from Pretrained Models

Published at: 4/11/24, 3:52 PM

Equi Tuning Group Equivariant Fine Tuning of Pretrained Models

Published at: 4/11/24, 3:52 PM

Equivariance with Learned Canonicalization Functions

Published at: 4/11/24, 3:52 PM

Exploiting Redundancy Separable Group Convolutional Networks on Lie Groups

Published at: 4/11/24, 3:52 PM

Learning Partial Equivariances from Data

Published at: 4/11/24, 3:52 PM

The Lie derivative for measuring learned equivariance

Published at: 4/11/24, 3:52 PM

Total 142 posts.