Ahead of Time (AOT) Compilation

Published at: 9/28/24, 10:31 AM

PyTorch Functionalization

Published at: 9/28/24, 10:31 AM

PyTorch Quantization for TensorRT

Published at: 9/28/24, 10:31 AM

Residual stream

Published at: 8/11/24, 2:26 PM

A Mathematical Framework for Transformer Circuits

Published at: 8/11/24, 2:26 PM

Revealing the Utilized Rank of Subspaces of Learning in Neural Networks

Published at: 8/11/24, 2:26 PM

Apple Intelligence Foundation Language Models

Published at: 8/11/24, 12:21 PM

Memorization Through the Lens of Curvature of Loss Function Around Samples

Published at: 8/11/24, 12:21 PM

Linear Quantization

Published at: 8/2/24, 6:49 PM

Neural Network Quantization

Published at: 8/2/24, 6:49 PM

ViDT An Efficient and Effective Fully Transformer based Object Detector

Published at: 7/4/24, 6:39 AM

Mean Attention Distance

Published at: 7/3/24, 3:22 PM

What Do Self Supervised Vision Transformers Learn?

Published at: 7/2/24, 12:30 PM

Are less inductive biases better or worse?

Published at: 7/2/24, 11:31 AM

Masked Image Modelling

Published at: 7/2/24, 11:31 AM

Non translationally equivariant convolutions

Published at: 7/2/24, 11:31 AM

CKConv Continuous Kernel Convolution For Sequential Data

Published at: 7/2/24, 11:31 AM

DINOv2 Learning Robust Visual Features without Supervision

Published at: 7/2/24, 11:31 AM

Emerging Properties in Self Supervised Vision Transformers

Published at: 7/2/24, 11:31 AM

FlexiViT One Model for All Patch Sizes

Published at: 7/2/24, 11:31 AM

Retrospective EIE Efficient Inference Engine on Sparse and Compressed Neural Network

Published at: 6/15/24, 5:50 PM

Bit Palettization

Published at: 6/13/24, 1:28 PM

Block Expansion

Published at: 6/13/24, 1:28 PM

Grokking

Published at: 6/13/24, 1:28 PM

K Means based Quantization

Published at: 6/13/24, 1:28 PM

KV Cache

Published at: 6/13/24, 1:28 PM

LoRA Adapter

Published at: 6/13/24, 1:28 PM

LoRA Low Rank Adaptation of Large Language Models

Published at: 6/13/24, 1:28 PM

Parameter Efficient Fine tuning of Self supervised ViTs without Catastrophic Forgetting

Published at: 6/13/24, 1:28 PM

Parameter Efficient Fine Tuning for Pre Trained Vision Models A Survey

Published at: 6/13/24, 1:28 PM

Symmetries in Overparametrized Neural Networks A Mean Field View

Published at: 6/6/24, 10:01 AM

Equivariance Initialization

Published at: 6/5/24, 12:46 PM

Priors over Neural Network weights

Published at: 6/5/24, 12:46 PM

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

Published at: 6/5/24, 12:46 PM

Optimization Dynamics of Equivariant and Augmented Neural Networks

Published at: 6/5/24, 12:46 PM

Understanding Deep Learning Chapter 10

Published at: 6/5/24, 12:46 PM

Representation (Group Theory)

Published at: 6/4/24, 2:11 PM

A ConvNet for the 2020s

Published at: 6/4/24, 2:11 PM

A Hierarchy of Graph Neural Networks Based on Learnable Local Features

Published at: 6/4/24, 2:11 PM

A general theory of correct, incorrect, and extrinsic equivariance

Published at: 6/4/24, 2:11 PM

Depthwise separable convolutions

Published at: 4/11/24, 3:52 PM

Group Axioms

Published at: 4/11/24, 3:52 PM

Group direct product

Published at: 4/11/24, 3:52 PM

Multiple global minima

Published at: 4/11/24, 3:52 PM

Efficient Equivariant Transfer Learning from Pretrained Models

Published at: 4/11/24, 3:52 PM

Equi Tuning Group Equivariant Fine Tuning of Pretrained Models

Published at: 4/11/24, 3:52 PM

Equivariance with Learned Canonicalization Functions

Published at: 4/11/24, 3:52 PM

Exploiting Redundancy Separable Group Convolutional Networks on Lie Groups

Published at: 4/11/24, 3:52 PM

Learning Partial Equivariances from Data

Published at: 4/11/24, 3:52 PM

The Lie derivative for measuring learned equivariance

Published at: 4/11/24, 3:52 PM

Total 160 posts.