본문 바로가기

분류 전체보기

(39)
NAS vs Pruning 보통의 경우 NAS로 최적화 된 Pretrained Model에 Pruning을 사용한다. Dataset이 충분하다고 가정했을 때 Pruning의 이점은 high-dimensional Space에서의 최적화 즉 local minum에 영향이 적은 space에 학습한 후 Pruning을 통해서 low-dimension으로 가는 것이다. NAS의 Gradient Descent : DARTS: Differentiable Architecture Searchlearning the connection probability .. 도 Pruning과 비슷 ? In pruning, the high-dimensional search space:Increases the prevalence of saddle points, w..
LORA/Pruning LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning How to prune Foundation Model for Domain Specific Efficiently?
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - 693 https://www.youtube.com/watch?v=yceNl9C6Ir0 Attention vs state space model :  Attention does kv cache with selection (softmax selection), state-space model does the compression.  State-space model is hard to recover its past data.  Attention works great on the well-defined tokenizer, which every of its tokens has meaningful values, but needs     compression. Many works are integrating this two a..
DeepGraph: Towards Any Structural Pruning Problem :  Structural pruning enables model acceleration by removing structurally-groupd parameters form NN.However, the parameter-grouping patterns vary widely across different models, making architecture-specific pruners, which rely on manually-designed grouping schemes, non-generalizable to new architictures. Abstract : We study any structural pruning, to tackle general structural pruning of ..
EfficientML.ai Lecture 3 Pruning and Sparsity Part II
EfficientML.ai Lecture 2 Pruning and Sparsity Part I
EfficientML.ai Lecture 1 Basics of Neural Network
YOLOv10 1. Abstract We aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and the model architecture. We first tackle the problem of redundant predictions in the post-processing by presenting a consistent dual assignments strategy for NMS-free YOLOs with the dual label assignments { one-to-many head and one-to-one head } and consistent matching metric.It al..