    Repositories list

    • Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
      Apache License 2.0
      0100 Updated Dec 19, 2024
    • [ECCV 2024 Oral] "Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields" by Yonggan Fu, Huaizhi Qu, Zhifan Ye, Chaojian Li, Kevin Zhao, and Yingyan (Celine) Lin.
      Python
      MIT License
      0600 Updated Dec 14, 2024
    • AmoebaLLM

      Public
      [NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu, Junwei Li, Jiayi Qian, Yongan Zhang, Xiangchi Yuan, Dachuan Shi, Roman Yakunin, and Yingyan (Celine) Lin.
      Python
      MIT License
      0700 Updated Dec 13, 2024
    • ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
      Python
      Apache License 2.0
      129350 Updated Oct 15, 2024
    • Python
      MIT License
      63400 Updated Oct 8, 2024
    • LLM4HWDesign Starting Toolkit
      Python
      41710 Updated Oct 4, 2024
    • ACT

      Public
      [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
      Python
      02720 Updated Jun 30, 2024
    • Edge-LLM

      Public
      [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting
      Python
      53820 Updated Jun 30, 2024
    • [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
      Python
      Apache License 2.0
      22710 Updated Jun 12, 2024
    • [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference
      Python
      Apache License 2.0
      12610 Updated Mar 14, 2024
    • NeRFool

      Public
      [ICML 2023] "NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations" by Yonggan Fu, Ye Yuan, Souvik Kundu, Shang Wu, Shunyao Zhang, Yingyan (Celine) Lin
      Python
      MIT License
      11400 Updated Mar 10, 2024
    • CPT

      Public
      [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, and Yingyan (Celine) Lin.
      Python
      MIT License
      63021 Updated Mar 2, 2024
    • [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
      Python
      Apache License 2.0
      03210 Updated Dec 6, 2023
    • C
      0600 Updated Oct 19, 2023
    • BNS-GCN

      Public
      [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling" by Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, Yingyan Lin
      Python
      MIT License
      115200 Updated Oct 6, 2023
    • An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.
      Apache License 2.0
      01000 Updated Sep 24, 2023
    • S3-Router

      Public
      [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing" by Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin
      Python
      MIT License
      21610 Updated Sep 19, 2023
    • ViTCoD

      Public
      [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
      Python
      Apache License 2.0
      1110020 Updated Jun 27, 2023
    • Hint-Aug

      Public
      Python
      MIT License
      0500 Updated Jun 25, 2023
    • [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
      Python
      MIT License
      1610510 Updated Apr 18, 2023
    • HALO

      Public
      The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
      Python
      MIT License
      0900 Updated Mar 22, 2023
    • PipeGCN

      Public
      [ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin
      Python
      MIT License
      73100 Updated Mar 15, 2023
    • ViTALiTy

      Public
      ViTALiTy (HPCA'23) Code Repository
      Python
      Apache License 2.0
      52020 Updated Mar 13, 2023
    • Spline-EB

      Public
      [TMLR] Max-Affine Spline Insights Into Deep Network Pruning
      Python
      MIT License
      0100 Updated Nov 12, 2022
    • 6900 Updated Oct 27, 2022
    • [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architectures
      Python
      62310 Updated Oct 1, 2022
    • NASA

      Public
      [ICCAD 2022] NASA: Neural Architecture Search and Acceleration for Hardware Inspired Hybrid Networks
      Python
      0800 Updated Sep 22, 2022
    • [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan Fu, Haichuan Yang, Jiayi Yuan, Meng Li, Cheng Wan, Raghuraman Krishnamoorthi, Vikas Chandra, and Yingyan (Celine) Lin.
      MIT License
      13510 Updated Jul 12, 2022
    • [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
      Python
      MIT License
      01900 Updated Jul 7, 2022
    • [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
      Python
      MIT License
      21420 Updated May 18, 2022