Long-RL: Scaling RL to Long Sequences
A High-Efficiency System of Large Language Model Based Search Agents
Survey: https://arxiv.org/pdf/2507.20198
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940
A deep learning framework that implements Early Exit strategies in Convolutional Neural Networks (CNNs) using Deep Q-Learning (DQN). This project enhances computational efficiency by dynamically determining the optimal exit point in a neural network for image classification tasks on CIFAR-10; a minimal early-exit sketch follows this list.
Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead without fine-tuning; see the sparse-attention sketch after this list.
Task-Aware Dynamic Model Optimization for Multi-Task Learning (IEEE Access 2023)
Code for paper "Automated Design for Hardware-aware Graph Neural Networks on Edge Devices"
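
The early-exit entry above pairs intermediate classifier heads with a learned policy that decides when to stop computing. The sketch below is only a minimal illustration under assumed names (EarlyExitCNN and its heads are hypothetical, not that repository's API), and it substitutes a simple softmax-confidence threshold for the DQN-based exit decision the project describes.

```python
# Minimal early-exit CNN sketch (hypothetical names, not the repo's actual API).
# Each block is followed by a lightweight exit head; inference stops as soon as
# an exit head's softmax confidence clears a threshold. The repo instead trains
# a DQN policy to make this decision; the threshold rule here is a stand-in.
import torch
import torch.nn as nn
import torch.nn.functional as F

class EarlyExitCNN(nn.Module):
    """Backbone with two intermediate exits plus a final classifier (CIFAR-10-sized inputs)."""
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.block1 = nn.Sequential(nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.block2 = nn.Sequential(nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.block3 = nn.Sequential(nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.AdaptiveAvgPool2d(1))
        # One lightweight classifier head per exit point.
        self.exit1 = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, num_classes))
        self.exit2 = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, num_classes))
        self.exit3 = nn.Sequential(nn.Flatten(), nn.Linear(128, num_classes))

    def forward(self, x, confidence_threshold: float = 0.9):
        # Run block by block; after each exit head, stop if the prediction is
        # confident enough (a learned DQN policy could replace this rule).
        x = self.block1(x)
        logits = self.exit1(x)
        if F.softmax(logits, dim=1).max().item() > confidence_threshold:
            return logits, 1
        x = self.block2(x)
        logits = self.exit2(x)
        if F.softmax(logits, dim=1).max().item() > confidence_threshold:
            return logits, 2
        x = self.block3(x)
        return self.exit3(x), 3

# Usage: one image at a time, since early exit is a per-sample decision.
model = EarlyExitCNN().eval()
with torch.no_grad():
    logits, exit_taken = model(torch.randn(1, 3, 32, 32))
print(f"exited at head {exit_taken}, predicted class {logits.argmax(dim=1).item()}")
```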
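
The Dynamic Attention Mask entry above builds per-layer, per-head sparse masks at inference time. The sketch below is not DAM's actual algorithm; it is a hedged stand-in (the sparse_attention function and its window/top_k parameters are hypothetical) showing one way an input-adaptive, per-head sparse mask can be applied inside scaled dot-product attention.

```python
# Sketch of per-head adaptive sparse attention (not DAM's actual algorithm):
# each query keeps a local window plus its top-k highest-scoring keys, and all
# other positions are masked out before the softmax, so the sparsity pattern
# varies across heads and inputs without any fine-tuning.
import math
import torch

def sparse_attention(q, k, v, window: int = 4, top_k: int = 4):
    """q, k, v: [batch, heads, seq_len, head_dim]. Returns the attention output."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))   # [B, H, S, S]
    B, H, S, _ = scores.shape

    # Static part: each query attends to keys within +/- `window` positions.
    idx = torch.arange(S, device=scores.device)
    local = (idx[None, :] - idx[:, None]).abs() <= window        # [S, S] bool
    mask = local[None, None].expand(B, H, S, S).clone()

    # Adaptive part: additionally keep the top-k keys per (head, query),
    # so each head gets its own input-dependent sparsity pattern.
    topk_idx = scores.topk(top_k, dim=-1).indices               # [B, H, S, k]
    mask.scatter_(-1, topk_idx, True)

    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# Usage with random tensors standing in for a Transformer layer's projections.
B, H, S, D = 1, 2, 16, 8
q, k, v = (torch.randn(B, H, S, D) for _ in range(3))
out = sparse_attention(q, k, v)
print(out.shape)  # torch.Size([1, 2, 16, 8])
```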