Model Merging And Adapter Composition

1

PEFTRepository58/100

via “adapter merging and unmerging”

Parameter-efficient fine-tuning — LoRA, QLoRA, adapter methods for LLMs on consumer GPUs.

Unique: Implements reversible weight merging by storing the original base weights separately and computing merged_weight = base_weight + adapter_weight, enabling unmerge_adapter() to restore the original state. The merge operation is mathematically simple but requires careful state management to support unmerging.

vs others: Eliminates adapter inference overhead (5-10% latency reduction) and removes PEFT runtime dependency, enabling deployment as standard transformers models, but at the cost of losing adapter modularity and storage efficiency.

2

gemini-flowAgent45/100

via “adapter-based model abstraction for service heterogeneity”

rUv's Claude-Flow, translated to the new Gemini CLI; transforming it into an autonomous AI development team.

Unique: Implements adapter pattern specifically for Google's heterogeneous AI services with unified request/response formats and consistent error handling, whereas most frameworks either support single services or require manual service-specific code

vs others: Provides unified abstraction across 8+ Google AI services with pluggable adapters, compared to service-specific SDKs requiring manual coordination or frameworks supporting only homogeneous service types

3

Wan2.2-Fun-Reward-LoRAsFine-tune37/100

via “multi-adapter composition for blended video generation styles”

text-to-video model by undefined. 40,686 downloads.

Unique: Enables runtime composition of multiple entertainment-focused LoRA adapters without model merging or retraining — users can dynamically adjust blend weights to explore the space of entertainment characteristics, whereas most video generation systems require choosing a single style or retraining for new combinations

vs others: Provides fine-grained style control through adapter composition that competitors don't expose — users can create custom entertainment profiles by blending pre-trained adapters, whereas Runway or Pika offer fixed style options or require full model fine-tuning

4

trlFramework33/100

via “model-merging-and-adapter-composition”

Train transformer language models with reinforcement learning.

Unique: Provides utilities for merging and composing LoRA adapters with support for weighted combinations and sequential stacking, enabling multi-task inference without separate model instances

vs others: More flexible than single-adapter inference because it supports adapter composition, while more efficient than maintaining separate models by combining adapters into single merged weights

5

peftFine-tune26/100

via “multi-adapter composition and routing”

Parameter-Efficient Fine-Tuning (PEFT)

Unique: Implements a stateful adapter registry within PeftModel that tracks active adapters and their configurations, enabling runtime switching without model recompilation. The design separates adapter loading (from disk) from adapter activation (in forward pass), allowing multiple adapters to coexist in memory with minimal overhead.

vs others: More flexible than single-adapter approaches because it supports arbitrary composition patterns and dynamic routing, while maintaining the same inference latency as single adapters when only one is active. Enables multi-tenant serving that would otherwise require separate model instances.

6

exllamav2Repository26/100

via “multi-lora adapter composition and switching”

Python AI package: exllamav2

Unique: Implements in-place LoRA composition with dynamic adapter switching without base weight reloading, using a cached adapter registry that pre-computes rank-decomposed products for zero-copy switching between adapters

vs others: Faster adapter switching than HuggingFace PEFT (no model reload); lower memory overhead than storing separate full models; simpler composition API than manual adapter blending

7

QLoRA: Efficient Finetuning of Quantized LLMs (QLoRA)Product23/100

via “adapter composition and inference with merged weight strategies”

* ⭐ 05/2023: [Voyager: An Open-Ended Embodied Agent with Large Language Models (Voyager)](https://arxiv.org/abs/2305.16291)

Unique: Provides systematic adapter composition strategies (sequential, weighted ensemble) with automatic precision handling when merging full-precision adapters into quantized base weights, enabling flexible multi-task model construction — prior LoRA work focused on single-adapter inference

vs others: Enables multi-task inference without maintaining separate models or adapter routing logic, and supports weighted ensemble composition that would otherwise require custom inference code or model ensembling infrastructure

8

VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks (VL-Adapter)Product23/100

via “multi-task adapter composition for vision-language understanding”

* ⭐ 04/2022: [Winoground: Probing Vision and Language Models for Visio-Linguistic... (Winoground)](https://arxiv.org/abs/2204.03162)

Unique: Implements task-specific adapter composition for multimodal models with explicit routing logic, enabling independent training of task adapters while maintaining shared backbone — distinct from single-task adapter approaches and multi-task learning methods that require joint training

vs others: More memory-efficient than training separate full models per task and more flexible than single-task adapters, enabling dynamic task switching without model reloading

Top Matches

Also Known As

Company