
Migrate to peft from opendelta for parameter efficient tuning methods #434

Open · jon-tow opened this issue Apr 13, 2023 · 6 comments
Labels: feature request (New feature or request)

jon-tow (Collaborator) commented Apr 13, 2023

🚀 The feature, motivation, and pitch

Let's migrate to peft.

Tasks

Doing so will require the following updates:

  1. Replace the opendelta setup in the AccelerateBaseTrainer with a peft-backed setup:

     if self.config.model.delta_kwargs is not None:
         delta_type, delta_kwargs = parse_delta_kwargs(
             model.base_model.config,
             self.config.model.delta_kwargs,
             self.config.model.num_layers_unfrozen,
         )
         delta_model_class = get_delta_model_class(delta_type)
         delta_model = delta_model_class(model.base_model, **delta_kwargs)
         delta_model.freeze_module(exclude=["deltas"], set_state_dict=True)
         if self.accelerator.is_main_process:
             delta_model.log()

  2. Handle fine-grained layer capturing to only modify the upper trunk layers of hydra architectures as handled below:

     trlx/trlx/utils/modeling.py (lines 414 to 428 in 92b68e4):

     def get_delta_modified_modules(
         config: transformers.PretrainedConfig,
         modified_modules: List[str],
         num_layers_unfrozen: int = -1,
     ) -> List[str]:
         """Returns a list of module names to be modified for a given delta method with
         the specified number of learnable layers."""
         unfrozen_layers_pattern = generate_layer_regex(config, num_layers_unfrozen)
         # [r] for regex as per https://github.com/thunlp/OpenDelta/blob/main/opendelta/utils/name_based_addressing.py#L20
         regex_prefix = "[r]"
         # TODO (jon-tow): `decoder.block.` is hardcoded to support T5 layer naming.
         decoder_prefix = "decoder.block." if config.is_encoder_decoder else ""
         module_list = [regex_prefix + decoder_prefix + unfrozen_layers_pattern + module for module in modified_modules]
         return module_list
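For illustration, the layer-capture step can be sketched in isolation. The helper below is a hypothetical, simplified stand-in for trlx's `generate_layer_regex` (the real helper derives the layer count from a transformers config); it only shows how an alternation over the last N layer indices, combined with OpenDelta's `[r]` regex prefix, restricts name-based addressing to the unfrozen upper trunk layers:

```python
import re

def layer_index_pattern(num_hidden_layers: int, num_layers_unfrozen: int) -> str:
    """Hypothetical stand-in for trlx's `generate_layer_regex`: builds a regex
    alternation matching only the last `num_layers_unfrozen` layer indices,
    e.g. r"(10|11)\." for a 12-layer model with 2 unfrozen layers."""
    if num_layers_unfrozen < 0:
        return r"\d+\."  # all layers unfrozen
    start = num_hidden_layers - num_layers_unfrozen
    return "(" + "|".join(str(i) for i in range(start, num_hidden_layers)) + r")\."

# Prefix each target module the way get_delta_modified_modules does:
# "[r]" tells OpenDelta's name-based addressing to treat the rest as a regex.
pattern = layer_index_pattern(12, 2)
modified_modules = ["attn.q_proj", "attn.v_proj"]
targets = ["[r]" + pattern + m for m in modified_modules]

# Only modules inside layers 10 and 11 match the generated pattern.
hit = re.compile(pattern + "attn.q_proj")
assert hit.match("11.attn.q_proj") is not None
assert hit.match("3.attn.q_proj") is None
```

The module names here (`attn.q_proj`, `attn.v_proj`) are placeholders; real target names depend on the model architecture, which is exactly why the hardcoded `decoder.block.` prefix above is flagged as a TODO.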

Motivation

Citing @ethankim00's concerns with opendelta:

  • The opendelta import fails due to an unnecessary turtle package import; even when pip-installed, users may need sudo privileges to install the corresponding base graphics package (ModuleNotFoundError caused by turtle package, thunlp/OpenDelta#47)
  • Doesn’t seem to work with DeepSpeed ZeRO 3
  • Additional inference overhead from not merging in the LoRA adapter layers
  • Incompatibility with int8 training
  • Less actively maintained than the peft library, which has been growing rapidly
  • Sharing adapter weights on the HuggingFace Hub is less convenient with opendelta
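The inference-overhead point is concrete: an unmerged LoRA adapter adds two extra matmuls (B·(A·x)) to every forward pass, while merging folds B·A into the base weight once so inference is again a single matmul. Below is a minimal pure-Python sketch of that equivalence with toy matrices and exactly-representable values; it is not the peft API, just the arithmetic it relies on:

```python
# Toy demonstration of why merging a LoRA adapter removes inference overhead:
# y = W x + B (A x) equals y = (W + B A) x, so B A can be folded into W once
# and the adapter path dropped at inference time.

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def add(X, Y):
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(X, Y)]

W = [[1.0, 2.0], [3.0, 4.0]]   # frozen base weight, d_out x d_in
B = [[0.5], [0.25]]            # low-rank factor, d_out x r (r = 1)
A = [[0.5, 0.25]]              # low-rank factor, r x d_in
x = [[1.0], [2.0]]             # input column vector

# Unmerged forward: base matmul plus two adapter matmuls on every call.
y_unmerged = add(matmul(W, x), matmul(B, matmul(A, x)))

# Merged forward: fold B A into W once, then a single matmul per call.
W_merged = add(W, matmul(B, A))
y_merged = matmul(W_merged, x)

assert y_unmerged == y_merged  # identical outputs, fewer matmuls at inference
```

In peft this corresponds to merging adapters back into the base model (e.g. `merge_and_unload` on a PeftModel), whereas opendelta leaves the adapter path in the computation graph.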

Alternatives

No response

Additional context

No response

jon-tow added the feature request label on Apr 13, 2023
jon-tow (Collaborator, Author) commented Apr 14, 2023

Assignee: @glerzing will be taking a go at this :)

loganlebanoff commented

I'm curious as to the status of this issue/PR

glerzing (Contributor) commented

I'm developing automated tests for it, there should be a PR soon.

akk-123 commented May 12, 2023

@glerzing looking forward to it

glerzing added a commit to glerzing/trlx that referenced this issue May 13, 2023
akk-123 commented May 22, 2023

@glerzing when will there be a PR?

glerzing (Contributor) commented

It should be ready for PR tomorrow, sorry for the wait.

glerzing added a commit to glerzing/trlx that referenced this issue May 23, 2023
glerzing added a commit to glerzing/trlx that referenced this issue May 24, 2023
jon-tow added a commit that referenced this issue Jun 23, 2023
* Migrate to peft from opendelta for parameter efficient tuning methods (#434) + Collapse reference+learner hydra heads when using LoRa (#320)

* fix from_config

* Review corrections

* ILQL generate when temperature is 0.

* revert: guard against experimental 8-bit loading support

* format: run `black`

---------

Co-authored-by: jon-tow <[email protected]>
Co-authored-by: maxreciprocate <[email protected]>