Fine-tune with LoRA

#26
by saireddy - opened

I am trying to fine-tune / domain-adapt (my use case is text only) using LoRA. Are these target modules a good start?
target_modules:

  • q_proj
  • k_proj
  • v_proj
  • o_proj

DeltaNet Linear Attention (48 layers)

  • out_proj
  • in_proj_qkv
  • in_proj_z
  • in_proj_b
  • in_proj_a
  • gate_proj
  • up_proj
  • down_proj

bias: none

because I see that this architecture adds new projection layers (in_proj_qkv, in_proj_z, in_proj_b, in_proj_a).

This is so awesome. I want to learn how to build it.