1azytux

joined 1 year ago
 

Hi, Speculative Decoding runs a small model and a large model at the same time with a sampler in between, but in that setup the sampler's job is to NOT skew the probability distribution while doing so. There's a fairly simple Python implementation of this idea here. Is there a way we can adjust the probability distributions of either the small model or the large model for the task of generation?
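
To make the question concrete, here is a minimal sketch of the standard speculative-sampling acceptance step (NumPy only). The `adjust` helper and its temperature parameter are hypothetical, just to show the kind of hook I have in mind; applying it would of course mean the outputs no longer exactly match the unmodified large model.

```python
import numpy as np

def adjust(p, temperature=1.0):
    """Hypothetical hook: re-temper the target distribution p."""
    logits = np.log(np.clip(p, 1e-12, None)) / temperature
    e = np.exp(logits - logits.max())
    return e / e.sum()

def speculative_accept(p, q, proposed_token, rng=None):
    """One acceptance step: p is the large model's next-token distribution,
    q is the draft model's, proposed_token is a token index sampled from q."""
    rng = rng or np.random.default_rng()
    # p = adjust(p, temperature=0.7)  # <- where an adjustment could be hooked in
    accept_prob = min(1.0, p[proposed_token] / q[proposed_token])
    if rng.random() < accept_prob:
        return proposed_token
    # On rejection, resample from the normalized residual max(0, p - q).
    residual = np.clip(p - q, 0.0, None)
    residual /= residual.sum()
    return int(rng.choice(len(p), p=residual))
```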

 

Is there any way we can involve another model (let's call it Model B) to manipulate the logits of Model A? This way, we could incorporate information from Model B when calculating the final outputs of Model A. One way of doing this is the DExperts paper, but has anyone done it in a more straightforward/easier way for LLaMA-based models?
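
To illustrate what I mean, here is a rough sketch using Hugging Face's LogitsProcessor hook: Model B's next-token logits get mixed into Model A's at every generation step. The checkpoint names, the mixing rule (a simple weighted offset, not the full expert/anti-expert scheme from DExperts), and the alpha value are just placeholder assumptions; it also assumes both models share the same tokenizer/vocabulary.

```python
import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          LogitsProcessor, LogitsProcessorList)

class SteeringProcessor(LogitsProcessor):
    """Mix Model B's next-token logits into Model A's during generation."""
    def __init__(self, model_b, alpha=0.5):
        self.model_b = model_b
        self.alpha = alpha

    def __call__(self, input_ids, scores):
        with torch.no_grad():
            logits_b = self.model_b(input_ids).logits[:, -1, :]
        return scores + self.alpha * (logits_b - scores)

# Placeholder checkpoints; any two LLaMA-based models with a shared vocab work.
name_a = "meta-llama/Llama-2-7b-hf"
name_b = "meta-llama/Llama-2-7b-chat-hf"
tok = AutoTokenizer.from_pretrained(name_a)
model_a = AutoModelForCausalLM.from_pretrained(name_a)
model_b = AutoModelForCausalLM.from_pretrained(name_b)

inputs = tok("The capital of France is", return_tensors="pt")
out = model_a.generate(
    **inputs,
    max_new_tokens=20,
    logits_processor=LogitsProcessorList([SteeringProcessor(model_b, alpha=0.5)]),
)
print(tok.decode(out[0], skip_special_tokens=True))
```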

 

Why do models behave this way when they are instruction fine-tuned, i.e., why do they start performing better? Is there any study on this already?

 

Is anyone aware of how to obtain the attention values of a LLaMA model? For example, if I want to obtain attention values (of size 4096) from layer 24, how do I get them?
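
In case it helps frame the question: with Hugging Face transformers the per-layer attention weights and hidden states can be requested at forward time, roughly as sketched below. The checkpoint name is just an example; note that the 4096-dimensional vectors would be the per-token hidden states (LLaMA-7B's hidden size), while the attention weights themselves are per-head [seq_len, seq_len] matrices.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "meta-llama/Llama-2-7b-hf"   # assumed checkpoint
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

inputs = tok("Hello, world", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_attentions=True, output_hidden_states=True)

# Attention weights of layer 24 (0-indexed): [batch, num_heads, seq_len, seq_len]
attn_layer_24 = out.attentions[24]
# Hidden states after layer 24: [batch, seq_len, 4096] for LLaMA-7B
# (index 0 of hidden_states is the embedding output, hence the +1)
hidden_layer_24 = out.hidden_states[24 + 1]
print(attn_layer_24.shape, hidden_layer_24.shape)
```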

 

[–] 1azytux@alien.top 1 points 1 year ago

You can look at TheBloke's Hugging Face page, where he explains the differences too.

[–] 1azytux@alien.top 1 points 1 year ago (1 children)

This sub is for discussion of important stuff happening in ML, not for how candidates can apply for jobs, smh.