GitHub: https://github.com/mistralai-sf24/hackathon
X: https://twitter.com/MistralAILabs/status/1771670765521281370
New release: Mistral 7B v0.2 Base (Raw pretrained model used to train Mistral-7B-Instruct-v0.2)
🔸 https://models.mistralcdn.com/mistral-7b-v0-2/mistral-7B-v0.2.tar
🔸 32k context window
🔸 Rope Theta = 1e6
🔸 No sliding window
🔸 How to fine-tune: see the hackathon repo linked above
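The bullet points above can be sanity-checked programmatically. A minimal sketch, assuming the unpacked tar contains a `params.json` in Mistral's reference format (the field names `rope_theta`, `max_seq_len`, and `sliding_window` here are assumptions from that format, not confirmed by this post):

```python
import json

# Hypothetical params.json contents reflecting this release's notes:
# field names are assumed; values come from the bullets above.
params_text = """
{
  "rope_theta": 1000000.0,
  "max_seq_len": 32768
}
"""

params = json.loads(params_text)
assert params["rope_theta"] == 1e6          # Rope Theta = 1e6
assert params["max_seq_len"] == 32 * 1024   # 32k context window
assert "sliding_window" not in params       # no sliding window in v0.2
print("config matches the release notes")
```

Checking for the absence of `sliding_window` is the key difference from v0.1, which used sliding-window attention.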
It scores slightly worse than v0.1 across benchmarks, which isn't ideal, but that says little about its fine-tuning potential.