noneabove1182@sh.itjust.worksM to

LocalLLaMA@sh.itjust.worksEnglish · 1 year ago

OpenOrca, an open-source dataset and series of instruct-tuned language models

erichartford.com

5

17

OpenOrca, an open-source dataset and series of instruct-tuned language models

erichartford.com

noneabove1182@sh.itjust.worksM to

LocalLLaMA@sh.itjust.worksEnglish · 1 year ago

5

OpenOrca

erichartford.com

Today I am announcing OpenOrca, an open-source dataset and series of instruct-tuned language models. As I read Orca: Progressive Learning from Complex Explanation Traces of GPT-4 by Mukherjee et. al. of Microsoft, I had to consider the implications f...

I realized that while Microsoft would probably release their LLaMA-13b based model (as of the time of this writing they still haven’t) I concluded that they might not release the dataset. Therefore, I resolved to replicate their efforts, download the data myself, and train the model myself, so that OpenOrca can be released on other sizes of LLaMA as well as other foundational models such as Falcon, OpenLLaMA, RedPajama, MPT, RWKV.

Chat

mediocreatbest@lemmy.sdf.org
link
fedilink
English
arrow-up
2·
1 year ago
It looks like it! :) https://huggingface.co/datasets/Open-Orca/OpenOrca

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

15 users / day
16 users / week
21 users / month
362 users / 6 months
1 local subscriber
2.24K subscribers
223 Posts
815 Comments
Modlog