You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+7-3
Original file line number
Diff line number
Diff line change
@@ -8,15 +8,19 @@ Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of th
8
8
9
9
If you are interested in replicating something like ChatGPT out in the open, please consider joining <ahref="https://discord.gg/xBPBXfcFHd">Laion <imgalt="Join us on Discord"src="https://img.shields.io/discord/823813159592001537?color=5865F2&logo=discord&logoColor=white"></a>
10
10
11
-
This repository has gone viral without my permission. Next time, if you are promoting my unfinished repositories (notice the work in progress flag) for twitter engagement or eyeballs, at least (1) do your research or (2) be totally transparent with your readers about the capacity of the repository without resorting to clickbait. (1) I was not the first, CarperAI had been working on RLHF months before, link below. (2) There is no trained model. This is just the ship and overall map. We still need millions of dollars of compute + data to sail to the correct point in high dimensional parameter space. Even then, you need professional sailors (like Robin Rombach of Stable Diffusion fame) to actually guide the ship through turbulent times to that point.
11
+
## FAQ
12
+
13
+
- Does this contain a model for inference?
14
+
15
+
There is no trained model. This is just the ship and overall map. We still need millions of dollars of compute + data to sail to the correct point in high dimensional parameter space. Even then, you need professional sailors (like Robin Rombach of Stable Diffusion fame) to actually guide the ship through turbulent times to that point.
12
16
13
17
## Community
14
18
15
-
<ahref="https://carper.ai/">CarperAI</a> had been working on <ahref="https://github.com/CarperAI/trlx">an RLHF framework</a> for large language models
19
+
<ahref="https://carper.ai/">CarperAI</a> had been working on <ahref="https://github.com/CarperAI/trlx">an RLHF framework</a> for large language models for many months prior to the release of ChatGPT.
16
20
17
21
<ahref="https://www.youtube.com/watch?v=sswA4j_IUxg">Yannic Kilcher</a> is also working on an <ahref="https://github.com/LAION-AI/Open-Assistant">open sourced implementation</a>
0 commit comments