https://github.com/lucidrains/llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
artificial-intelligence
attention
deep-learning
q-learning
Added: over 1 year ago - Last Synced: 11 months ago
- Created: November 23, 2023
- Relevant topics? true
- External users? true
- Open source license? true
- Active? true
- Fork? false
- Main Language: Python
- Commits: 19
- Committers: 1
- Issues: 1
- Pull Requests: 0
- Owner: lucidrains
- Stars: 148
- Forks: 4
- Packages: 0
