https://github.com/lucidrains/llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning
artificial-intelligence attention deep-learning q-learning
Added: over 1 year ago - Last Synced: 11 months ago - Created: November 23, 2023

  • Relevant topics? true
  • External users? true
  • Open source license? true
  • Active? true
  • Fork? false
  • Main Language: Python
  • Commits: 19
  • Committers: 1
  • Issues: 1
  • Pull Requests: 0
  • Owner: lucidrains
  • Stars: 148
  • Forks: 4
  • Packages: 0