f1nuttic

joined 2 years ago
[–] f1nuttic@alien.top 1 points 2 years ago (1 children)

I had the same question a few weeks back and this blog post was really helpful for me: https://together.ai/blog/redpajama-data-v2 . The scripts used are also open sources on the github repo.