f1nuttic

joined 1 year ago
[–] f1nuttic@alien.top 1 points 1 year ago (1 children)

I had the same question a few weeks back and this blog post was really helpful for me: https://together.ai/blog/redpajama-data-v2 . The scripts used are also open sources on the github repo.