this post was submitted on 23 Nov 2023
1 points (100.0% liked)
Machine Learning
1 readers
1 users here now
Community Rules:
- Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
- Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
- Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
- Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Is this something that could be trained on a laptop without a gpu or would it be better to use cloud based GPU services?
Also can you or anyone else recommend any other libraries which simplify LLM training? I've done some ML projects but I'd like to do something a bit deeper and this looks perfect.
I tried out Talequest by the way. Very impressive.
I strongly recommend training on a GPU, as it speeds up the training process by an order of magnitude and has become the standard. I can recommend services that offer GPU rentals at the lowest prices.
https://vast.ai
https://www.runpod.io
https://www.tensordock.com
Regarding the competitor libraries, I'm unlikely to be able to recommend anything specific. I created this particular library to simplify training on multi-GPU and prototyping, as well as to provide extensive customization options, including modifying the architecture, as is done in LoRA.
Thank you very much for your feedback on Tale Quest. It is very valuable to me, and I plan to further develop it someday. I would appreciate it if you continue to share your feedback. And I wanted to ask right away: is Telegram a popular app where you live? I am very concerned that Telegram might not be widespread enough for a full-fledged launch.
Wow thank you for the detailed reply. Your library looks fantastic. I'm definitely going to give it a go. I'm going to try fine-tuning it on music theory. Is that a crazy idea? Training on a GPU sounds much better. I looked more thoroughly through the repo and found it's all explained in there.
Telegram is a popular app here in the UK. Seems to me like an excellent way to launch it as there's no need for the user to download an app. WhatsApp is much more popular here but maybe it's harder to deploy a bot like this on WhatsApp?