overview for saintshing

What All Dropped Recently: in c/localllama@poweruser.forum

[–] saintshing@alien.top 1 points 2 years ago

https://github.com/m-bain/whisperX/issues/569

[D] Need advice on training neural network to trigger “laughing” for comedy skit audio files in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

Does this need to work in real time, or does your model have access to the entire sequence, so that it can use context from before and after the current time point?

You have to be careful about leakage when you preprocess the training data: if you remove the laughter and leave a silent interval, the model may just learn to detect the gap instead of the joke.

The text-based approach may work, but it may not give you precise timing.
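
To make the real-time vs. full-sequence distinction concrete, here is a minimal sketch. The `context_window` helper and the per-frame values are made up for illustration; they are not from the original post or any particular audio library.

```python
from typing import List, Sequence


def context_window(frames: Sequence[float], t: int, past: int, future: int) -> List[float]:
    """Return frames[t-past .. t+future], clipped to the sequence bounds."""
    start = max(0, t - past)
    end = min(len(frames), t + future + 1)
    return list(frames[start:end])


# Hypothetical per-frame features (e.g. energy), for illustration only.
# The two zeros stand in for a silent gap left after cutting out laughter.
frames = [0.1, 0.2, 0.9, 0.8, 0.0, 0.0, 0.3]

# Real-time (causal): only frames up to and including the current one.
causal = context_window(frames, t=3, past=2, future=0)

# Offline: frames before and after the current one.
offline = context_window(frames, t=3, past=2, future=2)
```

In this toy setup the offline window at `t=3` already contains the silent gap, which is exactly the kind of cue that could leak the label, while the causal window does not.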

[D] how do you remember methods in papers you read? in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

Or try to teach it to someone else.

By implementing a model without looking at the paper, you essentially perform autoencoding/masked language modelling and learn a more compact latent representation.

[D] Is there any research for RAG where vectors that are queried in succession become associated for future queries, even if their embedding values are different? in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

I am not sure if I understand your question.

What exactly is your query? Is it "spell the poem in blocks"? Or do you really want a "poem" query to always also return the part about spelling in Minecraft blocks, even though you haven't mentioned anything about Minecraft blocks?

These two things are not associated with each other in human language. I meant you can create training data to force them to be embedded close together if you want. You can also add a layer on top of the vector DB, so that some metadata is stored together with each embedding, which can help you retrieve related documents.
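
A rough sketch of the metadata-layer idea, assuming a toy in-memory store rather than any real vector DB. `LinkedVectorStore`, `link`, and the 2-D vectors are all hypothetical names and data used only for illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Set


def cosine(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class LinkedVectorStore:
    """Toy in-memory store: embeddings plus 'related' metadata per document."""

    def __init__(self) -> None:
        self.vectors: Dict[str, List[float]] = {}
        self.related: Dict[str, Set[str]] = defaultdict(set)

    def add(self, doc_id: str, vector: List[float]) -> None:
        self.vectors[doc_id] = vector

    def link(self, a: str, b: str) -> None:
        # Record that a and b were queried in succession.
        self.related[a].add(b)
        self.related[b].add(a)

    def query(self, vector: List[float], k: int = 1) -> List[str]:
        ranked = sorted(
            self.vectors,
            key=lambda d: cosine(vector, self.vectors[d]),
            reverse=True,
        )
        hits = ranked[:k]
        # Expand the nearest neighbours with their linked documents,
        # even if those documents are far away in embedding space.
        extra = sorted({r for h in hits for r in self.related[h]} - set(hits))
        return hits + extra


store = LinkedVectorStore()
store.add("poem", [1.0, 0.0])
store.add("minecraft_blocks", [0.0, 1.0])
store.add("recipe", [0.7, 0.7])
store.link("poem", "minecraft_blocks")
result = store.query([0.9, 0.1], k=1)
```

Here a query close to "poem" also brings back "minecraft_blocks" through the stored link, without changing either embedding.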

[D] Gen-AI/LLM - Interview prep in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago (1 children)

Good luck with your interview

https://rentry.org/llm-training

Challenges and Applications of Large Language Models

https://github.com/bkitano/llama-from-scratch

https://www.philschmid.de/tags/generativeai

https://web.stanford.edu/class/cs224n/ and 224u, 224v

https://eugeneyan.com/writing/llm-patterns/ (many other good blogposts)

https://huyenchip.com/2023/04/11/llm-engineering.html

https://lilianweng.github.io/

RAG stuff (sentence embeddings, vector DBs):
https://www.pinecone.io/learn/
https://www.sbert.net/
https://haystackconf.com/us2023/keynote/
https://www.latent.space/p/llamaindex#details

[P] End-to-end Keyword Bidding for Apple Search Ads in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

How we killed SQL and built a machine learning model in its place

Please consider using a different subtitle

[D] Can LLM trained on Testset? in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

Pretraining on the Test Set Is All You Need

[D] How To Do Product Matching Using ML? in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago (1 children)

First of all, do you already have the data? Scraping and maintaining an updated product list is an entirely separate task and it is not easy.

If you have already scraped the data, what data do you have? Just the title, or do you also have the description, brand, model number, and images? There was a Kaggle competition on matching products on Shopee for a price match guarantee (based on titles and images). You can look at some of the winning solutions.

https://www.kaggle.com/competitions/shopee-product-matching
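
If you only have titles, a simple baseline to start from might look like the sketch below. This is not taken from any of the competition solutions; it is just character n-gram cosine similarity in plain Python, and the function names and example titles are made up.

```python
import math
from collections import Counter


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count character n-grams after lowercasing and collapsing whitespace."""
    cleaned = " ".join(text.lower().split())
    return Counter(cleaned[i:i + n] for i in range(len(cleaned) - n + 1))


def title_similarity(a: str, b: str) -> float:
    """Cosine similarity between the n-gram count vectors of two titles."""
    va, vb = char_ngrams(a), char_ngrams(b)
    dot = sum(va[g] * vb[g] for g in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical listings of the same product vs. an unrelated product.
same = title_similarity("Apple iPhone 13 128GB Blue", "iphone 13 128gb blue (Apple)")
different = title_similarity("Apple iPhone 13 128GB Blue", "Samsung Galaxy S21 case")
```

You would then pick a similarity threshold on labelled pairs; learned text and image embeddings would likely do better, but this gives you something to compare against.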

I looked at your post history. You are trying to sell your price monitoring tool, but based on this question it seems like you don't have a working prototype?

[D] How can you add additional features/attributes while doing instance segmentation? in c/machinelearning@academy.garden

[–] saintshing@alien.top 1 points 2 years ago

additional information about the products that run at a specific time (like product family, color, brand, etc.).

It seems easier to just search for images based on the additional information and then train on those images. Otherwise you would first have to predict the brand (which again requires you to extract visual features from product images of that brand), maybe do OCR (slow), and use the result to filter region proposals.

What are top open source projects in LLM space in c/localllama@poweruser.forum

[–] saintshing@alien.top 1 points 2 years ago

MLC LLM (deploying LLMs on mobile devices and in browsers)

Finally, a diffusion based LMM! in c/localllama@poweruser.forum

[–] saintshing@alien.top 1 points 2 years ago (1 children)

Instead of using Gaussian noise (in the latent space), I wonder if we can introduce noise by randomly inserting/deleting/replacing/swapping words. Can't we train a BERT model to predict the original text from the noised text?
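
A minimal sketch of what that word-level noising could look like, just to make the idea concrete. The `add_word_noise` function, the corruption probability `p`, and the tiny vocabulary are all assumptions for illustration, not something from the paper being discussed.

```python
import random
from typing import List, Sequence


def add_word_noise(
    words: Sequence[str], vocab: Sequence[str], p: float, rng: random.Random
) -> List[str]:
    """Corrupt each word with probability p using a random edit operation."""
    noisy: List[str] = []
    for word in words:
        if rng.random() >= p:
            noisy.append(word)  # keep the word unchanged
            continue
        op = rng.choice(["insert", "delete", "replace", "swap"])
        if op == "insert":
            noisy.extend([word, rng.choice(vocab)])  # add a random word after it
        elif op == "replace":
            noisy.append(rng.choice(vocab))  # substitute a random word
        elif op == "swap" and noisy:
            noisy.insert(len(noisy) - 1, word)  # swap with the previous word
        elif op == "swap":
            noisy.append(word)  # nothing before it to swap with
        # "delete": append nothing
    return noisy


original = "the cat sat on the mat".split()
noisy = add_word_noise(original, vocab=["dog", "ran", "under"], p=0.3, rng=random.Random(0))
# (noisy, original) would be one training pair for a denoising model.
```

Raising `p` step by step would play a role loosely similar to a noise schedule, though whether that trains well is an open question.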