Machine Learning

[D] NLP infrastructure (alien.top)

submitted 2 years ago by Pitiful_Marketing733@alien.top to c/machinelearning@academy.garden

I've always wanted to build a career in AI/ML infra. In the tech industry, AI infra usually refers to training infra, serving infra, model deployment, and so on.

With the current genAI/LLM wave, I see a lot of NLP-specific infrastructure, such as semantic indexing and vector databases, rising quickly. Do semantic indexing and vector databases also count as AI infra? And is it a promising field?

[–] notllmchatbot@alien.top 1 points 2 years ago

I see these as the equivalent of selling picks and shovels in a gold rush. The good thing is that you don't need to bet on any particular vertical or application, which is always hard with novel technologies. The bad thing is that infra is usually not where most of the value generation/capture happens.

[–] Mammoth-Doughnut-160@alien.top 1 points 2 years ago (1 children)

Yes, semantic indexing and vector databases are now part of AI infra, specifically the part behind Retrieval Augmented Generation (RAG), which links knowledge sources to LLMs for information retrieval (LLMs are not good at searching). To learn how to implement RAG in a GenAI context, check out LLMWare, which provides an integrated RAG platform so you can quickly level up in AI infra: https://github.com/llmware-ai/llmware
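
As a rough illustration of the retrieve-then-generate flow described above, here is a minimal sketch in plain Python. It does not use LLMWare's API: the `embed`, `retrieve`, and `build_prompt` helpers and the bag-of-words "embedding" are hypothetical stand-ins for a real embedding model and vector database, and the final LLM call is left out.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query, documents, k=1):
    """Prepend retrieved context to the question before it goes to an LLM."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Context:\n{context}\n\nQuestion: {query}"

# Hypothetical knowledge source with two short documents.
docs = [
    "vector databases store embeddings for similarity search",
    "training infra schedules jobs on gpu clusters",
]
prompt = build_prompt("how do vector databases search embeddings", docs)
```

In a real setup the retrieval step would be served by a vector database and `prompt` would be passed to an LLM; the overall shape (embed, retrieve, assemble context) stays the same.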

[–] Pitiful_Marketing733@alien.top 1 points 2 years ago

Yes, I notice everybody is working on RAG now!

[–] localhost80@alien.top 1 points 2 years ago

No, it is not considered AI infra. Embedding databases consume AI infra to create the embeddings, but a vector database can exist without any AI component. AI infra is about generating output, not consuming what was generated. If consumption counted, every piece of software built with an LLM component would be considered AI infra.
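
To make concrete the point that a vector database can exist without any AI component: at its core it is nearest-neighbour search over stored vectors. The sketch below uses a hypothetical `TinyVectorIndex` class (not any real product's API) with hand-written vectors and a brute-force scan; production systems add approximate indexes, persistence, and filtering on top.

```python
import math

class TinyVectorIndex:
    """Hypothetical brute-force vector store; no model is involved."""

    def __init__(self):
        self._items = {}  # item id -> stored vector

    def add(self, item_id, vector):
        self._items[item_id] = list(vector)

    def search(self, query, k=1):
        """Return ids of the k stored vectors closest to query (Euclidean)."""
        ranked = sorted(
            self._items,
            key=lambda item_id: math.dist(self._items[item_id], query),
        )
        return ranked[:k]

index = TinyVectorIndex()
index.add("a", [0.0, 0.0])
index.add("b", [1.0, 1.0])
index.add("c", [5.0, 5.0])
nearest = index.search([0.9, 1.2], k=2)  # ["b", "a"]
```

Where the vectors come from (an embedding model, sensor readings, hand-built features) is outside the index itself.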

[–] Consistent_Area9877@alien.top 1 points 2 years ago (1 children)

Tip: go to OpenAI's hiring page and look up their infrastructure engineer requirements :)

[–] Pitiful_Marketing733@alien.top 1 points 2 years ago

Yes!