this post was submitted on 12 Nov 2023
1 points (100.0% liked)
Machine Learning
1 readers
1 users here now
Community Rules:
- Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
- Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
- Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
- Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.
founded 11 months ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Since it’s writing style, it’s unstructured data (as opposed to tabular) and therefore a neural network is the best option. Because you’re looking at text, you have two options:
More so if you’re into the inner workings. Recursive neural networks bring in the concept of recursion, lstm (long short term memory) gives you more power (but a little more complicated), and finally transformers have the fun encoder/decoder features built in to make a super-powered lstm.
https://huggingface.co/bert-base-cased
The big thing here is how are you going to fine tune it? You’ll need some classification outcomes to attach to your samples. Because the traits aren’t mutually exclusive, you might want to make a few binary classifiers (yes/no for a specific trait). The link has some examples of fine tuning too.
Hope this gets you off to a decent start!