Machine Learning

1 readers

1 users here now

Community Rules:

Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.

founded 2 years ago

MODERATORS

communick@academy.garden

[Project] Big 5 Personality Project Question (alien.top)

submitted 2 years ago by OpenJuggernaut8556@alien.top to c/machinelearning@academy.garden

4 comments fedilink hide all child comments

I'm looking for some advice regarding a project idea I have. I would like to predict the big five personality traits for authors based on an analysis of their writing samples. However, would I need to have had some authors take the big five personality assessment and have a training set with those results in order to do a project like this? Or is their a way to "guess" what certain writing patterns would correlate with? What would be the potential strategy for orienting an ml project like this?

you are viewing a single comment's thread
view the rest of the comments

[–] Veggies-are-okay@alien.top 1 points 2 years ago

Since it’s writing style, it’s unstructured data (as opposed to tabular) and therefore a neural network is the best option. Because you’re looking at text, you have two options:

theoretical: rnn -> lstm -> transformer

More so if you’re into the inner workings. Recursive neural networks bring in the concept of recursion, lstm (long short term memory) gives you more power (but a little more complicated), and finally transformers have the fun encoder/decoder features built in to make a super-powered lstm.

huggingface! For simple classification from text this is gonna be real easy and pretty effective:

https://huggingface.co/bert-base-cased

The big thing here is how are you going to fine tune it? You’ll need some classification outcomes to attach to your samples. Because the traits aren’t mutually exclusive, you might want to make a few binary classifiers (yes/no for a specific trait). The link has some examples of fine tuning too.

Hope this gets you off to a decent start!