this post was submitted on 26 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 2 years ago
MODERATORS
 

Hey guys, begginers doubt:

I am preparing a dataframe for a machine learning model. The purpose of the model is to predict whether people infected with COVID will die or not.

To do this, I am looking for some conditions and symptoms, such as sore throat, cough, comorbidities, gender, and others, and binarizing them into "yes" or "no" or "male" and "female".

I have a problem. One of the variables is "pregnant", but only individuals of the female sex can be pregnant. How can I deal with this variable?

Can I keep it in the dataframe and assign the value "not pregnant" to all male individuals? Or could this harm the model?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] VoidRippah@alien.top 1 points 2 years ago (1 children)

The purpose of the model is to predict whether people infected with COVID will die or not.

There is no need for all that effort, I can tell you with 100% accuracy that they will day for certain. Just like those who where not infected. /s

[โ€“] grudev@alien.top 1 points 2 years ago

TIL: ML redditors don't understand sarcasm or the downsides of "accuracy" in model evaluation.