this post was submitted on 26 Nov 2023

Machine Learning


Hey guys, beginner's question:

I am preparing a dataframe for a machine learning model. The purpose of the model is to predict whether people infected with COVID will die or not.

To do this, I am looking at some conditions and symptoms, such as sore throat, cough, comorbidities, gender, and others, and binarizing them as "yes"/"no" or "male"/"female".

I have a problem. One of the variables is "pregnant", but only individuals of the female sex can be pregnant. How can I deal with this variable?

Can I keep it in the dataframe and assign the value "not pregnant" to all male individuals? Or could this harm the model?
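
For concreteness, the option described above (keeping the column and treating every male as "not pregnant") could look roughly like the sketch below. It assumes pandas; the column names (`sex`, `pregnant`, `cough`) and the toy rows are hypothetical and not taken from any real dataset.

```python
import pandas as pd

# Hypothetical toy data: "pregnant" is missing (None) for male patients.
df = pd.DataFrame({
    "sex": ["male", "female", "female", "male"],
    "pregnant": [None, "yes", "no", None],
    "cough": ["yes", "no", "yes", "no"],
})

# Binarize: 1 where the answer is "yes", 0 otherwise.
# Missing values compare as False, so males end up as 0 ("not pregnant").
df["pregnant"] = (df["pregnant"] == "yes").astype(int)
df["cough"] = (df["cough"] == "yes").astype(int)

# Encode sex as a single 0/1 column and drop the text column.
df["is_female"] = (df["sex"] == "female").astype(int)
df = df.drop(columns="sex")

# Sanity check: no row should be both male and pregnant.
impossible_rows = ((df["is_female"] == 0) & (df["pregnant"] == 1)).sum()
print(df)
print("male and pregnant rows:", impossible_rows)
```

One caveat with this sketch: a female patient whose pregnancy status is genuinely unknown would also be coded as 0, so it may be worth handling real missing values separately before binarizing.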

[–] VoidRippah@alien.top 1 points 10 months ago (1 children)

The purpose of the model is to predict whether people infected with COVID will die or not.

There is no need for all that effort. I can tell you with 100% accuracy that they will die for certain, just like those who were not infected. /s

[–] grudev@alien.top 1 points 9 months ago

TIL: ML redditors don't understand sarcasm or the downsides of "accuracy" in model evaluation.
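
To make the point about accuracy concrete, here is a minimal sketch in plain Python. The labels are made up for illustration (5 deaths among 100 patients), and the "model" is a hypothetical baseline that always predicts survival.

```python
# Hypothetical labels: 1 = died, 0 = survived (5 deaths among 100 patients).
y_true = [1] * 5 + [0] * 95

# A trivial baseline that always predicts "survives".
y_pred = [0] * 100

# Accuracy: share of predictions that match the true label.
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Recall for the "died" class: share of actual deaths that were predicted.
true_positives = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
recall = true_positives / sum(y_true)

print(accuracy)  # 0.95
print(recall)    # 0.0
```

The baseline scores 95% accuracy while catching none of the deaths, which is why metrics such as recall or precision are usually reported alongside accuracy when one class is rare.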