Hey guys, beginner's question:
I am preparing a dataframe for a machine learning model. The purpose of the model is to predict whether people infected with COVID will die or not.
To do this, I am collecting conditions and symptoms such as sore throat, cough, comorbidities, gender, and others, and binarizing them into “yes”/“no” or “male”/“female”.
I have a problem. One of the variables is “pregnant”, but only individuals of the female sex can be pregnant. How can I deal with this variable?
Can I keep it in the dataframe and assign the value “not pregnant” to all male individuals? Or could this harm the model?
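For reference, here is a minimal sketch of what I have in mind, assuming pandas. The column names and the toy rows are made up for illustration, not my real data. It only fills in “no” for male rows where the pregnancy value is missing, so any value that was actually recorded is left alone:

```python
import pandas as pd

# Toy data; column names and values are hypothetical.
df = pd.DataFrame({
    "sex": ["female", "male", "female", "male"],
    "cough": ["yes", "no", "yes", "yes"],
    "pregnant": ["yes", None, "no", None],  # not recorded for male rows
})

# Fill "no" only where the row is male AND the value is missing,
# so recorded values are never overwritten.
male_missing = (df["sex"] == "male") & df["pregnant"].isna()
df.loc[male_missing, "pregnant"] = "no"

# Binarize the yes/no columns and the sex column into 0/1.
yes_no = {"yes": 1, "no": 0}
encoded = pd.DataFrame({
    "is_female": (df["sex"] == "female").astype(int),
    "cough": df["cough"].map(yes_no),
    "pregnant": df["pregnant"].map(yes_no),
})

print(encoded)
```

My worry is whether a column built this way misleads the model, since it will be 0 for nearly every male row.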
Keep in mind that it is possible but rare for men to be pregnant (e.g. trans men, some intersex people).
https://www.healthline.com/health/transgender/can-men-get-pregnant#if-you-have-a-uterus-and-ovaries
Trans men, you mean… women?
And of course you get downvoted… the virus devours people’s minds.
Didn’t expect anything else…
It’s been nice having you on Reddit, but prepare to be banned. They don’t tolerate any variance from the Progressive mindset.