Explain with an example how to find out the p-values in order to avoid multicollinearity?
In logistic regression, p-value can be found by the following code
After training the data we can find the following p-values
Here, summary code generates the following output, among which the most insignificant variable is purpose_A46, with a p-value of 0.93. So in order to remove multicollinearity, we remove this column to increase the accuracy.