The project aims to create a model to predict whether a pending merger is successful or not. The group used data set from SDC Platinum with years between 1990 and 2016 and only included public entity targets.
Things I like about the report:
- The group described clearly how they cleaned the data (Simplifying multiple statuses into only two status, delete samples without or with extremely low share prices, and etc.).
- The group used appropriate models to tackle the binary classification problem like logistic regression, simple classification tree, and random forest.
- The group set a reasonable benchmark to evaluate the model (comparing with blindly guessing "Completed").
Areas for improvement:
- Maybe try interpolation to fill the NaNs rather than filling with zeros.
- It would be better if there are more illustrations on the graphs. For example, what are the features in the feature importance plot?
The project aims to create a model to predict whether a pending merger is successful or not. The group used data set from SDC Platinum with years between 1990 and 2016 and only included public entity targets.
Things I like about the report:
Areas for improvement: