e-mail: dvkazakov @ gmail.com |
Phone/WhatsApp: +7-916-909-7864 |
Telegram: @denis_v_kazakov |
|
Skype: denis.v.kazakov |
Multivariate regression when there are more targets than predictors
This project at GitHub.
|
The dataset included data on 455 mercury injection capillary pressure experiments with about 20 features (data on oil wells and geology) and 200 target variables. A literature review demonstrated that the target variables are actually curves of mercury injection volumes vs. pressure, i.e. 100 datapoints with two coordinates: volume and pressure.
I tried several regression methods: linear, KNN, Random Forest and boosting (AdaBoost) with AdaBoost proving to be the best perfomer.
I also proposed an alternative solution with pressure data considered to be predictors which, in my opinion, is closer to real laboratory studies.
For details, please see the following notebooks: