{ "cells": [ { "cell_type": "markdown", "id": "2e64172a", "metadata": {}, "source": [ "# Alternative solution using pressure values as independent variables\n", "The purpose of MICP experiments is to measure injected mercury volume as a function of pressure, so, in my opinion, pressure values should be independent variables. They may be unknown, in which case it is possible to use the maximum range from 0 to 60,000 psi (maximum pressure in MICP experiments)." ] }, { "cell_type": "code", "execution_count": 1, "id": "444d3eb9", "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import numpy as np\n", "from sklearn.pipeline import Pipeline\n", "from sklearn.preprocessing import StandardScaler\n", "from sklearn.compose import ColumnTransformer\n", "from sklearn.preprocessing import OneHotEncoder\n", "from sklearn.model_selection import train_test_split\n", "from sklearn.model_selection import GridSearchCV\n", "from sklearn.metrics import mean_absolute_percentage_error\n", "from sklearn.multioutput import MultiOutputRegressor\n", "import chime\n", "import optuna\n", "from sklearn.model_selection import cross_val_score\n", "from sklearn.metrics import mean_absolute_percentage_error" ] }, { "cell_type": "markdown", "id": "b51a469a", "metadata": {}, "source": [ "Visualization:" ] }, { "cell_type": "code", "execution_count": 2, "id": "5878166e", "metadata": {}, "outputs": [], "source": [ "from optuna.visualization.matplotlib import plot_optimization_history\n", "from optuna.visualization.matplotlib import plot_param_importances" ] }, { "cell_type": "markdown", "id": "ae7acac4", "metadata": {}, "source": [ "Audible notification:" ] }, { "cell_type": "code", "execution_count": 3, "id": "068462fa", "metadata": {}, "outputs": [], "source": [ "%load_ext chime" ] }, { "cell_type": "markdown", "id": "6bbadc41", "metadata": {}, "source": [ "# Model performance metric \n", "The target variables, bv and pc (i.e mercury volume and pressure) use different scales: pressure scale is 3 orders of magnitude larger. So the variables and their prediction errors are not comparable. Moreover, pressure and volume themselves vary across wide ranges (also a few order of magnitude wide). Therefore, I use mean average percentage error to evaluage model performance. " ] }, { "cell_type": "markdown", "id": "ff355bb1", "metadata": {}, "source": [ "# Data preparation\n", "- Group is a sequential well number that does not have physical sense. It is also uniquely defined by well coordinates. So I drop this column.\n", "- Well coordinates as such also do not determine anything but relative well proximity to each other can result in similarities, so I keep them in.\n", "- Same sample numbers across different wells do not result in any similarity so I drop this column as well.\n", "- The only categorical feature (and a very important one) is lithology, so I will one-hot encode it.\n", "- I will take logarithm of pressure due to a very wide range\n", "- All other values will be standardized as usual." ] }, { "cell_type": "code", "execution_count": 4, "id": "200a6594", "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", " | group | \n", "sample | \n", "depth | \n", "por | \n", "den | \n", "ct_1 | \n", "ct_2 | \n", "ct_3 | \n", "ct_4 | \n", "ct_5 | \n", "... | \n", "pc_91 | \n", "pc_92 | \n", "pc_93 | \n", "pc_94 | \n", "pc_95 | \n", "pc_96 | \n", "pc_97 | \n", "pc_98 | \n", "pc_99 | \n", "pc_100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "3 | \n", "52 | \n", "1660.178974 | \n", "18.556209 | \n", "2.740942 | \n", "1971.579998 | \n", "2396.714551 | \n", "2799.219912 | \n", "1951.977330 | \n", "2041.857394 | \n", "... | \n", "20517.973164 | \n", "23072.031367 | \n", "25918.199746 | \n", "29130.680234 | \n", "32741.818105 | \n", "36776.940391 | \n", "41337.948828 | \n", "46440.194570 | \n", "52182.348633 | \n", "58608.434023 | \n", "
1 | \n", "4 | \n", "92 | \n", "3890.779426 | \n", "8.555400 | \n", "2.834776 | \n", "2513.180531 | \n", "3001.782975 | \n", "2348.160682 | \n", "2414.636280 | \n", "2798.706138 | \n", "... | \n", "4178.079326 | \n", "4564.280930 | \n", "4987.905991 | \n", "5457.903966 | \n", "5976.209199 | \n", "6537.572468 | \n", "7149.014724 | \n", "7826.130471 | \n", "8559.884813 | \n", "9368.567796 | \n", "
2 | \n", "3 | \n", "90 | \n", "2287.441253 | \n", "-0.169935 | \n", "2.761468 | \n", "2274.773580 | \n", "1083.899155 | \n", "2974.647775 | \n", "2713.863889 | \n", "2381.094609 | \n", "... | \n", "20522.705605 | \n", "23075.290605 | \n", "25927.208926 | \n", "29136.474492 | \n", "32743.575156 | \n", "36780.525273 | \n", "41344.256875 | \n", "46461.810508 | \n", "52207.803164 | \n", "58651.855859 | \n", "
3 | \n", "3 | \n", "49 | \n", "2144.788740 | \n", "28.192998 | \n", "2.637605 | \n", "1776.270868 | \n", "2374.721334 | \n", "2670.367528 | \n", "2814.751969 | \n", "2919.311685 | \n", "... | \n", "20522.544570 | \n", "23075.227422 | \n", "25928.721426 | \n", "29137.418496 | \n", "32742.759414 | \n", "36779.554531 | \n", "41344.410508 | \n", "46463.711484 | \n", "52212.429531 | \n", "58661.142344 | \n", "
4 | \n", "2 | \n", "65 | \n", "3754.453151 | \n", "4.136069 | \n", "2.900202 | \n", "1787.771840 | \n", "1893.016733 | \n", "2818.411074 | \n", "1542.522104 | \n", "2246.952313 | \n", "... | \n", "4223.781546 | \n", "4615.144614 | \n", "5045.280631 | \n", "5520.831567 | \n", "6044.115860 | \n", "6611.085441 | \n", "7231.857753 | \n", "7913.697529 | \n", "8657.308785 | \n", "9475.725759 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "5 | \n", "51 | \n", "2097.045676 | \n", "14.781216 | \n", "2.688458 | \n", "2455.839335 | \n", "2480.124636 | \n", "2574.049871 | \n", "2511.215039 | \n", "2395.579990 | \n", "... | \n", "20522.071465 | \n", "23074.956074 | \n", "25926.207676 | \n", "29135.754883 | \n", "32743.289434 | \n", "36780.388477 | \n", "41343.607969 | \n", "46459.794961 | \n", "52204.491289 | \n", "58646.683398 | \n", "
451 | \n", "5 | \n", "35 | \n", "2078.198020 | \n", "19.385152 | \n", "2.684518 | \n", "2044.566279 | \n", "2052.309283 | \n", "2343.335254 | \n", "2529.277934 | \n", "2360.570479 | \n", "... | \n", "20515.948887 | \n", "23070.407505 | \n", "25916.965068 | \n", "29127.595225 | \n", "32738.985176 | \n", "36777.577871 | \n", "41334.507256 | \n", "46434.336172 | \n", "52175.582588 | \n", "58596.777002 | \n", "
452 | \n", "2 | \n", "68 | \n", "3672.405920 | \n", "26.585923 | \n", "2.772035 | \n", "2040.649000 | \n", "2573.163502 | \n", "1292.780567 | \n", "2079.696767 | \n", "2355.948265 | \n", "... | \n", "4384.635758 | \n", "4792.584608 | \n", "5241.195855 | \n", "5734.702477 | \n", "6275.782170 | \n", "6869.952183 | \n", "7510.868407 | \n", "8221.208958 | \n", "8994.294442 | \n", "9843.708743 | \n", "
453 | \n", "5 | \n", "6 | \n", "2094.513127 | \n", "16.977858 | \n", "2.705836 | \n", "2591.491630 | \n", "2295.452470 | \n", "2432.286576 | \n", "2406.785838 | \n", "2705.931007 | \n", "... | \n", "20522.289775 | \n", "23073.814199 | \n", "25925.970928 | \n", "29138.137822 | \n", "32746.070488 | \n", "36781.718086 | \n", "41348.267930 | \n", "46455.386035 | \n", "52207.608379 | \n", "58647.802676 | \n", "
454 | \n", "5 | \n", "40 | \n", "2083.348434 | \n", "21.551898 | \n", "2.698208 | \n", "2759.357112 | \n", "2391.516056 | \n", "2552.341865 | \n", "2043.676110 | \n", "2197.082350 | \n", "... | \n", "20521.270671 | \n", "23074.265576 | \n", "25929.970017 | \n", "29137.153210 | \n", "32746.966450 | \n", "36791.815923 | \n", "41348.842178 | \n", "46455.946401 | \n", "52216.163403 | \n", "58661.856567 | \n", "
455 rows × 222 columns
\n", "\n", " | group | \n", "sample | \n", "depth | \n", "por | \n", "den | \n", "ct_1 | \n", "ct_2 | \n", "ct_3 | \n", "ct_4 | \n", "ct_5 | \n", "ct_6 | \n", "ct_7 | \n", "permeability | \n", "ntg | \n", "thickness_effective | \n", "x | \n", "y | \n", "lithology | \n", "gr | \n", "rhob | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "3 | \n", "52 | \n", "1660.178974 | \n", "18.556209 | \n", "2.740942 | \n", "1971.579998 | \n", "2396.714551 | \n", "2799.219912 | \n", "1951.977330 | \n", "2041.857394 | \n", "2442.840273 | \n", "2318.618853 | \n", "6.425410 | \n", "0.106890 | \n", "5.658985 | \n", "18985 | \n", "5423 | \n", "limestone | \n", "37.738168 | \n", "2.664759 | \n", "
1 | \n", "4 | \n", "92 | \n", "3890.779426 | \n", "8.555400 | \n", "2.834776 | \n", "2513.180531 | \n", "3001.782975 | \n", "2348.160682 | \n", "2414.636280 | \n", "2798.706138 | \n", "3035.549168 | \n", "2729.578887 | \n", "14.300516 | \n", "0.718312 | \n", "4.483886 | \n", "16790 | \n", "3644 | \n", "shale | \n", "42.371106 | \n", "2.760788 | \n", "
2 | \n", "3 | \n", "90 | \n", "2287.441253 | \n", "-0.169935 | \n", "2.761468 | \n", "2274.773580 | \n", "1083.899155 | \n", "2974.647775 | \n", "2713.863889 | \n", "2381.094609 | \n", "2085.069195 | \n", "2747.971468 | \n", "12.825353 | \n", "0.639979 | \n", "6.349391 | \n", "18985 | \n", "5423 | \n", "sandstone | \n", "42.931089 | \n", "2.624635 | \n", "
3 | \n", "3 | \n", "49 | \n", "2144.788740 | \n", "28.192998 | \n", "2.637605 | \n", "1776.270868 | \n", "2374.721334 | \n", "2670.367528 | \n", "2814.751969 | \n", "2919.311685 | \n", "2016.024319 | \n", "2546.626337 | \n", "13.320168 | \n", "0.288901 | \n", "3.819145 | \n", "18985 | \n", "5423 | \n", "limestone | \n", "39.485022 | \n", "2.634539 | \n", "
4 | \n", "2 | \n", "65 | \n", "3754.453151 | \n", "4.136069 | \n", "2.900202 | \n", "1787.771840 | \n", "1893.016733 | \n", "2818.411074 | \n", "1542.522104 | \n", "2246.952313 | \n", "1943.089817 | \n", "1561.393112 | \n", "7.183351 | \n", "0.500868 | \n", "6.593625 | \n", "16169 | \n", "5288 | \n", "siltstome | \n", "34.846060 | \n", "2.459622 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "5 | \n", "51 | \n", "2097.045676 | \n", "14.781216 | \n", "2.688458 | \n", "2455.839335 | \n", "2480.124636 | \n", "2574.049871 | \n", "2511.215039 | \n", "2395.579990 | \n", "2439.147140 | \n", "2089.051532 | \n", "41.015660 | \n", "0.709441 | \n", "7.313445 | \n", "19850 | \n", "3873 | \n", "limestone | \n", "30.240244 | \n", "2.562103 | \n", "
451 | \n", "5 | \n", "35 | \n", "2078.198020 | \n", "19.385152 | \n", "2.684518 | \n", "2044.566279 | \n", "2052.309283 | \n", "2343.335254 | \n", "2529.277934 | \n", "2360.570479 | \n", "2256.546050 | \n", "2663.343376 | \n", "1.366440 | \n", "0.554834 | \n", "8.669671 | \n", "19850 | \n", "3873 | \n", "limestone | \n", "14.771108 | \n", "2.617332 | \n", "
452 | \n", "2 | \n", "68 | \n", "3672.405920 | \n", "26.585923 | \n", "2.772035 | \n", "2040.649000 | \n", "2573.163502 | \n", "1292.780567 | \n", "2079.696767 | \n", "2355.948265 | \n", "2281.831720 | \n", "2071.560649 | \n", "3.371203 | \n", "0.786869 | \n", "4.971094 | \n", "16169 | \n", "5288 | \n", "sandstone | \n", "41.162699 | \n", "2.687207 | \n", "
453 | \n", "5 | \n", "6 | \n", "2094.513127 | \n", "16.977858 | \n", "2.705836 | \n", "2591.491630 | \n", "2295.452470 | \n", "2432.286576 | \n", "2406.785838 | \n", "2705.931007 | \n", "2400.377198 | \n", "2512.672454 | \n", "16.081238 | \n", "0.671111 | \n", "7.022591 | \n", "19850 | \n", "3873 | \n", "sandstone | \n", "37.093964 | \n", "2.653836 | \n", "
454 | \n", "5 | \n", "40 | \n", "2083.348434 | \n", "21.551898 | \n", "2.698208 | \n", "2759.357112 | \n", "2391.516056 | \n", "2552.341865 | \n", "2043.676110 | \n", "2197.082350 | \n", "2635.710708 | \n", "2602.250347 | \n", "6.585460 | \n", "0.791506 | \n", "4.847459 | \n", "19850 | \n", "3873 | \n", "limestone | \n", "29.943692 | \n", "2.578852 | \n", "
455 rows × 20 columns
\n", "\n", " | pc_0 | \n", "pc_1 | \n", "pc_2 | \n", "pc_3 | \n", "pc_4 | \n", "pc_5 | \n", "pc_6 | \n", "pc_7 | \n", "pc_8 | \n", "pc_9 | \n", "... | \n", "pc_91 | \n", "pc_92 | \n", "pc_93 | \n", "pc_94 | \n", "pc_95 | \n", "pc_96 | \n", "pc_97 | \n", "pc_98 | \n", "pc_99 | \n", "pc_100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0.535189 | \n", "0.595750 | \n", "0.665606 | \n", "0.745420 | \n", "0.835103 | \n", "0.932039 | \n", "1.038247 | \n", "1.156261 | \n", "1.292918 | \n", "1.448585 | \n", "... | \n", "20517.973164 | \n", "23072.031367 | \n", "25918.199746 | \n", "29130.680234 | \n", "32741.818105 | \n", "36776.940391 | \n", "41337.948828 | \n", "46440.194570 | \n", "52182.348633 | \n", "58608.434023 | \n", "
1 | \n", "0.908788 | \n", "1.038644 | \n", "1.168391 | \n", "1.298765 | \n", "1.437975 | \n", "1.558354 | \n", "1.814425 | \n", "1.966166 | \n", "2.169002 | \n", "2.359788 | \n", "... | \n", "4178.079326 | \n", "4564.280930 | \n", "4987.905991 | \n", "5457.903966 | \n", "5976.209199 | \n", "6537.572468 | \n", "7149.014724 | \n", "7826.130471 | \n", "8559.884813 | \n", "9368.567796 | \n", "
2 | \n", "0.531813 | \n", "0.591931 | \n", "0.661652 | \n", "0.741723 | \n", "0.831225 | \n", "0.928339 | \n", "1.035978 | \n", "1.155482 | \n", "1.291775 | \n", "1.447311 | \n", "... | \n", "20522.705605 | \n", "23075.290605 | \n", "25927.208926 | \n", "29136.474492 | \n", "32743.575156 | \n", "36780.525273 | \n", "41344.256875 | \n", "46461.810508 | \n", "52207.803164 | \n", "58651.855859 | \n", "
3 | \n", "0.530184 | \n", "0.590141 | \n", "0.659933 | \n", "0.740125 | \n", "0.829539 | \n", "0.926774 | \n", "1.034327 | \n", "1.154018 | \n", "1.290386 | \n", "1.445932 | \n", "... | \n", "20522.544570 | \n", "23075.227422 | \n", "25928.721426 | \n", "29137.418496 | \n", "32742.759414 | \n", "36779.554531 | \n", "41344.410508 | \n", "46463.711484 | \n", "52212.429531 | \n", "58661.142344 | \n", "
4 | \n", "0.908399 | \n", "1.037779 | \n", "1.167289 | \n", "1.297142 | \n", "1.436113 | \n", "1.557094 | \n", "1.813521 | \n", "1.964542 | \n", "2.168307 | \n", "2.358832 | \n", "... | \n", "4223.781546 | \n", "4615.144614 | \n", "5045.280631 | \n", "5520.831567 | \n", "6044.115860 | \n", "6611.085441 | \n", "7231.857753 | \n", "7913.697529 | \n", "8657.308785 | \n", "9475.725759 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "0.532050 | \n", "0.592537 | \n", "0.662303 | \n", "0.742076 | \n", "0.831483 | \n", "0.928488 | \n", "1.036162 | \n", "1.155606 | \n", "1.291590 | \n", "1.447329 | \n", "... | \n", "20522.071465 | \n", "23074.956074 | \n", "25926.207676 | \n", "29135.754883 | \n", "32743.289434 | \n", "36780.388477 | \n", "41343.607969 | \n", "46459.794961 | \n", "52204.491289 | \n", "58646.683398 | \n", "
451 | \n", "0.535261 | \n", "0.596386 | \n", "0.666660 | \n", "0.745552 | \n", "0.834887 | \n", "0.932013 | \n", "1.038366 | \n", "1.156286 | \n", "1.292117 | \n", "1.448102 | \n", "... | \n", "20515.948887 | \n", "23070.407505 | \n", "25916.965068 | \n", "29127.595225 | \n", "32738.985176 | \n", "36777.577871 | \n", "41334.507256 | \n", "46434.336172 | \n", "52175.582588 | \n", "58596.777002 | \n", "
452 | \n", "0.908082 | \n", "1.037725 | \n", "1.166750 | \n", "1.297510 | \n", "1.436421 | \n", "1.557271 | \n", "1.813146 | \n", "1.964591 | \n", "2.167796 | \n", "2.358566 | \n", "... | \n", "4384.635758 | \n", "4792.584608 | \n", "5241.195855 | \n", "5734.702477 | \n", "6275.782170 | \n", "6869.952183 | \n", "7510.868407 | \n", "8221.208958 | \n", "8994.294442 | \n", "9843.708743 | \n", "
453 | \n", "0.534512 | \n", "0.593998 | \n", "0.663882 | \n", "0.744470 | \n", "0.834432 | \n", "0.932015 | \n", "1.038154 | \n", "1.156274 | \n", "1.293843 | \n", "1.448943 | \n", "... | \n", "20522.289775 | \n", "23073.814199 | \n", "25925.970928 | \n", "29138.137822 | \n", "32746.070488 | \n", "36781.718086 | \n", "41348.267930 | \n", "46455.386035 | \n", "52207.608379 | \n", "58647.802676 | \n", "
454 | \n", "0.530512 | \n", "0.589967 | \n", "0.661031 | \n", "0.740283 | \n", "0.829952 | \n", "0.928524 | \n", "1.035100 | \n", "1.154249 | \n", "1.290490 | \n", "1.445513 | \n", "... | \n", "20521.270671 | \n", "23074.265576 | \n", "25929.970017 | \n", "29137.153210 | \n", "32746.966450 | \n", "36791.815923 | \n", "41348.842178 | \n", "46455.946401 | \n", "52216.163403 | \n", "58661.856567 | \n", "
455 rows × 101 columns
\n", "\n", " | group | \n", "sample | \n", "depth | \n", "por | \n", "den | \n", "ct_1 | \n", "ct_2 | \n", "ct_3 | \n", "ct_4 | \n", "ct_5 | \n", "... | \n", "pc_91 | \n", "pc_92 | \n", "pc_93 | \n", "pc_94 | \n", "pc_95 | \n", "pc_96 | \n", "pc_97 | \n", "pc_98 | \n", "pc_99 | \n", "pc_100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "3 | \n", "52 | \n", "1660.178974 | \n", "18.556209 | \n", "2.740942 | \n", "1971.579998 | \n", "2396.714551 | \n", "2799.219912 | \n", "1951.977330 | \n", "2041.857394 | \n", "... | \n", "20517.973164 | \n", "23072.031367 | \n", "25918.199746 | \n", "29130.680234 | \n", "32741.818105 | \n", "36776.940391 | \n", "41337.948828 | \n", "46440.194570 | \n", "52182.348633 | \n", "58608.434023 | \n", "
1 | \n", "4 | \n", "92 | \n", "3890.779426 | \n", "8.555400 | \n", "2.834776 | \n", "2513.180531 | \n", "3001.782975 | \n", "2348.160682 | \n", "2414.636280 | \n", "2798.706138 | \n", "... | \n", "4178.079326 | \n", "4564.280930 | \n", "4987.905991 | \n", "5457.903966 | \n", "5976.209199 | \n", "6537.572468 | \n", "7149.014724 | \n", "7826.130471 | \n", "8559.884813 | \n", "9368.567796 | \n", "
2 | \n", "3 | \n", "90 | \n", "2287.441253 | \n", "-0.169935 | \n", "2.761468 | \n", "2274.773580 | \n", "1083.899155 | \n", "2974.647775 | \n", "2713.863889 | \n", "2381.094609 | \n", "... | \n", "20522.705605 | \n", "23075.290605 | \n", "25927.208926 | \n", "29136.474492 | \n", "32743.575156 | \n", "36780.525273 | \n", "41344.256875 | \n", "46461.810508 | \n", "52207.803164 | \n", "58651.855859 | \n", "
3 | \n", "3 | \n", "49 | \n", "2144.788740 | \n", "28.192998 | \n", "2.637605 | \n", "1776.270868 | \n", "2374.721334 | \n", "2670.367528 | \n", "2814.751969 | \n", "2919.311685 | \n", "... | \n", "20522.544570 | \n", "23075.227422 | \n", "25928.721426 | \n", "29137.418496 | \n", "32742.759414 | \n", "36779.554531 | \n", "41344.410508 | \n", "46463.711484 | \n", "52212.429531 | \n", "58661.142344 | \n", "
4 | \n", "2 | \n", "65 | \n", "3754.453151 | \n", "4.136069 | \n", "2.900202 | \n", "1787.771840 | \n", "1893.016733 | \n", "2818.411074 | \n", "1542.522104 | \n", "2246.952313 | \n", "... | \n", "4223.781546 | \n", "4615.144614 | \n", "5045.280631 | \n", "5520.831567 | \n", "6044.115860 | \n", "6611.085441 | \n", "7231.857753 | \n", "7913.697529 | \n", "8657.308785 | \n", "9475.725759 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "5 | \n", "51 | \n", "2097.045676 | \n", "14.781216 | \n", "2.688458 | \n", "2455.839335 | \n", "2480.124636 | \n", "2574.049871 | \n", "2511.215039 | \n", "2395.579990 | \n", "... | \n", "20522.071465 | \n", "23074.956074 | \n", "25926.207676 | \n", "29135.754883 | \n", "32743.289434 | \n", "36780.388477 | \n", "41343.607969 | \n", "46459.794961 | \n", "52204.491289 | \n", "58646.683398 | \n", "
451 | \n", "5 | \n", "35 | \n", "2078.198020 | \n", "19.385152 | \n", "2.684518 | \n", "2044.566279 | \n", "2052.309283 | \n", "2343.335254 | \n", "2529.277934 | \n", "2360.570479 | \n", "... | \n", "20515.948887 | \n", "23070.407505 | \n", "25916.965068 | \n", "29127.595225 | \n", "32738.985176 | \n", "36777.577871 | \n", "41334.507256 | \n", "46434.336172 | \n", "52175.582588 | \n", "58596.777002 | \n", "
452 | \n", "2 | \n", "68 | \n", "3672.405920 | \n", "26.585923 | \n", "2.772035 | \n", "2040.649000 | \n", "2573.163502 | \n", "1292.780567 | \n", "2079.696767 | \n", "2355.948265 | \n", "... | \n", "4384.635758 | \n", "4792.584608 | \n", "5241.195855 | \n", "5734.702477 | \n", "6275.782170 | \n", "6869.952183 | \n", "7510.868407 | \n", "8221.208958 | \n", "8994.294442 | \n", "9843.708743 | \n", "
453 | \n", "5 | \n", "6 | \n", "2094.513127 | \n", "16.977858 | \n", "2.705836 | \n", "2591.491630 | \n", "2295.452470 | \n", "2432.286576 | \n", "2406.785838 | \n", "2705.931007 | \n", "... | \n", "20522.289775 | \n", "23073.814199 | \n", "25925.970928 | \n", "29138.137822 | \n", "32746.070488 | \n", "36781.718086 | \n", "41348.267930 | \n", "46455.386035 | \n", "52207.608379 | \n", "58647.802676 | \n", "
454 | \n", "5 | \n", "40 | \n", "2083.348434 | \n", "21.551898 | \n", "2.698208 | \n", "2759.357112 | \n", "2391.516056 | \n", "2552.341865 | \n", "2043.676110 | \n", "2197.082350 | \n", "... | \n", "20521.270671 | \n", "23074.265576 | \n", "25929.970017 | \n", "29137.153210 | \n", "32746.966450 | \n", "36791.815923 | \n", "41348.842178 | \n", "46455.946401 | \n", "52216.163403 | \n", "58661.856567 | \n", "
455 rows × 121 columns
\n", "\n", " | bv_0 | \n", "bv_1 | \n", "bv_2 | \n", "bv_3 | \n", "bv_4 | \n", "bv_5 | \n", "bv_6 | \n", "bv_7 | \n", "bv_8 | \n", "bv_9 | \n", "... | \n", "bv_91 | \n", "bv_92 | \n", "bv_93 | \n", "bv_94 | \n", "bv_95 | \n", "bv_96 | \n", "bv_97 | \n", "bv_98 | \n", "bv_99 | \n", "bv_100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "8.314193e-30 | \n", "0.022937 | \n", "0.175831 | \n", "0.343738 | \n", "0.457215 | \n", "0.544975 | \n", "0.619610 | \n", "0.668326 | \n", "0.714923 | \n", "0.758957 | \n", "... | \n", "18.272765 | \n", "18.298018 | \n", "18.320578 | \n", "18.341130 | \n", "18.359811 | \n", "18.369696 | \n", "18.377619 | \n", "18.377846 | \n", "18.378063 | \n", "18.378261 | \n", "
1 | \n", "8.404712e-30 | \n", "0.065743 | \n", "0.110847 | \n", "0.149221 | \n", "0.182478 | \n", "0.207055 | \n", "0.251148 | \n", "0.264525 | \n", "0.286616 | \n", "0.296471 | \n", "... | \n", "7.703552 | \n", "7.729920 | \n", "7.762970 | \n", "7.786158 | \n", "7.807564 | \n", "7.832035 | \n", "7.853543 | \n", "7.873677 | \n", "7.889946 | \n", "7.903239 | \n", "
2 | \n", "9.628115e-30 | \n", "0.065462 | \n", "0.126050 | \n", "0.185575 | \n", "0.236476 | \n", "0.276897 | \n", "0.326915 | \n", "0.352935 | \n", "0.379736 | \n", "0.399494 | \n", "... | \n", "3.323700 | \n", "3.392507 | \n", "3.455950 | \n", "3.515671 | \n", "3.572989 | \n", "3.626552 | \n", "3.677021 | \n", "3.717735 | \n", "3.751224 | \n", "3.785149 | \n", "
3 | \n", "1.001518e-29 | \n", "0.035238 | \n", "0.135762 | \n", "0.260780 | \n", "0.354239 | \n", "0.437589 | \n", "0.512409 | \n", "0.573600 | \n", "0.627993 | \n", "0.674529 | \n", "... | \n", "29.457616 | \n", "29.462819 | \n", "29.467363 | \n", "29.471621 | \n", "29.475390 | \n", "29.477765 | \n", "29.480009 | \n", "29.482167 | \n", "29.484013 | \n", "29.485694 | \n", "
4 | \n", "1.003009e-29 | \n", "0.064098 | \n", "0.117940 | \n", "0.167830 | \n", "0.213313 | \n", "0.245085 | \n", "0.286452 | \n", "0.302900 | \n", "0.324897 | \n", "0.341102 | \n", "... | \n", "3.721371 | \n", "3.787876 | \n", "3.844572 | \n", "3.896132 | \n", "3.944263 | \n", "3.987842 | \n", "4.027115 | \n", "4.057976 | \n", "4.080364 | \n", "4.101747 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "8.405910e-30 | \n", "0.011935 | \n", "0.266474 | \n", "0.392014 | \n", "0.488393 | \n", "0.561142 | \n", "0.629793 | \n", "0.674519 | \n", "0.721665 | \n", "0.761462 | \n", "... | \n", "14.737324 | \n", "14.754105 | \n", "14.769177 | \n", "14.782801 | \n", "14.794812 | \n", "14.805402 | \n", "14.809196 | \n", "14.809561 | \n", "14.809862 | \n", "14.810078 | \n", "
451 | \n", "9.770500e-30 | \n", "0.035415 | \n", "0.148900 | \n", "0.262791 | \n", "0.349767 | \n", "0.428302 | \n", "0.524900 | \n", "0.561232 | \n", "0.611864 | \n", "0.654517 | \n", "... | \n", "19.464941 | \n", "19.480960 | \n", "19.495513 | \n", "19.507846 | \n", "19.517875 | \n", "19.522732 | \n", "19.525760 | \n", "19.527513 | \n", "19.529068 | \n", "19.530144 | \n", "
452 | \n", "9.858864e-30 | \n", "0.035401 | \n", "0.105316 | \n", "0.252543 | \n", "0.343569 | \n", "0.436199 | \n", "0.520484 | \n", "0.579677 | \n", "0.666790 | \n", "0.735258 | \n", "... | \n", "27.938931 | \n", "27.943315 | \n", "27.947413 | \n", "27.951092 | \n", "27.954383 | \n", "27.956902 | \n", "27.959231 | \n", "27.961456 | \n", "27.963561 | \n", "27.965526 | \n", "
453 | \n", "1.020829e-29 | \n", "0.035444 | \n", "0.181987 | \n", "0.246770 | \n", "0.296267 | \n", "0.331121 | \n", "0.381807 | \n", "0.402956 | \n", "0.436622 | \n", "0.463018 | \n", "... | \n", "15.804989 | \n", "15.818642 | \n", "15.830467 | \n", "15.840493 | \n", "15.845425 | \n", "15.847699 | \n", "15.848337 | \n", "15.848766 | \n", "15.849141 | \n", "15.849408 | \n", "
454 | \n", "1.126088e-29 | \n", "0.041702 | \n", "0.143468 | \n", "0.244122 | \n", "0.323763 | \n", "0.397491 | \n", "0.492625 | \n", "0.538694 | \n", "0.614895 | \n", "0.679498 | \n", "... | \n", "21.485961 | \n", "21.496698 | \n", "21.506562 | \n", "21.514549 | \n", "21.520342 | \n", "21.524409 | \n", "21.526741 | \n", "21.528437 | \n", "21.529944 | \n", "21.531123 | \n", "
455 rows × 101 columns
\n", "\n", " | onehot__lithology_clay sandstone | \n", "onehot__lithology_limestone | \n", "onehot__lithology_sandstone | \n", "onehot__lithology_shale | \n", "onehot__lithology_siltstome | \n", "scale__depth | \n", "scale__por | \n", "scale__den | \n", "scale__ct_1 | \n", "scale__ct_2 | \n", "... | \n", "pc_91 | \n", "pc_92 | \n", "pc_93 | \n", "pc_94 | \n", "pc_95 | \n", "pc_96 | \n", "pc_97 | \n", "pc_98 | \n", "pc_99 | \n", "pc_100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "-1.497434 | \n", "0.249045 | \n", "-0.111626 | \n", "-1.219575 | \n", "0.003803 | \n", "... | \n", "9.929057 | \n", "10.046376 | \n", "10.162701 | \n", "10.279547 | \n", "10.396408 | \n", "10.512626 | \n", "10.629536 | \n", "10.745921 | \n", "10.862500 | \n", "10.978634 | \n", "
1 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "1.580383 | \n", "-0.766025 | \n", "1.035027 | \n", "0.411760 | \n", "1.682853 | \n", "... | \n", "8.337607 | \n", "8.426016 | \n", "8.514771 | \n", "8.604820 | \n", "8.695542 | \n", "8.785321 | \n", "8.874730 | \n", "8.965223 | \n", "9.054842 | \n", "9.145116 | \n", "
2 | \n", "0.0 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "-0.631928 | \n", "-1.651635 | \n", "0.139207 | \n", "-0.306337 | \n", "-3.639227 | \n", "... | \n", "9.929287 | \n", "10.046518 | \n", "10.163048 | \n", "10.279746 | \n", "10.396462 | \n", "10.512724 | \n", "10.629689 | \n", "10.746386 | \n", "10.862987 | \n", "10.979374 | \n", "
3 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "-0.828762 | \n", "1.227167 | \n", "-1.374409 | \n", "-1.807858 | \n", "-0.057228 | \n", "... | \n", "9.929279 | \n", "10.046515 | \n", "10.163107 | \n", "10.279778 | \n", "10.396437 | \n", "10.512697 | \n", "10.629693 | \n", "10.746427 | \n", "10.863076 | \n", "10.979533 | \n", "
4 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "1.0 | \n", "1.392278 | \n", "-1.214581 | \n", "1.834544 | \n", "-1.773216 | \n", "-1.393946 | \n", "... | \n", "8.348486 | \n", "8.437098 | \n", "8.526209 | \n", "8.616284 | \n", "8.706840 | \n", "8.796503 | \n", "8.886251 | \n", "8.976350 | \n", "9.066159 | \n", "9.156489 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
450 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "-0.894639 | \n", "-0.134112 | \n", "-0.752990 | \n", "0.239044 | \n", "0.235264 | \n", "... | \n", "9.929256 | \n", "10.046503 | \n", "10.163010 | \n", "10.279721 | \n", "10.396453 | \n", "10.512720 | \n", "10.629673 | \n", "10.746343 | \n", "10.862924 | \n", "10.979286 | \n", "
451 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "-0.920645 | \n", "0.333181 | \n", "-0.801132 | \n", "-0.999735 | \n", "-0.951913 | \n", "... | \n", "9.928958 | \n", "10.046306 | \n", "10.162653 | \n", "10.279441 | \n", "10.396322 | \n", "10.512644 | \n", "10.629453 | \n", "10.745794 | \n", "10.862370 | \n", "10.978435 | \n", "
452 | \n", "0.0 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "1.279068 | \n", "1.064050 | \n", "0.268330 | \n", "-1.011535 | \n", "0.493444 | \n", "... | \n", "8.385862 | \n", "8.474825 | \n", "8.564305 | \n", "8.654291 | \n", "8.744453 | \n", "8.834912 | \n", "8.924106 | \n", "9.014473 | \n", "9.104346 | \n", "9.194588 | \n", "
453 | \n", "0.0 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "-0.898133 | \n", "0.088844 | \n", "-0.540626 | \n", "0.647638 | \n", "-0.277197 | \n", "... | \n", "9.929267 | \n", "10.046454 | \n", "10.163000 | \n", "10.279803 | \n", "10.396538 | \n", "10.512756 | \n", "10.629786 | \n", "10.746248 | \n", "10.862984 | \n", "10.979305 | \n", "
454 | \n", "0.0 | \n", "1.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "-0.913539 | \n", "0.553103 | \n", "-0.633844 | \n", "1.153259 | \n", "-0.010623 | \n", "... | \n", "9.929217 | \n", "10.046473 | \n", "10.163155 | \n", "10.279769 | \n", "10.396566 | \n", "10.513031 | \n", "10.629800 | \n", "10.746260 | \n", "10.863147 | \n", "10.979545 | \n", "
455 rows × 123 columns
\n", "MultiOutputRegressor(estimator=AdaBoostRegressor(estimator=DecisionTreeRegressor(max_depth=10,\n", " random_state=8),\n", " learning_rate=0.01974826191547239,\n", " n_estimators=272,\n", " random_state=8),\n", " n_jobs=-1)In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
MultiOutputRegressor(estimator=AdaBoostRegressor(estimator=DecisionTreeRegressor(max_depth=10,\n", " random_state=8),\n", " learning_rate=0.01974826191547239,\n", " n_estimators=272,\n", " random_state=8),\n", " n_jobs=-1)
AdaBoostRegressor(estimator=DecisionTreeRegressor(max_depth=10, random_state=8),\n", " learning_rate=0.01974826191547239, n_estimators=272,\n", " random_state=8)
DecisionTreeRegressor(max_depth=10, random_state=8)
DecisionTreeRegressor(max_depth=10, random_state=8)