Abstract
|
|
---|---|
Random forests algorithm has been applied extensively due to its high prediction accuracy, interpretability, ability to deal with high dimensional data and to assess the relevance of highly correlated variables in complex non-linear models. We propose an alternative framework to assess the variable importance in multivariate response scenarios based on the permutation importance method using the conditional inference trees algorithm. To build the solution, a f-divergence measure from information theory is used. The main goal of divergence measures is to provide a distance between probability distributions, in our case, the observations and predicted values. The solution was tested in simulated examples and also in a real case, where we assessed and ranked the most relevant predictors for price and demand of electricity jointly. The results show that the new method outperforms in most cases the outcomes achieved by the recently proposed variable importance technique, Intervention Prediction Measure. | |
International
|
Si |
Congress
|
11th international Conference of the ERCIM (European Research Consortium for Informatics and Mathematics ) Working group on Computational and Methodological statistics (CMStatistics 2018) |
|
960 |
Place
|
Pisa , Italia |
Reviewers
|
Si |
ISBN/ISSN
|
978-9963-2227-5-9 |
|
|
Start Date
|
12/12/2018 |
End Date
|
14/12/2018 |
From page
|
42 |
To page
|
42 |
|
Abstracts of the 1th international Conference of the ERCIM (European Research Consortium for Informatics and Mathematics ) Working group on Computational and Methodological statistics (CMStatistics 2018) |