Random Subspace Learning (RASSEL) with data driven weighting schemes

dc.contributor.authorElshrif, M.
dc.contributor.authorFokoué, E.
dc.coverage.issue1cs
dc.coverage.volume7cs
dc.date.accessioned2019-01-02T13:23:43Z
dc.date.available2019-01-02T13:23:43Z
dc.date.issued2018cs
dc.description.abstractWe present a novel adaptation of the random subspace learning approach to regression analysis and classification of high dimension low sample size data, in which the use of the individual strength of each explanatory variable is harnessed to achieve a consistent selection of a predictively optimal collection of base learners. In the context of random subspace learning, random forest (RF) occupies a prominent place as can be seen by the vast number of extensions of the random forest idea and the multiplicity of machine learning applications of random forest. The adaptation of random subspace learning presented in this paper differs from random forest in the following ways: (a) instead of using trees as RF does, we use multiple linear regression (MLR) as our regression base learner and the generalized linear model (GLM) as our classification base learner and (b) rather than selecting the subset of variables uniformly as RF does, we present the new concept of sampling variables based on a multinomial distribution with weights (success ’probabilities’) driven through p independent one-way analysis of variance (ANOVA) tests on the predic- tor variables. The proposed framework achieves two substantial benefits, namely, (1) the avoidance of the extra computational burden brought by the permutations needed by RF to de-correlate the predictor variables, and (2) the substantial reduc- tion in the average test error gained with the base learners used.en
dc.formattextcs
dc.format.extent11-30cs
dc.format.mimetypeapplication/pdfen
dc.identifier.citationMathematics for Applications. 2018 vol. 7, č. 1, s. 11-30. ISSN 1805-3629cs
dc.identifier.doi10.13164/ma.2018.02en
dc.identifier.issn1805-3629
dc.identifier.urihttp://hdl.handle.net/11012/137265
dc.language.isoencs
dc.publisherVysoké učení technické v Brně, Fakulta strojního inženýrství, Ústav matematikycs
dc.relation.ispartofMathematics for Applicationsen
dc.relation.urihttp://ma.fme.vutbr.cz/archiv/7_1/ma_7_1_2_elshrif_fokoue_final.pdfcs
dc.rights© Vysoké učení technické v Brně, Fakulta strojního inženýrství, Ústav matematikycs
dc.rights.accessopenAccessen
dc.subjectRandom SubSpace Learning (RSSL), RASSELen
dc.titleRandom Subspace Learning (RASSEL) with data driven weighting schemesen
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
eprints.affiliatedInstitution.departmentÚstav matematikycs
eprints.affiliatedInstitution.facultyFakulta strojního inženýrstvícs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ma_7_1_2_elshrif_fokoue_final.pdf
Size:
951.22 KB
Format:
Adobe Portable Document Format
Description:
Collections