Partitioned least squares

Authors:

Roberto Esposito, Mattia Cerrato, Marco Locatelli

Abstract

Linear least squares is one of the most widely used regression methods in many fields. The simplicity of the model allows the method to be used when data is scarce, and lets practitioners gain insight into the problem by inspecting the values of the learnt parameters. In this paper we propose a variant of the linear least squares model that allows practitioners to partition the input features into groups of variables that they require to contribute similarly to the final result. We show that the new formulation is not convex and provide two alternative methods to deal with the problem: a non-exact method based on an alternating least squares approach, and an exact method based on a reformulation of the problem. We prove the correctness of the exact method and compare the two solutions, showing that, when the number of partitions is small, the exact method provides better results in a fraction of the time required by the alternating least squares method. We also provide a branch and bound algorithm that can be used in place of the exact method when the number of partitions is too large, as well as a proof of NP-completeness of the optimization problem.
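The abstract does not spell out the model, so the following is only a minimal sketch of what an alternating least squares scheme for such a problem might look like. It assumes a prediction of the form X @ (alpha * (P @ beta)), where P is a binary feature-by-group matrix, alpha holds nonnegative per-feature weights that sum to 1 within each group, and beta holds one signed coefficient per group. The names `partitioned_als` and `nnls_pg` are hypothetical, the intercept is omitted, and a simple projected gradient loop stands in for whatever nonnegative solver the authors actually use.

```python
import numpy as np


def nnls_pg(A, b, x0, iters=500):
    """Approximate argmin_{x >= 0} ||A x - b||^2 by projected gradient descent.

    Stand-in solver (an assumption, not the paper's choice).
    """
    lipschitz = np.linalg.norm(A, 2) ** 2  # squared spectral norm of A
    x = np.maximum(0.0, x0)
    if lipschitz == 0.0:
        return x
    for _ in range(iters):
        # Gradient step on 0.5 * ||A x - b||^2, then projection onto x >= 0.
        x = np.maximum(0.0, x - A.T @ (A @ x - b) / lipschitz)
    return x


def partitioned_als(X, y, P, rounds=20):
    """Alternate between the group coefficients (beta) and feature weights (alpha).

    Assumes every feature belongs to exactly one non-empty group, i.e. each
    row of P contains a single 1.
    """
    group_size = P.sum(axis=0)
    alpha = 1.0 / (P @ group_size)  # uniform weights inside each group
    beta = np.zeros(P.shape[1])
    for _ in range(rounds):
        # beta step: with alpha fixed the model is linear in beta.
        Z = (X * alpha) @ P
        beta, *_ = np.linalg.lstsq(Z, y, rcond=None)
        # alpha step: with beta fixed the model is linear in alpha (alpha >= 0).
        W = X * (P @ beta)
        alpha = nnls_pg(W, y, alpha)
        # Rescale so each group's weights sum to 1, moving the scale into beta.
        sums = P.T @ alpha
        dead = sums <= 0.0  # groups whose weights all collapsed to zero
        beta = np.where(dead, 0.0, beta * sums)
        alpha = np.where(
            P @ dead.astype(float) > 0,
            1.0 / (P @ group_size),  # reset a dead group to uniform weights
            alpha / (P @ np.where(dead, 1.0, sums)),
        )
    return alpha, beta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 4))
    P = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
    y = X @ (np.array([0.3, 0.7, 0.5, 0.5]) * (P @ np.array([2.0, -3.0])))
    alpha, beta = partitioned_als(X, y, P)
    print("alpha:", alpha)
    print("beta:", beta)
```

Each half-step is a convex problem even though the joint problem is not, which is why a scheme of this kind can stall in a local optimum and is described in the abstract as non-exact. The rescaling at the end of each round leaves the predictions unchanged and only fixes the scale ambiguity between alpha and beta.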

Funder

Università degli Studi di Torino

Publisher

Springer Science and Business Media LLC

