Nonsingular subsampling for regression S estimators with categorical predictors

Koller, Manuel; Stahel, Werner A. (2017). Nonsingular subsampling for regression S estimators with categorical predictors. Computational Statistics, 32(2), pp. 631-646. Springer 10.1007/s00180-016-0679-x

[img]
Preview
Text
Koller ComputStat 2017.pdf - Published Version
Available under License Publisher holds Copyright.

Download (449kB) | Preview
[img]
Preview
Text
Koller ComputStat 2017_postprint.pdf - Accepted Version
Available under License Publisher holds Copyright.

Download (1MB) | Preview

Abstract

Simple random subsampling is an integral part of S estimation algorithms
for linear regression. Subsamples are required to be nonsingular. Usually, discarding
a singular subsample and drawing a new one leads to a sufficient number of non-
singular subsamples with a reasonable computational effort. However, this procedure
can require so many subsamples that it becomes infeasible, especially if levels of
categorical variables have low frequency. A subsampling algorithm called nonsingu-
lar subsampling is presented, which generates only nonsingular subsamples. When no
singular subsamples occur, nonsingular subsampling is as fast as the simple algorithm,
and if singular subsamples do occur, it maintains the same computational order. The
algorithm works consistently, unless the full design matrix is singular. The method is
based on a modified LU decomposition algorithm that combines sample generation
with solving the least squares problem. The algorithm may also be useful for ordinary
bootstrapping. Since the method allows for S estimation in designs with factors and
interactions between factors and continuous regressors, we study properties of the
resulting estimators, both in the sense of their dependence on the randomness of the
sampling and of their statistical performance.

Item Type:

Journal Article (Original Article)

Division/Institute:

04 Faculty of Medicine > Pre-clinic Human Medicine > Institute of Social and Preventive Medicine (ISPM)

UniBE Contributor:

Koller, Manuel

Subjects:

600 Technology > 610 Medicine & health
300 Social sciences, sociology & anthropology > 360 Social problems & social services

ISSN:

1613-9658

Publisher:

Springer

Language:

English

Submitter:

Doris Kopp Heim

Date Deposited:

06 Sep 2016 13:33

Last Modified:

05 Dec 2022 14:58

Publisher DOI:

10.1007/s00180-016-0679-x

Uncontrolled Keywords:

Robust regression; MM estimate; S estimate; Resampling; Collinearity; Bootstrap; Dummy variables

BORIS DOI:

10.7892/boris.87843

URI:

https://boris.unibe.ch/id/eprint/87843

Actions (login required)

Edit item Edit item
Provide Feedback