All Questions
Tagged with heteroscedasticity multiple-regression
86 questions
5
votes
1
answer
495
views
Assumptions of Linear Regression (homoscedasticity and normality of residuals)
I am confused about some assumptions of linear regression: homoscedasticity and residuals are normally distributed. These two require residuals, but to get the residuals, we need to fit the model ...
2
votes
1
answer
168
views
Assumption in multiple linear regression
The principles of multiple linear regression are widely described, however there are still some aspects I don't truly understand why. Specifically speaking I don't understand why heteroscedasticity ...
0
votes
1
answer
149
views
How critical/serious is the heteroscedasticity in my data (Breusch-Pagan test significant at p=.03)?
edit below
I am doing this analysis for the first time. How concerned should I be about heteroscedasticity in my data? Here's the scatterplot of predicted values vs residuals:
The Breusch-Pagan test ...
1
vote
0
answers
32
views
Trust the graphs or go with Breusch-Pagan and White's tests for Homoscedasticity on large datasets? [duplicate]
I have a large dataset (n > 500,000) which I'm building a linear model with lm(PV1READ ~ PV1MATH + PV1SCIE + ST004D01T). Tests for Normality, No ...
0
votes
0
answers
36
views
I am unable to relate the normal distribution dependency for regression . I need a mathematical intuition cum understanding for regression assumptions [duplicate]
What is the mathematical significance for the assumptions of linear regression to hold true for arriving at a single/multiple regression formula? Can anyone use the assumed normally distributed ...
2
votes
2
answers
295
views
Multiple linear regression homoscedasticity/linearity
My question is about the implications of the violation of homoscedasticity/linearity for multiple linear regression. I have tried to find the answer in multiple sources but could not figure it out.
I ...
2
votes
3
answers
352
views
Why does heteroskedasticity not affect $R^2$ and why does it make estimated regression more statistically significant?
I am studying what the consequences of heteroskedasticity are. And i found that assuming that the model is linear in the parameters (i.e $Y=X\beta+\epsilon$), is identifiable, has no perfect ...
1
vote
2
answers
253
views
Heteroscedasticity still present after Box-Cox transformation
I just started to learn regression and I'm trying to fit a linear regression model to some data with one continuous independent variable x1, one categorical variable x2, and the dependent variable y.
...
0
votes
0
answers
133
views
Can't fix non-normality and heteroskedasticity
I am attempting, via linear regression, to model a dataset.I've tried various transformations on the response/ and predictors, as well as WLS but the assumptions are not met. I'm looking for the ...
0
votes
0
answers
211
views
Bootstrapping with Hetroskedasticity?
In most of the times in linear regression, the two problems of non-normality and hetroskedasticity are present both in the model.
However, the two problems could be (but not neccessarily) inter-...
2
votes
0
answers
192
views
Is it impossible to fit HC4 robust method for linear mixed models?
I've been struggling all day to fit a robust HC4 model to my lmer in R (all the examples I've seen were with lm. I simply cannot ...
0
votes
0
answers
177
views
What to do when there is Heteroscedasticity in a multiple regression model?
So I’m doing a multiple regression analysis with Y and 6 features. I realize that Heteroscedasticity exists between Y and 2 of the independent variables. How can I deal with this problem? Y is already ...
0
votes
0
answers
471
views
Covariance matrix of errors for homoskedasticity/heteroskedasticity
I've seen homoskedasticty and heteroskedasticity defined as the following
The error term of our regression model is homoskedastic if the variance of the conditional distribution of $u_{i}$ given $X_{...
1
vote
1
answer
535
views
How to tell if there is a homoscedasticity of the model based on this plot?
I am building regression model of cholesterol predicted by 4 dietary components. I want to check if the assumption of Homoscedasticity is satisfied. I plotted Residuals vs Laverage plot.
...
3
votes
2
answers
436
views
Does this plot show heteroscedasticity?
I am running a multiple regression with a continuous DV and a mix of dichotomous and continuous IVs (but mostly dichotomous). This is the ZRESID vs ZPRED scatterplot, and I think there is ...
0
votes
0
answers
70
views
Tried rectifying Heteroscedasticity and I don't know why I did not succeed
I am doing a regression on the influence on interest rates on marketing spending. I tried many different approaches to get my regression working. I have panel data and time series. I got a data set of ...
0
votes
1
answer
1k
views
Trying to use the white test in r
I am doing a regression on the influence on marketing spending.
I have already tested for heteroskedasticity with the Breusch-Pagan Test and found that the test came out positive.
Based on the ...
1
vote
0
answers
170
views
Weighted least square regression - different ways of estimating weights
Newbie here.
I have a question regarding WLS regression.
Specifically, I've come across different ways of estimating weights in WLS regression, the most frequent ones being:
...
1
vote
0
answers
78
views
Test statistic for regression analysis with robust standard errors
I am looking for the test statistic for a regression analysis (using lm in R) with robust standard errors. As far as I understood the F-statistic from the original model is invalid but I was wondering ...
1
vote
0
answers
741
views
How can I transform response variable with negative values to fix heteroscedasticity
I'm trying to build a multivariable least-square linear regression model, and there is heteroscedasticity in my model. I saw many articles suggesting transformations such as log transformation or Box-...
5
votes
1
answer
706
views
how to deal with heteroscedasticity in least squares regression with multiple independent variables
I am trying to build a least-squares regression model and when I analyzed the independent variables, I saw a case of heteroscedasticity in one of the independent variables.
I'm building this model in ...
0
votes
1
answer
217
views
Can I use an OLS regression model with assumption violations for filling the missing data and prediction?
I am working on a multiple linear regression model as ordinary least squares (OLS)with several predictors and one response variable.
The data is for the counties for different years around 2200 ...
6
votes
1
answer
274
views
Flawed multiple linear regression in academia? Heteroscedasticity's effect on p-value?
I believe I have found a paper in academia that has used a flawed multiple linear regression. I have downloaded the data set and replicated their regression results. I have done some diagnostics and ...
3
votes
1
answer
1k
views
Which plot to check for heteroscedasticity in a multiple regression model
I have a linear model like:
Reg.Model = lm(Y~X1+X2+X3, data=DF)
If I want to check for the presence of heteroscedasticity using a plot, should I plot the residuals ...
0
votes
0
answers
52
views
Residual plot Diagnosis
I am working on a multiple linear regression model to investigate the relationship between several independent variables such as profitability, leverage, board size, percentage of women on board, ...
2
votes
2
answers
537
views
Do explanatory variables have to have a linear relationship with the response variables?
Do explanatory variables have to have a linear relationship with the response variable in multiple linear regression? What is the reason for this assumption?
Also, why are heteroscedastic ...
1
vote
1
answer
1k
views
Homoscedasticity issues? And how to solve this?
Based on the plots attached. Do i have any issues with the homoscedasticity assumption? It looks like the dots on the scatterplot are spread but there is some sort of downward trend. Is this causing ...
0
votes
1
answer
164
views
Assumption of Homogeneity of variance [duplicate]
Why is assumption of Homogeneity of variance required. What are the problems if they are not satisfied
1
vote
1
answer
53
views
How Do I Create a Better Model?
Disclaimer: I am a senior undergraduate student of Political Science with little proficiency in Data Science; please help me understand better and forgive any ensuing statistical illiteracy!
TL;DR: I ...
0
votes
0
answers
47
views
In a multilinear regression, does the usage of White standard errors always correct for heteroscedasticity?
I've calculated a multilinear regression. When testing the assumptions of linear regression, I've come to understand that my model violates the assumption of homoscedasticity (as shown with a Breusch-...
3
votes
1
answer
702
views
Checking the constant variance assumption for residuals vs fitted plots: What about for the same fitted values?
For a residuals vs fitted plot, we use the fitted values $\hat{Y} = \beta_0 + \beta_1 + \cdots + \beta_p x_p$ on the horizontal axis and the residuals on the vertical axis, and then compare the ...
1
vote
0
answers
49
views
how to check heteroskedasticity in pyhton?
After building model by using glm. How to check heteroskedasticity? are there any tests available to find or is it checked by using graphs??
0
votes
1
answer
32
views
How can the below graph be interpreted
How can you interpret the scale location graph in terms of Homoscedasticity?
0
votes
1
answer
240
views
Data transformation to deal with heteroscedacity
I am trying to build a linear regression model of the data which generally looks like this:
Certainly due to the exponential (I guess) nature of the data, I have tried to do a logarithmic ...
1
vote
0
answers
1k
views
Generalized Least Squares for linear regression with continous and categorical predictors
I have a linear regression model in R studio with a continous and a categorical predictor, where the assumption of homoscedasticity is violated:
...
2
votes
0
answers
228
views
How much heteroscedasticity need to be present in order to justify the use of robust standard errors?
Im trying to figure out if my data is heteroscedastic and if I need to use robust standard errors (Huber-White standard errors). The dataset contains 70 000 rows and 5 columns. Y is a numeric ...
2
votes
1
answer
143
views
Do these graphs show that the regression assumptions are met?
Is there any concern regarding this plot, specifically that it meets the homoscedasticity assumption? May I continue with multiple linear regression? How can I fix this?
My research is on household ...
0
votes
0
answers
68
views
Minimizing the expectation value of least-squares loss when data and model are randomly distributed with known normal distribution
How do you minimize the stochastic robust least-squares problem
$$
\min_x \mathbb{E}\left\{||A x - b||^2\right\}
$$
in which both the parameters $b$ and the model $A$ are normally distributed with ...
0
votes
1
answer
1k
views
How to fix heteroscedasticity (funnel shape)?
I am running a mlr in python on a dataset with 2D feature vectors, X1 and X2 on a single response, Y. The data ends up being funnel-shaped, as below:
X1 v Y, with the colors being X2.
It was ...
2
votes
0
answers
49
views
Multiple linear regression and model build in light of regression diagnostics
I have a dataset of approx. 200 observations, consisting of Profit which is my dependent variable and is continuous, and the independent variables are Turnover (also continuous), and 3 additional ...
2
votes
2
answers
3k
views
Does this graph imply a violation of homoscedasticity?
I assume that this graph doesn't support the assumption of homoscedasticity. Am I right? Does it make sense to carry out another test to be sure?
y-axis: Regression Standardized Residual, x-axis: ...
1
vote
0
answers
96
views
How do I appropriately control for a limiting/maximum value in regression?
I have a dataset where one variable is limited by the value of another. It is a study of participants with a particular disease. By necessity, therefore, age of disease onset, A, can be no larger than ...
1
vote
1
answer
123
views
Multiple Regression - Heteroskedasticity? - Is this a linear model?
I am analyzing a multiple regression model in SPSS. I am checking whether the requirements for a linear model are met. The last requirement is homoscedasticity.
In my survey, you can choose between 1-...
7
votes
0
answers
2k
views
What are the differences between HC estimators and their small sample properties?
I am currently using R to run regression with the following code:
...
2
votes
1
answer
329
views
Heteroscedasticity consistent (HC) standard error analysis and interaction effects in an OLS
I have made a model with several variables, and 8 of them interact with a dummy to find interaction effects. These are added stepwise, resulting in three models. Now, through a Breusch-Pagan test I ...
1
vote
1
answer
116
views
Evidence for heteroscedasticity from unordered values
I'm fitting a linear regression model on a dataset about how many upvotes a certain post will get based on its views, its author's reputation ecc.
To satisfy the normality assumptions I performed a ...
0
votes
2
answers
2k
views
Multiple linear regression: homoscedasticity or heteroscedasticity
Regarding the multiple linear regression: I read that the magnitude of the residuals should not increase with the increase of the predicted value; the residual plot should not show a ‘funnel shape’, ...
1
vote
0
answers
355
views
Using coeftest results in predict.lm() in R [closed]
I am analyzing a dataset in which the variance of the error term in my regression is not constant for all observations.
For this, I re-built the model, estimating heteroskedasticity-robust (Huber-...
2
votes
1
answer
2k
views
In the presence of heteroskedasticity, is quantile regression more appropiate than OLS?
..for understanding the relationship between a dependent and independent variables, given that quantile regression makes no assumptions about the distribution of the residual.
1
vote
2
answers
2k
views
Constant Variance Assumption in Linear Regression
It seems to me that the following plot of "Residuals Vs. Fitted Values" violates the assumption of constant variance, since for lower fitted values, there are fewer points whereas for higher fitted ...