Skip to main content

All Questions

Filter by
Sorted by
Tagged with
5 votes
1 answer
495 views

Assumptions of Linear Regression (homoscedasticity and normality of residuals)

I am confused about some assumptions of linear regression: homoscedasticity and residuals are normally distributed. These two require residuals, but to get the residuals, we need to fit the model ...
Ratchainant Thammasudjarit's user avatar
0 votes
0 answers
32 views

True or False: If the distribution of Y|X is normal, then the regression of Y on X must be both linear and homoscedastic [duplicate]

I'm trying to interpret an early and pretty dense (to me) paper on the theory of linear regression: Bartlett, M. S. (1934). On the theory of statistical regression. Proceedings of the Royal Society of ...
virtuolie's user avatar
  • 642
2 votes
1 answer
61 views

Anova model assumptions. How to go by?

I have a data frame that contains a continuous response variable measured on different species, at different elevation and month of sampling as explanatory variables. I want to analyze how the ...
scholar101's user avatar
0 votes
0 answers
36 views

I am unable to relate the normal distribution dependency for regression . I need a mathematical intuition cum understanding for regression assumptions [duplicate]

What is the mathematical significance for the assumptions of linear regression to hold true for arriving at a single/multiple regression formula? Can anyone use the assumed normally distributed ...
ANKIT CHAKRABORTY's user avatar
1 vote
1 answer
585 views

Dealing with violation of linear regression assumptions

I have a data set where some extreme, but not nonetheless important observations are present which prompts violation of the linear regression assumptions of normality and constant variance. The ...
OLGJ's user avatar
  • 327
1 vote
1 answer
64 views

Testing assumptions for repeated measures ANOVA

I was wondering how to assess residual normality of a repeated measures ANOVA. In some threads, users refer to Venables and Ripley: Residuals in multistratum analyses: Projections and recommend to ...
a.henrietty's user avatar
0 votes
0 answers
133 views

Can't fix non-normality and heteroskedasticity

I am attempting, via linear regression, to model a dataset.I've tried various transformations on the response/ and predictors, as well as WLS but the assumptions are not met. I'm looking for the ...
Chase_stats's user avatar
1 vote
0 answers
248 views

Lmer model fails normality and homogeneity of the residuals; model-predicted lines are not great. Some predictors might not be linear. What can I do?

CONTEXT: I gathered data from 1000 participants in five different countries, with each line in my dataset corresponding to a unique participant. My study examines how much individuals support the idea ...
Olivia's user avatar
  • 375
0 votes
0 answers
211 views

Bootstrapping with Hetroskedasticity?

In most of the times in linear regression, the two problems of non-normality and hetroskedasticity are present both in the model. However, the two problems could be (but not neccessarily) inter-...
Hussain's user avatar
  • 171
2 votes
0 answers
36 views

How do I go about rectifying violated assumptions when more than one is violated at the same time? [duplicate]

I am currently trying to run a model analyzing the duration of the egg stage of each sex of two species of insect across five different temperatures. All independent variables are categorical. My ...
Insect_biologist's user avatar
1 vote
0 answers
138 views

Model residuals are normally distributed but they are heteroscedastic and autocorrelated

I have a candidate set of five linear models, as I want to choose the model with the lowest AIC and calculate predictions from it. For each model, I have tested for normality of model residuals (...
Darius's user avatar
  • 223
0 votes
0 answers
91 views

Calculating fixed effects in a mixed model with non-normality and heteroscedasticity with a 3-level time variable?

Due to non-normality and heteroscedasdicity, I use robustlmm and not lme4 for my mixed effects model. The variables look like this: ID: subject variable (random factor) var1: categorical between ...
profiterol's user avatar
0 votes
1 answer
89 views

Heteroskedacity and non-normality - What to do?

I conducted an experiment in which I am trying to model the relationship between my response weed_coverage [%] and the predictors soil moisture [%] + treatment + distance. Weed_coverage and ...
Effigy's user avatar
  • 93
1 vote
0 answers
38 views

Can we objectively determine whether assumptions have been violated in R?

I'm testing statistical assumptions in R and so far I've been using plot(model, which = c(1:6)), which produces six graphs for linearity, normality of errors, ...
user13343754's user avatar
4 votes
1 answer
241 views

How does data that violates linear regression assumptions (of the residuals) look?

Linear regression has two assumptions about the residuals : The residuals should have constant variance (for every level of the predictor). The residuals should follow a normal distribution. Is it ...
Sam's user avatar
  • 777
1 vote
0 answers
35 views

Problems with VAR: Autocorrelation when imposing restrictions, ARCH effects and non-normality at all times

I am estimating a VAR model for log-returns of: copper prices, USD/local currency exchange rate, and the local stock market index. Using VARselect I estimated a VAR(...
DDV's user avatar
  • 11
6 votes
1 answer
1k views

Heteroscedasticity and non-normal errors are only an issue when predicting from a linear model - why?

In Regression and Other Stories, the authors state that heteroscedasticity and non-normal errors are only problematic when predicting from a linear model (1; p. 154-155): Equal variance of errors. ...
Ezra Herman's user avatar
0 votes
1 answer
585 views

Do I need to check for Autocorrelation, Heteroskedasticity and Normality when building a model with data that are not time series? [duplicate]

I want to build a simple regression model with non-time series such as Client ID Number. when testing the validity of this model, do I need to check for autocorrelation, Heteroskedasticity and ...
cookiemonster1697's user avatar
0 votes
0 answers
270 views

When distributions are non normal and heteroscedastic, is it preferred to use ordinal logistic regression or permutation tests?

I am currently conducting statistical tests on my two independent samples (both with more than 1500 entries each). The sample sizes are no equal. My response variables are interval as well as quasi ...
Annanas's user avatar
  • 13
1 vote
1 answer
902 views

Is assumption of residual normality and Homoscedasticity in nonlinear regression

I am still learning a lot about nonlinear regression and I have some questions about residual normality and Homoscedasticity: 1) From what I could find here (Consequences of violating assumptions of ...
João Duarte's user avatar
0 votes
1 answer
417 views

Dealing with violation of OLS assumptions

I am currently writing my Master's thesis in economics. I am analyzing the second home rate in Swiss municipalities in R. The second home rate for municipality $j$ is defined as the share of the ...
Beat Johner's user avatar
1 vote
2 answers
2k views

Help interpreting residual vs fitted plot and normality (ANOVA on R)

I'm carrying out a statistical analysis on R using ANOVA and am not sure if the data meets the assumptions of normality of residuals or homogeneity of variance. My data : And my plots: Any help is ...
Robbie's user avatar
  • 31
3 votes
1 answer
520 views

Is one-way ANOVA robust to violations of homoscedasticity?

I read here that if group sizes are equal, ANOVA is robust against the violation of the assumptions of normality and homoscedasticity. I am wondering if this is the case, and if so why?
manesioz's user avatar
  • 133
3 votes
1 answer
957 views

Analysis of variance with not normally distributed residuals : how important is normality?

I am using gls and anova to analyse my data. I use gls to aply weights. I have one factor (tree genotype) and I analyse its influence on soil content. Here is an exemple of my data with one variable (...
Karelle Rheault's user avatar
2 votes
1 answer
6k views

Dealing with heteroscedasticity and non-normality in a mixed model

I am trying to fit a mixed model (person as random effect) on data which has heteroscedasticity and non-normality. I log-transformed the Y-variable but it did not ...
Rodrigo's user avatar
  • 39
5 votes
1 answer
119 views

Use of supportive inferential statistics (e.g. Levene's and normality tests) in underpowered samples

We know that underpowered statistics greatly increase the probability of a type II error (by definition), meaning a greater chance of failing to reject the null hypothesis despite the existence of a '...
Josh Blake's user avatar
0 votes
0 answers
46 views

Which one of these is correct for linear regression?

Only one of these is supposed to be the correct one for simple linear regression. Which pair of plots would you say has constant variance and normal distribution? I feel like none of them have both ...
koras's user avatar
  • 51
2 votes
0 answers
352 views

Engle Granger Error Correction Model - Normality, heteroskedasticity and Autocorrelation tests

I'm building an Error Correction Model using the Engle-Granger approach with the following interest rates data: Observations: 230 Periodicity: Monthly I have the following model: $$\Delta R_t = \...
Quant_Penguin's user avatar
0 votes
0 answers
90 views

Transformation for non-normally distributed homoscedastic data?

I have continuous data (latency to perform a behaviour) that is heteroscedastic and also the data and the residuals are not normally dsitributed. I've tried square root $\frac{1}{log}$ and log10 ...
Gra's user avatar
  • 11
3 votes
2 answers
2k views

Theil-Sen estimator assumptions

I found by accident the nonparametric Theil-Sen Estimator as a replacement for standard OLS linear Regression. How well does it perform with autocorrelated data, non-normal residuals and ...
Neon67's user avatar
  • 69
0 votes
0 answers
856 views

Logistic regression normality and homoscedasticity [duplicate]

Why does logistic regression not require residuals to be normally distributed and homoscedastic the way linear regression does? Why does this not cause problems for estimating logistic regression ...
grig109's user avatar
0 votes
1 answer
37 views

Can I use residuals for transformation?

This may be a simple question, but my data (once transformed) passes normality, however does not pass homoscedasticity. Can I look at the residuals to see if they pass this? or am I over looking this?
AyAyRon166's user avatar
0 votes
2 answers
74 views

Available options when data violates multiple test assumptions?

I am confused as to how I should analyse my data since I think it violates almost all possible assumptions. I applied several hormoneconcentrations to plants and counted the amount of roots. ...
CJA's user avatar
  • 1
2 votes
1 answer
407 views

Testing the assumptions of a linear regression model

I am building a model for predicting airline prices. I have 19 relevant predictors and and 114000 observations. I am getting an $R^2$ predicted of 95.87 (Minitab displays $R^2$ predicted after ...
Earthling's user avatar
  • 147
5 votes
1 answer
890 views

Under what conditions is Welch's unequal variance t-test more conservative than Student's equal variance t-test?

I am comparing the anatomy of two Neanderthal populations for which sample sizes, the number of available bones, is low (usually between 3 and 8 for a given trait). I am using Welch's unequal variance ...
Pertinax's user avatar
  • 587
1 vote
0 answers
265 views

Problem with one-way ANOVA assumptions

Our research is entitled "Relation of Family Perfectionism on the Career Indecisions for Grade 10, 11 and 12 Students" Independent variable: grade level Dependent variable: family perfectionism and ...
Anonymous's user avatar
2 votes
0 answers
235 views

Are regression estimates still reliable despite heteroscedasticity and non-normality

I am performing a simple linear regression with the lm() function to make statements about the association between the two variables. But I am not sure if my ...
Kaizen502's user avatar
0 votes
1 answer
773 views

In linear regression, why do the assumptions such as normality and homoscedasticity only hold for the residuals?

Why does the distribution of the residuals should be normal and variance of the residuals should be same across the values of the independent variable? Why do we assume these for the residuals only? ...
behrengi's user avatar
2 votes
1 answer
55 views

Are my assumptions right for this question?

I answered these homework questions but I was told that at least one of my answers is wrong. However, I can't tell which one I answered wrong. What incorrect assumptions have I made? QUESTION: MY ...
orangebull's user avatar
8 votes
1 answer
2k views

Heteroskedasticity and Distribution of the Dependent Variable in Linear Models

I am running a multivariate ols model where my dependent variable is Food Consumption Score, an index created by the weighted sum of the consumption occurrences of some given food categories. ...
Caserio's user avatar
  • 406
3 votes
0 answers
1k views

Linear regression with trimmed data

I would like to know how experts deal with real data. Even if statistical text books uses real data I'm always surprised how good the real data are and at the end of the exercises the residuals are ...
giordano's user avatar
  • 1,079
3 votes
1 answer
2k views

Compare several means with different sample sizes (greater than 2)

I have seven different groups with different sample sizes and variances and I want to compare the means of their data. I'm not very informed in statistics, so could anyone help me out here? I've only ...
Abhi V's user avatar
  • 141
1 vote
2 answers
701 views

ANOVA and Kruskal-Wallis Test in One Study

I am currently analyzing the results of my study which deals with several dependent variables. I have tested the data for normality and homogeneity and all but one passed the assumptions for ANOVA. ...
P. Jim's user avatar
  • 41
1 vote
1 answer
123 views

What is an appropriate test for a normally distributed, heteroscedastic, multi-factor data set?

I have a data set of active layer depths from an Arctic field site. There are two factors in the data set, Month measured (July or August), and Location (shrub patch or open tundra). I had intended on ...
C W's user avatar
  • 53
5 votes
1 answer
2k views

Standardized residuals vs fitted values: OLS assumptions satisfied?

Based on only the above plot, what comments would you make about whether the OLS assumptions are satisfied? In particular homoskedasticity, normality. I just want to know if I'm right. It seems to me ...
rbm's user avatar
  • 1,003
4 votes
1 answer
226 views

Is an ANOVA applicable for these data?

I have a data set from 7 groups, with 20 fish in each group. Measurement of a parameter is made on 25 cells from each fish (so each observation in the data-set is completely independent, right?). One ...
Apo's user avatar
  • 43
1 vote
0 answers
2k views

Non-normally distributed residuals for multivariate linear regression.Still a valid model?

I try to know if the independent variables are affecting the outcome of the dependent variable, but while the Shapiro-Wilk test shows residuals non-normally distributed, the autocorrelation of errors ...
Saccharo's user avatar
  • 181
5 votes
3 answers
15k views

When to use Brown-Forsythe Test?

I have been researching the differences between Welch ANOVA and Brown-Forsythe Test. I know that Welch ANOVA is used for more than two groups comparing whether there is statistically meaningful ...
evros's user avatar
  • 901
1 vote
0 answers
942 views

Unpaired t-test: large samples, unequal variances, unequal sample sizes [duplicate]

I a have two independent samples one with more than 700 observations and the second with more than 500 observations. I firstly use Cramér-von Mises test to test normality just to check how non-normal ...
virusdotcom's user avatar
2 votes
1 answer
5k views

Consequences of violating assumptions of nonlinear regression when comparing models and/or datasets

I have a question about the consequences of using non-linear regression when the data violate the assumptions of (1) homoscedasticity and (2) normal distribution. Specifically, I am wondering about ...
Angela's user avatar
  • 540