Newest 'rule-of-thumb' Questions

1 vote

2 answers

56 views

Prevalence upper bound when no events are observed in sample

In a sample of 2000 observations, no positive cases were found, but I want to still be able to provide an upper bound for the prevalence. The general rule that people seem to use is to simply take 3/n,...

philtttx3x

11

asked Sep 6 at 12:04

5 votes

3 answers

926 views

Rule of Thumb meaning in statistics

I was wondering what the term "rule of thumb" actually means in statistics. Why did they select this name, for example, for sample size calculation? Is it like an approximation based on ...

user413503

107

asked Jun 21 at 7:46

4 votes

1 answer

413 views

Academic reference on the "minimum of 5 expected counts per cell" rule of thumb for Chi-Square test?

I remember very clearly an academic paper that investigated thoroughly the "minimum of 5 expected counts per cell" rule of thumb when conducting a chi-square test on a contingency table. It ...

J-J-J

5,883

asked Feb 3, 2023 at 12:07

1 vote

1 answer

940 views

Number of observations for multiple linear regression

I've read multiple responses here that the recommended range of observations per each IV is 20. However, I'd like to ask for clarification on whether this number (20) includes dummy variables. Let's ...

Seba

11

asked Jun 6, 2022 at 17:23

1 vote

0 answers

1k views

Identification High Leverage for Logistic Regression

As seen below from Piet De Jong, Generalized Linear Models for Insurance Data, for linear model, we can identify high leverage, if the value of leverage>2p/n (or hii > 2p/n) then the indicates ...

Jasmine Helen

33

asked Dec 21, 2021 at 11:48

1 vote

1 answer

1k views

Rule of Thumb for multivariable linear regression analysis

Rule-of-thumb suggested by Green(1991)(https://doi.org/10.1207/s15327906mbr2603_7) : "Some support was obtained for a rule-of-thumb that $N ≥ 50 + 8 m $ for the multiple correlation and $N ≥104 + ...

Jino

11

asked Aug 31, 2021 at 10:25

0 votes

0 answers

328 views

Sum of sensitivity and specificity meaningful?

I came across an article in the BMJ that claims that a useful rule of thumb metric for assessing a medical test's performance is that the sum of specificity and sensitivity should be greater than 1.5 ...

N Blake

579

asked May 31, 2021 at 14:14

0 votes

1 answer

280 views

Doc2vec Corpus Size Recommendation

I'm trying to make a semantic search engine with Doc2Vec where you query the model a document and it returns N most similar documents from its training corpus. I'm having trouble pushing accuracy past ...

fpt

123

asked Mar 11, 2021 at 15:39

3 votes

1 answer

200 views

What is the optimal sampling split for stratified sampling?

Suppose we have a population split into two strata of interest and we want to estimate the mean of some quantity of interest pertaining to the population. Denote the quantities of interest for the ...

Ben

133k

asked Jan 1, 2021 at 23:05

0 votes

1 answer

194 views

On the existence of rule of thumb for machine learning algorithms

I want to know if there are conditions about the minimum number of observations to have (the relation between the number of variables and the number of presence and absence records) in order to use ...

user1988

155

asked Dec 23, 2020 at 20:41

13 votes

3 answers

2k views

Revisiting the Rule of Three

The rule of three is a method for calculating a 95% confidence interval when estimating $p$ from a set of $n$ IID Bernoulli trials with no successes. My understanding from its derivation is that the ...

Set

1,463

asked Nov 22, 2020 at 21:23

2 votes

1 answer

510 views

Is there any rationale for rules of thumb for maxlag selection?

I understand that the optimum number of lags can be selected using some information criterion (e.g. Akaike (AIC), Schwartz Bayesian Information Criterion (SBIC) etc.). However, in order to select the ...

1muflon1

915

asked Nov 12, 2020 at 22:21

1 vote

0 answers

106 views

Bounds for hypothesis testing with non-normal distributions

After doing randomized experiment with two variants (AB tests), I want to do hypothesis testing without the normal distribution assumption to compare their means. Since the samples sizes are different,...

RandomVariable

11

asked Aug 4, 2020 at 14:53

1 vote

0 answers

2k views

Reference for the rule of thumb $\sqrt{n}$ for the number of bins of an histogram

Does anyone have an idea where the rule of thumb $\sqrt{n}$ for the number of bins of an histogram come from? I need a reference to put in my article. I remember that rule from my college times, but ...

coolsv

141

asked Jun 13, 2020 at 0:43

1 vote

0 answers

550 views

Does the 10% condition apply to geometric distributions?

I know that the 10% condition is required to assume independence in binomial settings. However, in a geometric setting where there is no fixed n-value, how would one determine independence? I am a ...

J Winnie

11

asked Jan 11, 2020 at 19:58

3 votes

0 answers

77 views

How small is too small to fit a reasonable long memory model?

When looking at papers about long memory they tend to analyze data sets whose length is in the thousands, see http://www.math.canterbury.ac.nz/~m.reale/pub/Reaetal2011.pdf for an example. My question ...

adunaic

1,309

asked Jan 15, 2019 at 17:26

1 vote

1 answer

318 views

How should I treat categorical variables for the purpose of the "One in 10 rule"?

Hope a basic question like this is alright! To avoid overfitting, we try to maintain enough cases for the least common event per explanatory variable; people usually recommend at least 10. How should ...

Benjamin Liu

11

asked Dec 12, 2018 at 1:35

2 votes

0 answers

55 views

Bootstrap and distribution of the test statistic

One rule of thumb says that we should avoid using the bootstrap to construct a confidence interval for some test statistic, if (1) the test statistic is greatly affected by outliers or by rare ...

D.W.

6,738

asked Oct 28, 2018 at 0:10

3 votes

1 answer

540 views

Is there a general rule of thumb for how big the ratio ( Sample size / parameters in model ) should be?

I'm aware of model selection methods based on AIC and backwards/forwards selection, but I'm wondering if there is a general rule about how big your sample size $n$ should be (as an absolute minimum) ...

Xiaomi

2,554

asked Oct 5, 2018 at 1:32

1 vote

1 answer

2k views

Equality of variances based on Rule of Thumb (for very small groups)

I have three groups with very small sample sizes (6 obs per each group). In order to use ANOVA I just want to make sure that the equality of variances is satisfied, as Levene's test does not have ...

stat

45

asked Aug 29, 2018 at 15:33

1 vote

0 answers

181 views

Minimum sample size heuristic for using Hotelling's T$^2$ test

My understanding is that for non-normal data, a widely accepted heuristic for using the t-test is a sample size of at least $n=30$. Since for smaller sample sizes the distribution of the sample mean ...

Set

1,463

asked Aug 2, 2018 at 1:50

5 votes

1 answer

648 views

How good an approximation is sampling with replacement to sampling without replacement?

I'm learning about probability with Feller's book and he states that, when the population size $n$ is big in comparison with the sample size $r$, then $n_r$, which is a shorthand for $\frac{ n!}{(n-r)!...

David Moseler

198

asked Jun 10, 2018 at 23:52

18 votes

2 answers

8k views

What is the logic behind "rule of thumb" for meaningful differences in AIC?

I've been struggling to find meaningful guidelines for comparing models based on differences in AIC. I keep coming back to the rule of thumb offered by Burnham & Anderson 2004, pp. 270-272: ...

Tripartio

2,346

asked Jun 5, 2018 at 12:11

3 votes

1 answer

1k views

Paper for the rule of thumb for homogenity of variances

I often read about a rule of thumb, one can apply if the test of equal variances returns a significant result. Depending on the source, the proposed maximal $F$ ratio varies between 1.5 and 4, which ...

j3ypi

133

asked May 21, 2018 at 15:00

1 vote

1 answer

4k views

Multiple Regression - Minimum Observations Per Dummy Variable

I believe the rule of thumb is at least 10-20 observations per predictor variable, but I was hoping to get some additional clarification. Suppose a hypothetical example with dependent variable of ...

user98462

11

asked Jun 10, 2017 at 16:36

1 vote

1 answer

657 views

rule of thumb for the number of degrees of freedom of a chi-squared to converge to normal

It is said that if $X\sim\chi^2_{(k)}$, then $Y=\frac{X-k}{\sqrt{2k}}$ converges to $Y\sim N(0,1)$ when $k$ tends to $\infty$. Is there a commonly used rule of thumb for a $k$ to be "big enough" to ...

Rafael

213

asked Apr 21, 2017 at 18:50

4 votes

0 answers

153 views

Optimal number of folds for nested cross-validation? [duplicate]

I'm doing nested cross-validation on a classification problem (inner loop to tune hyperparameters, outer loop to select algorithm). I've often heard that a reasonable number of folds for standard (I.e....

enricoferrero

536

asked Mar 5, 2017 at 16:55

3 votes

3 answers

3k views

How to judge skewness based on the mean and range?

Is there any rule of thumb to judging skewness of data based on its mean and range (max-min)? I found such implication in one of the papers I'm reading and I can't see why it would be obvious. The ...

Paula

288

asked Nov 20, 2016 at 18:52

2 votes

3 answers

2k views

How many cycles are required to model seasonality?

How many cycles should I have in my time series to model the seasonality component of it so that I can get rid of it and just look at the trend?

user2253546

241

asked Sep 20, 2016 at 16:42

2 votes

0 answers

327 views

'68–95–99.7 rule' equivalent for multivariate normal distribution [duplicate]

The 68–95–99.7 rule is a convenient way of quickly getting an overview of the spread of some normally distributed data. I am wondering if there is an equivalent rule in the multivariate case? and how ...

Faur

201

asked Sep 5, 2016 at 9:47

15 votes

3 answers

10k views

Relation between learning rate and number of hidden layers?

Is there any rule of thumb between depth of a neural network and learning rate? I have been noticing that the deeper the network is, the lower the learning rate must be. If that's correct, why is ...

user_1177868

752

asked Sep 2, 2016 at 2:45

2 votes

2 answers

5k views

Rule of Thumb for Accepting the Null Hypothesis

Usually, hypothesis testing is performed with the goal to make a conclusions about the statistical significance of an effect, i.e. $H_0 \ \hat{=} \ \text{No Effect} $ vs. $H_1 \ \hat{=} \ \text{Effect}...

random_guy

2,440

asked Jul 6, 2016 at 13:17

10 votes

1 answer

373 views

How does one formalize a prior probability distribution? Are there rules of thumb or tips one should use?

While I like to think I have good grasp of the concept of prior information in Bayesian statistical analysis and decision making, I often have trouble wrapping my head around its application. I have ...

Phil

405

asked Jan 11, 2016 at 23:52

3 votes

1 answer

3k views

Rule of thumb for the number of significant digits to report

What is a good rule of thumb for the number of significant digits to report? Preferably, the rule of thumb is given in a citable publication. I am particularly interested in a rule that does not ...

prettyprint

33

asked Jan 8, 2016 at 16:07

3 votes

1 answer

5k views

How to tell a coefficient of variation (C.V.) is high?

I have a group of data series which contain hundreds of values. I've calculated the C.V.s of these data series, but I don't know how I can recognise if they are high or low according to the C.V. ...

ChangeMyName

275

asked Dec 10, 2015 at 15:39

1 vote

0 answers

72 views

Statistical texts [closed]

Neophyte social science researcher here. I keep finding myself in the position of needing to reference the interpretation of some statistic or other (e.g. >.40 is adequate factor loading, >.80 is ...

tony.minter

21

asked Oct 8, 2015 at 13:40

6 votes

2 answers

3k views

Rule of thumb for using logarithmic scale

When I am given a variable, I usually decide whether to take its logarithm based on gut feeling. Usually I base it on its distribution - if it has long tail (like: salaries, GDP, ...) I use logarithms....

Piotr Migdal

5,896

asked Jul 22, 2015 at 11:18

0 votes

0 answers

283 views

How do "stacked" repeated samples influence our rules of thumb for minimum samples size in regression?

A professor in one of my graduate statistics courses once said, when briefly reviewing simple linear regression: "I would never EVER fit a line to fewer than 8-10 data points, it would make me feel......

Gavin M. Jones

127

asked Jul 20, 2015 at 3:54

8 votes

1 answer

20k views

Optimal number of bins in histogram by the Freedman–Diaconis rule: difference between theoretical rate and actual number

Wikipedia reports that under the Freedman and Diaconis rule, the optimal number of bins in an histogram, $k$ should grow as $$k\sim n^{1/3}$$ where $n$ is the sample size. However, If you look at ...

user603

23k

asked Mar 26, 2015 at 0:56

0 votes

3 answers

982 views

How strongly supported is a conclusion drawn from the correlation of 10 observations?

The Spearman’s rank correlation coefficient calculated for these observations is 0.826 (p<0.005). Is there something like a "rule of thumb" minimum number of observations?

Sultan Abraham

75

asked Aug 7, 2014 at 16:42

15 votes

2 answers

38k views

"When to use boxplot and when barplot" rules (of thumb?)

Both box-and-whisker plot and bar chart are appropriate graphics for ANOVA according to The R Book (Crawley, 2013), but which is more appropriate? I suppose it depends on situation... can anybody help ...

Ladislav Naďo

2,350

asked Feb 4, 2014 at 9:22

8 votes

3 answers

3k views

Is there a simple method to calculate the minimum number of datalines needed for a machine learning algorithm with x variables?

Here is the problem: I am making a machine learning algorithm that takes the inputs and outputs of some software I've written, and I don't know how many datalines to produce to get results that are a '...

FraserOfSmeg

653

asked Jan 2, 2014 at 21:59

3 votes

2 answers

255 views

Rule of thumb when drawing N samples from a discrete distribution with N possible values with replacement

I'm looking for an explanation and possibly the name of a rule of thumb: When drawing N samples with replacement from a discrete uniform distribution of N values, it is very likely that: 1/3 of the ...

Sentry

602

asked Dec 27, 2013 at 18:56

4 votes

0 answers

403 views

Intuition or explanation of "rule of thumb" for loss of degrees of freedom in chi-square for model selection/knot placement?

A document I read many years ago (to do with actuarial education; sorry I can't link to a copy) used chi-square tests to assess lack of fit of various models to mortality rates which were treated ...

Glen_b

291k

asked Dec 4, 2013 at 3:40

-1 votes

3 answers

3k views

Minimum items for questionnaire

Is there a minimum item for questionnaire? Some researchers said that the total items for teenagers ranging from 80-110 items

user30918

17

asked Oct 1, 2013 at 11:06

1 vote

0 answers

6k views

Min-max vs. standard deviation when plotting error bars

I am running some experiments in which I plot the values of some variable y (averaged over 4 runs) against some independent variable x. I also compare the effect of z. In other words: horizontal axis:...

maditya

205

asked Jun 18, 2013 at 2:02

11 votes

1 answer

9k views

Histogram with uniform vs non-uniform Bins

This question describes the basic difference between a uniform and a nonuniform histogram. And this question discusses the rule of thumb for picking the number of bins of a uniform histogram that ...

Alan Turing

233

asked Apr 10, 2013 at 22:52

3 votes

2 answers

465 views

Is Rule of Three inappropriate in some cases?

I'm building binomial proportion confidence intervals for a patient dataset containing the frequency of home nursing visits during the week prior to hospital admission. The freq. of home visit ...

RobertF

6,286

asked Mar 18, 2013 at 15:30

7 votes

0 answers

1k views

Rule of thumb - number of predictors - Poisson regression rates

I am interested in estimating a Poisson regression for mortality rates, with number of deaths as the dependent variable and log(population size) as the offset. I have 50 observations (states). I am ...

MP Sylvestre

71

asked Sep 25, 2012 at 20:46

3 votes

2 answers

2k views

Rules of Thumb to choose an initial number of class intervals and refine that choice (potentially automatically)

I was wondering if there are established rules of thumb (or algorithms) that, given a set of observations can help: choose an initial number of class intervals. refine that choice to a better number. ...

user1172468

2,055

asked Sep 24, 2012 at 6:26

Questions tagged [rule-of-thumb]

Related Tags