Skip to main content

Questions tagged [definition]

This tag indicates questions about definitions of statistical terms. Use a more general tag [terminology] for questions on statistical parlance that are not specifically about definitions.

Filter by
Sorted by
Tagged with
1 vote
1 answer
25 views

A problem with the definition of ergodicity in "Evaluating gambles using dynamics, O.Peters"

In the paper we are presented with a additive time stochastic process: $$ W_{t+ \delta t \times T} - W_t = \sum^T_{i=1} D_i $$ Where $W_t$ represents the wealth at time $t$ , $\delta t$ is the ...
lohe's user avatar
  • 51
2 votes
1 answer
111 views

definition of regular estimators

In the book "Semiparametric Theory and Missing Data" by Tsiatis, superefficient estimators are defined as "they are unnatural and have undesirable local properties associated with them&...
kara890's user avatar
  • 311
9 votes
3 answers
998 views

Is "categorical data" a synonym of "nominal data"?

So far, it's always been my understanding that nominal data was a type of categorical data, not a synonym of it. For me, categorical data included ordinal data, not just nominal data. As of November ...
J-J-J's user avatar
  • 5,848
2 votes
0 answers
46 views

What are the mathematical structures of nominal, ordinal, interval and ratio data? [duplicate]

In Michael Eysenck's fundamentals of psychology, it says Another factor to consider when deciding which statistical test to use is the type of data you have obtained. There are four types of data of ...
Tim's user avatar
  • 19.8k
2 votes
2 answers
81 views

Where should the $\leq$ go in the definition of the hazard function?

I have come across two definitions of the hazard function in several textbooks and online resource. Definition 1. Here for example. $$h(t) = \lim_{\epsilon \to 0+}\dfrac{P(t < T \leq t + \epsilon \...
Eden Hazard's user avatar
3 votes
1 answer
186 views

What does it mean to say that a statistical model is a location family?

I would like to know what it means to say that a statistical model is a location family. Looking at some definitions, I came up with three possible interpretations of the definition of location family....
rfloc's user avatar
  • 163
0 votes
0 answers
16 views

Can manual feature extraction be considered a part of a learning algorithm?

We can view a learning algorithm as a tuple $(\mathcal{H}, \mathcal{O}, \mathcal{L})$ where $\mathcal{H}$, $\mathcal{O}$ and $\mathcal{L}$ are the hypothesis class, optimizer and loss function ...
Antonios Sarikas's user avatar
1 vote
0 answers
47 views

Can you arbitrarily define your sample space?

Let's say you flip a fair coin. A sample space is a set of possible outcomes of an experiment. So, intuitively, the sample space will be {Head, Tail}. However, I have seen that it is also possible to ...
Sherlock_Hound's user avatar
1 vote
1 answer
125 views

Formal definition of sufficient statistic

Let $(\Omega_X,\mathcal{F}_X)$ and $(\Omega _T,\mathcal{F}_T)$ be measurable spaces. Let $\mathfrak{M}$ be a family of probability measures on $(\Omega_X,\mathcal{F}_X)$. Let $X:\Omega\to \Omega _X$ ...
rfloc's user avatar
  • 163
13 votes
11 answers
2k views

When is it important for a practitioner to understand CIs?

I have a physician friend who asks me questions about stats. He gets confused about stuff, e.g., the definition of a confidence interval (CI) and its intricacies. For example, he finds the following ...
Yair Daon's user avatar
  • 2,684
2 votes
2 answers
109 views

Fixed vs random: how can a fixed effect be constant across individuals?

For years, I have been trying to find a definition for fixed and random effects. I often see statements like this: Fixed effects are constant across individuals. - almost universal We define effects (...
opiczak's user avatar
  • 23
1 vote
1 answer
51 views

Motivation behind definition of PMF of function of $2$ variables

I am really curious to understand what motivates the definition $$p_{g(X,Y)} (g(X,Y)=z) = \sum_{(x,y)\in g^{-1}(\{z\})} p_{X,Y} (x,y)$$ where $g$ is a two variable function, and $X,Y$ are random ...
niobium's user avatar
  • 189
6 votes
4 answers
387 views

Frequentist probability: Can we prove mathematically what we are setting probabilities equal to, or are they just assumptions/definitions?

Just as an example, let's say I am modeling the rolling of a die. We can use the frequentist definition of probability to define a probability of an event, say rolling a 6, as the $\lim_{n\to\infty}$$\...
1 vote
1 answer
65 views

Unbiased estimator for mean

The question: Given a random sample $X_1,...,X_n$ show that $\frac{1}{n}\sum_{i=1}^n X_i$ is an unbiased estimator for $E(X_1)$. My confusion: Given a statistical model $(\Omega,\Sigma,p_{\theta})$, ...
user124910's user avatar
0 votes
0 answers
29 views

Kendall's Notation for Kelly Networks?

Background Kendall's notation (also see Ciw docs') is a convenience when you work with queues a lot. It not only provides an abbreviation, but its wide use helps make literature searches more specific....
Galen's user avatar
  • 9,680
1 vote
1 answer
83 views

Finding a source for the definition of "clustering accuracy"

In papers about unsupervised clustering I see a lot of references to a metric "clustering accuracy" or "unsupervised clustering accuracy" (ACC) which is usually defined as ...
Cyo's user avatar
  • 11
0 votes
0 answers
20 views

Example of statistical experiment in which the statistical unit is time

Could you give me some examples of statistical experiment/analysis in which the statistical unit is the time? If I conduct a study on the number of people who have attended a bachelor degree over time,...
Elia's user avatar
  • 1
2 votes
1 answer
90 views

Definition of $\text{“}R^2\text{”}$

I have always taken $\text{“}R^2\text{”}$ to mean the proportion of a sum of squares explained by a model. The context in which this idea is first encountered is when one explains part of the total ...
Michael Hardy's user avatar
1 vote
2 answers
196 views

Definition of Uncertainity

I have some confusion regarding Measurement Uncertainty. In some books/articles it is defined wrt true value as "Uncertainty in the average of measurements is the range in which true value is ...
Govind Prajapat's user avatar
15 votes
4 answers
1k views

Do testability and falsifiability have statistical definitions?

Psychology: the Core Concepts says Psychology differs from the pseudosciences in that it employs the scientific method to test its ideas empirically. The scientific method relies on ...
Tim's user avatar
  • 19.8k
2 votes
0 answers
158 views

Definition of exponent measure (extreme value theory)

Let $F$ be a distribution function on $\mathbb{R}^2$, and let $U_i$ be the left continuous inverse of $\frac{1}{1-F_i}$, where $F_i$ is the marginal distribution of $F$. In my textbook, there is the ...
Phil's user avatar
  • 656
10 votes
3 answers
964 views

For a confusion matrix, is there a name for FP / (FP + FN)?

For a confusion matrix, there are a variety of useful rates, ratios and indices. But I cannot find the one I care about: FP / (FP + FN) Of course this measure is ...
David Bridgeland's user avatar
2 votes
2 answers
269 views

What is a formal and authoritative definition of an 'assumption' in a statistical model?

The description of the tag in this website states that it Refers to the conditions under which a statistics procedure yields valid estimates and/or inference. E.g., many statistical techniques ...
Kuku's user avatar
  • 1,592
11 votes
2 answers
360 views

Definition of "unpredictable"

How do we rigorously define the term "unpredictable" in cases of point and density prediction? The term "unpredictable" is employed in various contexts, e.g. "the outcome of ...
Richard Hardy's user avatar
0 votes
0 answers
46 views

Clarification: confidence interval of the (slope of the) regression line

When confidence intervals are referenced in the regression context, I often see them mentioned (generally speaking) as "of the slope of the regression line," or as "of the regression ...
anumberofthem's user avatar
10 votes
7 answers
2k views

Data vs. Information. What’s the difference between these two terms?

Following are the definitions "representative" of the countless articles I have read about data vs information and yet have nothing to say about them. Can anyone just put some accent upon ...
Vinay Sharma's user avatar
1 vote
0 answers
40 views

Definition of Statistics by Sheldon M. Ross [closed]

Sheldon M. Ross in his book, "Introductory Statistics" states the definition of statistics to be: "Statistics is the art of learning from data. It is concerned with the collection of ...
TankFishToad's user avatar
1 vote
2 answers
282 views

meaning of independent samples

I am getting confused about the concept of independent samples, which is related to the concept of pseudoreplications. This question is related to this topic. But reading it does not make it clear for ...
quibble's user avatar
  • 1,704
1 vote
1 answer
2k views

The link between AQL, RQL and the reliability of a process

The problem is to distinguish between the definitions of AQL, RQL and reliability in the field of quality control. Let us just pose the context. We wish to calculate the reliability of a manufacturing ...
lulufofo's user avatar
  • 472
1 vote
1 answer
71 views

Questions on what it means when we talk about "overfit"

"Overfit" is a commonly discussed concept in ML community. However, I tend to feel that there might be abuse of using this terminology. I wonder what it means when we talk about overfit, ...
mw19930312's user avatar
13 votes
3 answers
2k views

What's the relationship between these two definitions of martingales?

On wikipedia, the definition of a martingale is given as follows: A basic definition of a discrete-time martingale is a discrete-time stochastic process (i.e., a sequence of random variables) $X_1, ...
ExcitedSnail's user avatar
  • 3,050
5 votes
1 answer
246 views

How to demonstrate amount of virus required to lyse all cells using Poisson distribution?

In a seminal article in virology ('Sur l’unité lytique du bactériophage' Comptes rendus des séances de la Societé de biologie et de ses filiales, 1939, 130, pp. 904-907) the Nobel prize winner ...
Gigiux's user avatar
  • 211
1 vote
0 answers
64 views

Definition p-value and find p-value in practice

I have a problem that I can't solution. Let $\mathbf{X}=\{X_1,X_2,\ldots,X_n\}\sim\mathrm{Uniform}(0,\theta)$ and we have $H_0:\theta=\theta_0$ and $H_1:\theta>\theta_0$. We reject the $H_0$ when $...
Samvel Safaryan's user avatar
0 votes
0 answers
38 views

marginal cdf and cdf of marginals

given a joint distribution of 2 variables $P(X,Y)$, is the cdf of the Y-marginal distribution equals to the Y-marginal of the cdf of $P(X,Y)$?
Alucard's user avatar
  • 325
2 votes
0 answers
86 views

Why are Square Integrable Functions important in Statistics?

I'm reading a paper by Giles Hooker on Functional Decomposition through the use of Functional ANOVA. In the paper he defines a function: $$ F(x) : \mathbb{R}^k \rightarrow \mathbb{R} $$ and explicitly ...
Connor's user avatar
  • 667
1 vote
0 answers
164 views

Is there a connection between the Kernels in Statistics and Linear Algebra?

According to this question, the etymology of the terms is related, and Kernel is used to mean the "core" of something. In general it seems to refer to an unchanging transformation at the ...
Connor's user avatar
  • 667
3 votes
1 answer
1k views

What are balanced and orthogonal designed experiments?

I have recently started studying Designed Experiments and have have come across non-rigorous definitions of "balanced" and "orthogonal" experiments and would be interested in ...
FD_bfa's user avatar
  • 243
2 votes
1 answer
653 views

Relationship between blocks, factors and treatments

I have recently began studying a course on Designed Experiments and am having some trouble understanding some of the terminology. I've looked at some other answers on the site and I think that I am ...
FD_bfa's user avatar
  • 243
4 votes
0 answers
554 views

Is there a difference between out-of-domain and out-of-distribution?

When I research papers on the generalisability of ML models, both terms "Out of Distribution" and "Out of Domain" pop up. Are these essentially the same? In my understanding, yes. ...
amsulic's user avatar
  • 61
7 votes
1 answer
210 views

Can we consider the loadings as a proxy for correlation, in a Principal Component Analysis (PCA)?

In a PCA, the loadings can be understood as the weights for each original variable when calculating the principal component, or "how much each variable influence a principal component". Thus,...
Idervas's user avatar
  • 107
1 vote
1 answer
103 views

Questions regarding power of test and type II error

I´m preparing for a lecture in decision theory and I´m a little bit confused by the notation used by my prof. On the first slide under remark 3.2 point v) its written, that $\beta(\varphi)$ is equal ...
this_is_not_easy's user avatar
3 votes
2 answers
986 views

What is independent censoring and what are its assumptions?

I'm reading the Introduction to Statistical Learning book. In chapter 11 (Survival Analysis, page 463), the authors state: In general, we need to assume that the censoring mechanism is independent: ...
Juan's user avatar
  • 57
5 votes
1 answer
1k views

Bounded density function: definition?

What is the correct definition of bounded probability density function: $\sup_{x} f(x)<\infty$. If this is the correct definition of bounded probability density function, can you give the example ...
Star's user avatar
  • 935
0 votes
0 answers
55 views

Understanding likelihood vs conditional joint pdf

My course notes (3rd-year module in Bayesian Statistics, unpublished) have a paragraph, Gaussian data with known variance Suppose we have $\textbf{x}=\{x_1,\dots,x_n\}$ iid given $\theta$ and $\...
mjc's user avatar
  • 599
13 votes
3 answers
1k views

Rationale behind defining distribution function with strict inequality

A friend from Italy pointed me to a nice Probability Theory book written in Italian by G. Dall'Aglio, a well-known Italian probabilist; Original title: Calcolo delle Probabilità, terza edizione, ...
utobi's user avatar
  • 12.1k
2 votes
1 answer
137 views

Are random variables mathematical functions when dealing with continuous sample spaces?

I'm struggling with some basic concepts regarding the definitions of random variables. If I'm not mistaken, random variables are functions which aim to "translate" outcomes of a sample space ...
Idervas's user avatar
  • 107
0 votes
0 answers
73 views

Definition of Hypothesis Testing in All of Statistics

In the book All of Statistics by Larry Wesserman, the author says "Let $X$ be a random variable and let $\mathcal{X}$ be the range of $X$. We test a hypothesis by finding an appropriate subset of ...
user159566's user avatar
7 votes
7 answers
3k views

Regression vs. Classification: Is there a clear, generally accepted definition?

As a mathematician/economist, I am not trained to think in classification and regression tasks. This is why I wonder: is there a clear, widely accepted definition of regression and classification ...
Arne Jonas Warnke's user avatar
1 vote
0 answers
39 views

Provide an example of a dataset where maximum likelihood is inapplicable as third moments and fourth moments "assumptions" do not apply

An additional complication arises with estimation, since maximum likelihood estimation may not be feasible without making unrealistically strong ?????"assumptions"????? about third‐ and ...
user avatar
31 votes
5 answers
5k views

What is the difference between something being "true" and 'true with probability 1"?

In the beginning of chapter 2 of Information Theory, Inference and Algorithms, the author says that he will refrain from being unnecessarily rigorous and provides the example of saying that something ...
stochasticmrfox's user avatar

1
2 3 4 5
12