Skip to main content

Questions tagged [survey]

An instrument used to collect a sample from a population. Surveying often refers to sampling of human populations and is primarily done by administering questionnaires or interviewing individuals.

Filter by
Sorted by
Tagged with
0 votes
0 answers
17 views

Can I use a series of linear regressions to perform post-hoc tests on survey data in R following a comparison of means for 3+ variables?

Thomas Lumley explained within this StackOverflow post that the survey package's regTermTest() function can be used to compare ...
KBurchfiel's user avatar
1 vote
0 answers
35 views

How to compare two likert scale surveys by independent respondents?

I need to compare the ratings of two (multi item) scales. The catch is: the two scales, though being the same scale, are two different sets of experts. Basic idea: do they agree or do not agree on the ...
Marcel's user avatar
  • 11
2 votes
1 answer
30 views

Alternative to R-squared when calculating glm model using survey package

I'm writing my master thesis using the European Social Survey. The data requires weighting, so therefore I have to use the survey package and its ...
Moritary 's user avatar
0 votes
0 answers
53 views

Am I going about statistical significance testing for weighted categorical survey data the right way?

I created a (fictional) dataset that contains individuals' car colors and their level of agreement to the question "I enjoy driving fast". The survey includes three car colors (red, black, ...
TFXB's user avatar
  • 1
3 votes
1 answer
58 views

Would withholding marks until students respond to a survey bias the responses?

My university is running an anonymous survey, mostly to check if we understand how we are going to be assessed, if we are comfortable with the material, and if we find the material well organised. ...
Porter's user avatar
  • 33
1 vote
0 answers
45 views

Test (quantify) the association between categorical and numerical variable in survey data

I've just started working with survey data and I want to test independence between a numerical variable and a categorical one. I've heard of weighted ANOVA, but how can I test the normality and ...
Benco Myo's user avatar
2 votes
1 answer
47 views

Repeated Measures ANOVA/Linear Mixed Effects Model and Missing Data

I'm conducting a study measuring happiness across 4 time points, aiming to determine if there's an increase in overall happiness. The required sample size is 24 for four time points and 28 for three. ...
anna eyre's user avatar
  • 111
1 vote
0 answers
13 views

Calculating population estimators when number of units sampled in a stratum is $\leq$ 1

A survey of 100 houses across 20 states in a country is conducted with each state being used as a stratum. To find the population mean I know i need to use the formula $$\bar{y}_{str}=\sum^{20}_{h=1} \...
confusedstudent's user avatar
1 vote
1 answer
25 views

Cross-sectional analysis combining multiple years of complex survey data

Many large health surveys such as the Survey of Health, Ageing and Retirement in Europe have a longitudinal complex survey design with multiple years of data collection available. I am aware that a ...
Mark O'Donovan's user avatar
0 votes
0 answers
17 views

FIML in 2-level svyglm / svylm?

I am learning data analysis in R so please let me know if this is a weird question. I am analyzing an complex survey data using the survey package. I would also want my model to be 2-level, meaning ...
Sarah Zhang's user avatar
0 votes
0 answers
15 views

How to Calculate Cumulative Incidence for Each Case ID in Competing Risks Analysis Using R?

I’m working with a dataset in R and trying to calculate the cumulative incidence for each case ID in the presence of competing risks. My dataset looks like this: ...
Ali Roghani's user avatar
1 vote
1 answer
56 views

Is it possible for the residual variance in a model to be greater than the total variance of the variable being modeled?

I've fitted a linear regression in R with svyglm from the survey package. The data is weighted, and the model uses a ...
edstatsuser's user avatar
0 votes
0 answers
22 views

Follow up question on the proper use of subset in complex survey designs

I read and tried to understand the various contributions on the proper subsetting in complex survey design, including: Proper subsetting of survey data When is subsetting survey data prior to ...
Pxu80's user avatar
  • 1
2 votes
2 answers
82 views

Different AIC in R survey package vs. R stats package

I get different AIC values when using svyglm()$aic as opposed to using stats::AIC(model). I understand that topic has been ...
mirei's user avatar
  • 85
0 votes
0 answers
12 views

Pooling survey data with panel households

I am analyzing some data from a national survey. I am working with two sets of data for the same variable (wealth) and some of those households are panel (i.e. interviewed both times). Now I want to ...
Tom's user avatar
  • 1
1 vote
2 answers
37 views

Mixed model vs. imputation for questionnaire scale score?

I would like to fit a statistical model where the dependent (response) variable is a validated scale score from a questionnaire. For each subject, this dependent variable is calculated from the values ...
user167591's user avatar
4 votes
1 answer
95 views

Can regression coefficients be bootstrapped when sampling weights must be used?

I have non-complex survey data - it has sample weights but nothing else about the survey design to be taken into account. I want to get bootstrap distributions for regression coefficients, using the ...
Amorphia's user avatar
  • 997
0 votes
0 answers
40 views

Can I conduct a repeated measures ANOVA if there is unequal spacing between measurement times?

I'm planning a longitudinal survey study with a small samples (~20-25), trying to determine the best schedule of measurements to make analysis go smoothly down the line. My initial plan: Pre-post ...
Jenn's user avatar
  • 1
3 votes
3 answers
169 views

Survival analysis for survey (case-control?) data

I have a dataset comprising approximately 500 patients with about 10 different diseases, potentially with correlated outcomes, and 200 healthy controls. Patient data is sourced from the hospital, ...
processing_statistician's user avatar
1 vote
0 answers
18 views

Handling Missing Groups in Stratified Sampling for Weighted Mean Calculation

I conducted a statistical survey using a stratified sample to measure the knowledge of Italian students on a specific topic. The population was stratified according to the following categories: Area (...
Erik De Luca's user avatar
1 vote
1 answer
38 views

Survey-adjusted standard errors for a compound annual growth rate

I am trying to calculate a standard error for a compound annual growth rate. Both the begin and final estimate are from a complex survey. I am using the following formula to calculate the growth rate:...
MatthewR's user avatar
  • 113
0 votes
1 answer
23 views

Defining a variable based on two different variables with different weighting schemes in NHANES

I want to define a derived variable (Let's call it $Z$) using two original variables ($X$ and $Y$) in NHANES. If any of $X$ or $Y$ meets the criteria, the value for $Z$ should be 1; otherwise, it ...
Abdullah Abdelaziz's user avatar
1 vote
1 answer
52 views

Applying weights instead of matching

I want to conduct a Difference-in-Differences (DiD) design study. I have two groups (which you can see in the sector_eu_without variable) that I want to keep equal regarding certain variables: sex, ...
Ronald's user avatar
  • 21
0 votes
0 answers
39 views

How to determine correct correlation structure for GEE model?

I want to use Generalized Estimating Equations (GEE) to model longitudinal survey data. Participants' responses were recorded at two separate treatment sites at three timepoints, but the exact days ...
Erb's user avatar
  • 1
0 votes
0 answers
6 views

What are the best books about human rating and feedback systems?

I'm studying evaluation data collection and rating evaluation of LLM models using platforms of human raters like Surge or Scale AI. I'm also studying how we can use survey methods of users of LLMs to ...
Estimate the estimators's user avatar
9 votes
3 answers
768 views

Is prescreening not detrimental for paid surveys?

Survey sites like Swagbucks have often a prescreening mode in which one is asked questions like your annual income, whether you own car or not. It is observed that most of the time if one selects ...
Splendid Digital Solutions's user avatar
0 votes
0 answers
26 views

Calculate model with two sets of survey weights for the same population

I am working with a double set of survey weights for the same population X. The survey is probabilistic, stratified, multi-staged. Respondents have to answer two sets of questions: questionnaire A and ...
YouLocalRUser's user avatar
0 votes
0 answers
12 views

confirmation using the right tests

Users were asked to pick the AGE GROUP, GENDER and EDUCATIONAL LEVEL each from 3 provided options. I want to check the connection between each of these variables and answers to the question where user ...
anonymours's user avatar
2 votes
1 answer
30 views

Removing subjects before creating the survey design object

I have a question regarding the example in this page https://asdfree.com/national-health-and-nutrition-examination-survey-nhanes.html I noticed that the author has subsetted the data before creating ...
Abdullah Abdelaziz's user avatar
1 vote
2 answers
288 views

Likert scale and sample size

I want to study the satisfaction of a mother receiving paper or website information. I will have at least 5 questions, with Likert scale (5 points) each time. For the sample size: Should I calculate ...
F cachat's user avatar
2 votes
1 answer
62 views

Minimum Survey Respondents for Hypotheses I want to answer

I am an eager statistics noob, so apologies on the basic questions. I am looking to send out a survey to a population that is large (people with diabetes who take insulin in the US) (~ 8.4 million) ...
Christine's user avatar
0 votes
0 answers
8 views

Can you build up statistical validity with multiple month's worth of the same survey questions?

The company I work for conduct consulting where we analyse company survey responses for statistical validity against the general company population. To prevent survey fatigue, the company sends out ...
Will F's user avatar
  • 1
0 votes
1 answer
35 views

Sample size for survey

My interest is to perform a statistically significant survey on a population of 1700 people, that can be described in different categories, so each person belongs to only one category. I have two ...
user avatar
1 vote
0 answers
54 views

Multi-level Model and Multi-level Data

I have a question about multi-level models with multi-level survey data. I am working with survey data that has a two-stage sampling design with primary sampling units defined as schools randomly ...
UT_Max's user avatar
  • 11
3 votes
1 answer
96 views

What is the R Equivalent of this Stata Survey Weighting Code?

I am working with the SIPP data set and they use replicate weights. They offer advice here for what code you should use in Stata or SAS for applying their replicate weights correctly. Specifically ...
Daniel's user avatar
  • 117
1 vote
1 answer
144 views

Power Analysis for Multiple Treatments per Respondent in Survey Experiment

I am running a vignette survey experiment with 6 total treatment options (5 treatments, 1 control), where each respondent randomly receives 2 of the treatment options and answers the same set of ...
flâneur's user avatar
0 votes
0 answers
39 views

Which Analysis to Use for 2 Likert Scale Questionnaire Groups With Multiple Responses

I have a data set that I acquired through running a questionnaire. The questionnaire consisted of participants having to rate 80 different phrases using a 5-point Likert scale; ranging from Strongly ...
Barry's user avatar
  • 1
0 votes
0 answers
13 views

Whether to specify ddf in confint.svyglm()

I have a main question about whether to specify ddf in confint.svyglm(). Using the two specifications below generate slightly ...
Guoqiang Zhang's user avatar
0 votes
0 answers
29 views

Weighing Data Issue

I am looking at e-cig prevalence within a city. I used surveys to collect data from residents, and I have a query around weighing data. I have made the assumption, due to over and underrepresentation ...
Aidan's user avatar
  • 1
0 votes
1 answer
41 views

Within subject experiments done by (some) Psychologists

Thanks in advance for any responses or resources you can link me to. I've had more of an Econ/Stats training and recently I've been working with some people who have a Psych background at work. They ...
DirichletC's user avatar
1 vote
0 answers
69 views

Predicting current area-level counts from repeated cross sectional presence/absence surveys

Problem statement I’m trying to predict the “current” distribution of wood-burning fireplaces at ZIP code level across 9 California counties based on 15 years of surveys with presence/absence data on ...
dholstius's user avatar
  • 101
0 votes
0 answers
47 views

Association between individual treatment & covariates and aggregated survey outcome (per 1 covariate only)

I would like to assess whether employees who perform training (Treatment T) have a better perception of growth opportunities (Outcome Y) in a company given covariates X such as: Gender, Age range Sub ...
Blueheaven's user avatar
0 votes
0 answers
32 views

Hypothesis testing on total surveys

How should one think about hypothesis-testing ratio scale variables that do not have a distribution resembling a normal/t-distribution, when the data comes from a total survey? Some background: I have ...
MalinU's user avatar
  • 1
2 votes
1 answer
62 views

PCA with correlated variables

I'm analyzing data from around 10 survey questions focused on regulatory issues. I've noticed these questions are highly correlated (of course since they are all about regulation), and I'm concerned ...
chunguc1004's user avatar
0 votes
0 answers
20 views

Standardizing variables from complex surveys

I have several continuous variables from a survey that I want to analyze. I will use survey weights for regression. But I want to standardize those variables first. Should I use the weighted means ...
user410323's user avatar
0 votes
0 answers
45 views

Does survey R package allow me to do beta regression?

I have a complex survey dataset with a response (dependent variable) bounded between 0 and 1, where I have applied multiple imputation to the dataset to account for missing data. The response formally ...
user45765's user avatar
  • 1,465
1 vote
1 answer
43 views

Specifying panel data as survey object

I have a panel dataset of household characteristics (income, expenditure, asset ownership etc.) collected in different waves (30 waves in total). The survey data is stratified, with primary sampling ...
Afsal Najeeb's user avatar
0 votes
0 answers
17 views

Difference in margin of error for boostrap and parametric approaches in survey

I'd like to know if using a bootstrap-derived margin-of-error for a simple survey is appropriate. I'm worried that the estimate is too small. I've pasted the code below for both approaches for context....
Statfan's user avatar
  • 165
1 vote
1 answer
23 views

Checking covariate balance in survey response

In survey adjustment for nonresponse, a model-based approach can model response propensity on a variable where each observation is a respondent or nonrespondent from the full sample. An approach to ...
four77's user avatar
  • 111
1 vote
1 answer
43 views

Identifying an outlier nominal group different than the norm, based on an ordinal dependent variable

As part of my graduate thesis, I've measured the satisfaction levels of a couple hundred respondents across 7 districts of a city about a local public service, using a 4-point ordinal Likert scale (...
ToastedHambugerBun's user avatar

1
2 3 4 5
25