Questions tagged [survey]
An instrument used to collect a sample from a population. Surveying often refers to sampling of human populations and is primarily done by administering questionnaires or interviewing individuals.
1,219 questions
0
votes
0
answers
17
views
Can I use a series of linear regressions to perform post-hoc tests on survey data in R following a comparison of means for 3+ variables?
Thomas Lumley explained within this StackOverflow post that the survey package's regTermTest() function can be used to compare ...
1
vote
0
answers
35
views
How to compare two likert scale surveys by independent respondents?
I need to compare the ratings on two (multi-item) scales. The catch is that the two sets of ratings, although they use the same scale, come from two different sets of experts.
Basic idea: do they agree or disagree on the ...
2
votes
1
answer
30
views
Alternative to R-squared when calculating glm model using survey package
I'm writing my master's thesis using the European Social Survey. The data requires weighting, so I have to use the survey package and its ...
0
votes
0
answers
53
views
Am I going about statistical significance testing for weighted categorical survey data the right way?
I created a (fictional) dataset that contains individuals' car colors and their level of agreement with the statement "I enjoy driving fast". The survey includes three car colors (red, black, ...
3
votes
1
answer
58
views
Would withholding marks until students respond to a survey bias the responses?
My university is running an anonymous survey, mostly to check if we understand how we are going to be assessed, if we are comfortable with the material, and if we find the material well organised. ...
1
vote
0
answers
45
views
Test (quantify) the association between categorical and numerical variable in survey data
I've just started working with survey data and I want to test independence between a numerical variable and a categorical one.
I've heard of weighted ANOVA, but how can I test the normality and ...
2
votes
1
answer
47
views
Repeated Measures ANOVA/Linear Mixed Effects Model and Missing Data
I'm conducting a study measuring happiness across 4 time points, aiming to determine if there's an increase in overall happiness. The required sample size is 24 for four time points and 28 for three. ...
1
vote
0
answers
13
views
Calculating population estimators when number of units sampled in a stratum is $\leq$ 1
A survey of 100 houses across 20 states in a country is conducted, with each state being used as a stratum. To find the population mean, I know I need to use the formula
$$\bar{y}_{str}=\sum^{20}_{h=1} \...
1
vote
1
answer
25
views
Cross-sectional analysis combining multiple years of complex survey data
Many large health surveys such as the Survey of Health, Ageing and Retirement in Europe have a longitudinal complex survey design with multiple years of data collection available. I am aware that a ...
0
votes
0
answers
17
views
FIML in 2-level svyglm / svylm?
I am learning data analysis in R so please let me know if this is a weird question.
I am analyzing complex survey data using the survey package. I would also like my model to be 2-level, meaning ...
0
votes
0
answers
15
views
How to Calculate Cumulative Incidence for Each Case ID in Competing Risks Analysis Using R?
I’m working with a dataset in R and trying to calculate the cumulative incidence for each case ID in the presence of competing risks. My dataset looks like this:
...
1
vote
1
answer
56
views
Is it possible for the residual variance in a model to be greater than the total variance of the variable being modeled?
I've fitted a linear regression in R with svyglm from the survey package. The data is weighted, and the model uses a ...
0
votes
0
answers
22
views
Follow up question on the proper use of subset in complex survey designs
I read and tried to understand the various contributions on the proper subsetting in complex survey design, including:
Proper subsetting of survey data
When is subsetting survey data prior to ...
2
votes
2
answers
82
views
Different AIC in R survey package vs. R stats package
I get different AIC values when using svyglm()$aic as opposed to using stats::AIC(model). I understand that this topic has been ...
0
votes
0
answers
12
views
Pooling survey data with panel households
I am analyzing some data from a national survey. I am working with two sets of data for the same variable (wealth), and some of those households are panel households (i.e. interviewed both times). Now I want to ...
1
vote
2
answers
37
views
Mixed model vs. imputation for questionnaire scale score?
I would like to fit a statistical model where the dependent (response) variable is a validated scale score from a questionnaire. For each subject, this dependent variable is calculated from the values ...
4
votes
1
answer
95
views
Can regression coefficients be bootstrapped when sampling weights must be used?
I have non-complex survey data - it has sample weights but nothing else about the survey design to be taken into account. I want to get bootstrap distributions for regression coefficients, using the ...
0
votes
0
answers
40
views
Can I conduct a repeated measures ANOVA if there is unequal spacing between measurement times?
I'm planning a longitudinal survey study with a small sample (~20-25), trying to determine the best schedule of measurements to make analysis go smoothly down the line.
My initial plan: Pre-post ...
3
votes
3
answers
169
views
Survival analysis for survey (case-control?) data
I have a dataset comprising approximately 500 patients with about 10 different diseases, potentially with correlated outcomes, and 200 healthy controls. Patient data is sourced from the hospital, ...
1
vote
0
answers
18
views
Handling Missing Groups in Stratified Sampling for Weighted Mean Calculation
I conducted a statistical survey using a stratified sample to measure the knowledge of Italian students on a specific topic. The population was stratified according to the following categories: Area (...
1
vote
1
answer
38
views
Survey-adjusted standard errors for a compound annual growth rate
I am trying to calculate a standard error for a compound annual growth rate. Both the initial and final estimates come from a complex survey. I am using the following formula to calculate the growth rate:...
0
votes
1
answer
23
views
Defining a variable based on two different variables with different weighting schemes in NHANES
I want to define a derived variable (let's call it $Z$) using two original variables ($X$ and $Y$) in NHANES. If either $X$ or $Y$ meets the criteria, the value for $Z$ should be 1; otherwise, it ...
1
vote
1
answer
52
views
Applying weights instead of matching
I want to conduct a Difference-in-Differences (DiD) design study. I have two groups (which you can see in the sector_eu_without variable) that I want to keep equal regarding certain variables: sex, ...
0
votes
0
answers
39
views
How to determine correct correlation structure for GEE model?
I want to use Generalized Estimating Equations (GEE) to model longitudinal survey data. Participants' responses were recorded at two separate treatment sites at three timepoints, but the exact days ...
0
votes
0
answers
6
views
What are the best books about human rating and feedback systems?
I'm studying evaluation data collection and rating evaluation of LLM models using platforms of human raters like Surge or Scale AI. I'm also studying how we can use survey methods of users of LLMs to ...
9
votes
3
answers
768
views
Is prescreening not detrimental for paid surveys?
Survey sites like Swagbucks often have a prescreening mode in which one is asked questions such as one's annual income or whether one owns a car. It is observed that most of the time if one selects ...
0
votes
0
answers
26
views
Calculate model with two sets of survey weights for the same population
I am working with two sets of survey weights for the same population X. The survey is probabilistic, stratified, and multi-stage. Respondents have to answer two sets of questions: questionnaire A and ...
0
votes
0
answers
12
views
Confirmation using the right tests
Users were asked to pick their AGE GROUP, GENDER and EDUCATIONAL LEVEL, each from 3 provided options. I want to check the connection between each of these variables and the answers to the question where the user ...
2
votes
1
answer
30
views
Removing subjects before creating the survey design object
I have a question regarding the example on this page
https://asdfree.com/national-health-and-nutrition-examination-survey-nhanes.html
I noticed that the author has subsetted the data before creating ...
1
vote
2
answers
288
views
Likert scale and sample size
I want to study the satisfaction of mothers receiving paper or website information. I will have at least 5 questions, each on a 5-point Likert scale.
For the sample size:
Should I calculate ...
2
votes
1
answer
62
views
Minimum Survey Respondents for Hypotheses I want to answer
I am an eager statistics noob, so apologies on the basic questions.
I am looking to send out a survey to a large population (people with diabetes who take insulin in the US, ~8.4 million)
...
0
votes
0
answers
8
views
Can you build up statistical validity with multiple months' worth of the same survey questions?
The company I work for does consulting in which we analyse company survey responses for statistical validity against the general company population.
To prevent survey fatigue, the company sends out ...
0
votes
1
answer
35
views
Sample size for survey
I want to perform a statistically significant survey on a population of 1700 people who can be divided into different categories, with each person belonging to only one category.
I have two ...
1
vote
0
answers
54
views
Multi-level Model and Multi-level Data
I have a question about multi-level models with multi-level survey data. I am working with survey data that has a two-stage sampling design with primary sampling units defined as schools randomly ...
3
votes
1
answer
96
views
What is the R Equivalent of this Stata Survey Weighting Code?
I am working with the SIPP data set and they use replicate weights. They offer advice here for what code you should use in Stata or SAS for applying their replicate weights correctly. Specifically ...
1
vote
1
answer
144
views
Power Analysis for Multiple Treatments per Respondent in Survey Experiment
I am running a vignette survey experiment with 6 total treatment options (5 treatments, 1 control), where each respondent randomly receives 2 of the treatment options and answers the same set of ...
0
votes
0
answers
39
views
Which Analysis to Use for 2 Likert Scale Questionnaire Groups With Multiple Responses
I have a data set that I acquired by running a questionnaire. The questionnaire required participants to rate 80 different phrases on a 5-point Likert scale, ranging from Strongly ...
0
votes
0
answers
13
views
Whether to specify ddf in confint.svyglm()
I have a main question about whether to specify ddf in confint.svyglm(). The two specifications below generate slightly ...
0
votes
0
answers
29
views
Weighting Data Issue
I am looking at e-cig prevalence within a city. I used surveys to collect data from residents, and I have a query about weighting data.
I have made the assumption, due to over- and underrepresentation ...
0
votes
1
answer
41
views
Within subject experiments done by (some) Psychologists
Thanks in advance for any responses or resources you can link me to.
My training is more in Econ/Stats, and recently I've been working with some people at work who have a Psych background.
They ...
1
vote
0
answers
69
views
Predicting current area-level counts from repeated cross sectional presence/absence surveys
Problem statement
I’m trying to predict the “current” distribution of wood-burning fireplaces at ZIP code level across 9 California counties based on 15 years of surveys with presence/absence data on ...
0
votes
0
answers
47
views
Association between individual treatment & covariates and aggregated survey outcome (per 1 covariate only)
I would like to assess whether employees who perform training (Treatment T) have a better perception of growth opportunities (Outcome Y) in a company given covariates X such as:
Gender, Age range
Sub ...
0
votes
0
answers
32
views
Hypothesis testing on total surveys
How should one think about hypothesis-testing ratio scale variables that do not have a distribution resembling a normal/t-distribution, when the data comes from a total survey?
Some background:
I have ...
2
votes
1
answer
62
views
PCA with correlated variables
I'm analyzing data from around 10 survey questions focused on regulatory issues. I've noticed these questions are highly correlated (of course since they are all about regulation), and I'm concerned ...
0
votes
0
answers
20
views
Standardizing variables from complex surveys
I have several continuous variables from a survey that I want to analyze. I will use survey weights for regression. But I want to standardize those variables first. Should I use the weighted means ...
0
votes
0
answers
45
views
Does survey R package allow me to do beta regression?
I have a complex survey dataset with a response (dependent variable) bounded between 0 and 1, where I have applied multiple imputation to the dataset to account for missing data. The response formally ...
1
vote
1
answer
43
views
Specifying panel data as survey object
I have a panel dataset of household characteristics (income, expenditure, asset ownership etc.) collected in different waves (30 waves in total). The survey data is stratified, with primary sampling ...
0
votes
0
answers
17
views
Difference in margin of error for bootstrap and parametric approaches in survey
I'd like to know if using a bootstrap-derived margin-of-error for a simple survey is appropriate. I'm worried that the estimate is too small. I've pasted the code below for both approaches for context....
1
vote
1
answer
23
views
Checking covariate balance in survey response
In survey adjustment for nonresponse, a model-based approach can model response propensity on a variable where each observation is a respondent or nonrespondent from the full sample.
An approach to ...
1
vote
1
answer
43
views
Identifying an outlier nominal group different than the norm, based on an ordinal dependent variable
As part of my graduate thesis, I've measured the satisfaction levels of a couple hundred respondents across 7 districts of a city about a local public service, using a 4-point ordinal Likert scale (...