User Guide
Everything you need to know about NHANES analysis
Getting Started
NHANES to Lancet automates the process of analyzing NHANES data for epidemiological research.
Step 1: Choose Your Research Topic
Select a phenotype (Obesity, Diabetes, CVD, etc.) to auto-populate recommended variables, or manually enter NHANES variable codes.
Step 2: Configure Analysis
Select survey cycles (2017-2018 recommended), analysis type, and optionally upload a Word research proposal.
Step 3: Review Results
Review tables, figures, and generated manuscript. Download the complete results as a ZIP file.
Survey Weights Explained
NHANES uses a complex, multistage probability sampling design. Valid analyses MUST use:
- Survey weights (WTMEC2YR for exam data) — account for selection probabilities and non-response
- PSU (SDMVPSU) — primary sampling units
- Strata (SDMVSTRA) — sampling strata
Analysis Methods
Survey-Weighted Logistic Regression
For binary outcomes (e.g., hypertension yes/no). Produces adjusted odds ratios (OR) with 95% CI.
Kaplan-Meier Survival Analysis
Estimates survival functions over time. Generates survival curves with log-rank test.
Cox Proportional Hazards
Evaluates multiple variables on time-to-event outcomes. Produces hazard ratios (HR).
Fine-Gray Competing Risk
When competing events exist (e.g., CVD death vs non-CVD death). Produces sub-hazard ratios.
Interpreting Results
| Measure | Interpretation |
|---|---|
| OR > 1 | Higher odds of outcome per unit increase in exposure |
| OR < 1 | Lower odds of outcome |
| HR > 1 | Higher hazard (risk) of event |
| P < 0.05 | Statistically significant at alpha = 0.05 |
| 95% CI excludes 1 | Significant at alpha = 0.05 |
STROBE Checklist
All generated papers follow STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) guidelines.
- Title and abstract indicate study design
- Structured abstract with Background, Methods, Findings, Interpretation
- Scientific background and rationale
- Eligibility criteria and data sources
- Statistical methods with survey weight handling
- Baseline characteristics table
- Main results with effect sizes and confidence intervals
- Limitations and generalizability discussion