Data Upload & Auto-Profiling
Upload your dataset and the system automatically detects variable types, missingness, and structural risks.
Drop your CSV or Excel file here, or click to browse (.csv, .xlsx)
trial_cohort_v3.csv · uploaded
| Rows | Columns | Event rate | Events |
|---|---|---|---|
| 842 | 8 | 12% | 101 |
Dataset Preview
First 8 rows · 842 total

| id | treatment_group | age | sex | baseline_score | site | comorbidity_index | survival_90d |
|---|---|---|---|---|---|---|---|
| 001 | A | 52 | M | 47.2 | S1 | 2 | 1 |
| 002 | B | 38 | F | 53.8 | S2 | 0 | 1 |
| 003 | A | 61 | M | 41.5 | S1 | 3 | 0 |
| 004 | B | 45 | F | 58.1 | S3 | 1 | 1 |
| 005 | A | 70 | M | 35.9 | S4 | 4 | 0 |
| 006 | B | 29 | F | 64.3 | S2 | 0 | 1 |
| 007 | A | 56 | M | 49.7 | S5 | 2 | 1 |
| 008 | B | 63 | F | 38.2 | S6 | 3 | 0 |
Auto-Profile Summary
Automatic
binary · survival_90d
categorical · treatment_group, sex, site
continuous · age, baseline_score
count · comorbidity_index
id · id
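
The type assignments above could be produced by simple rules over the raw cell values. The sketch below is a hypothetical heuristic, not the system's actual detector: the function name `infer_type`, the `MAX_COUNT_VALUE` cutoff, and the rule order are all assumptions chosen so that the preview columns land in the categories shown.

```python
from typing import List, Optional

# Assumed cutoff: non-negative integers no larger than this are treated as counts.
MAX_COUNT_VALUE = 20


def infer_type(name: str, values: List[Optional[str]]) -> str:
    """Guess a variable type from raw string cells (None or '' means missing)."""
    present = [v for v in values if v not in (None, "")]
    if not present:
        return "unknown"  # nothing observed, so nothing to infer
    # Identifier columns are recognised by name only in this sketch.
    if name.lower() == "id" or name.lower().endswith("_id"):
        return "id"
    # Exactly the two values 0 and 1 -> binary.
    if set(present) == {"0", "1"}:
        return "binary"
    try:
        numbers = [float(v) for v in present]
    except ValueError:
        return "categorical"  # at least one non-numeric cell
    # Small non-negative integers look like counts; everything else is continuous.
    if all(x.is_integer() and x >= 0 for x in numbers) and max(numbers) <= MAX_COUNT_VALUE:
        return "count"
    return "continuous"


# Columns taken from the 8-row preview.
preview = {
    "id": ["001", "002", "003", "004", "005", "006", "007", "008"],
    "treatment_group": ["A", "B", "A", "B", "A", "B", "A", "B"],
    "age": ["52", "38", "61", "45", "70", "29", "56", "63"],
    "baseline_score": ["47.2", "53.8", "41.5", "58.1", "35.9", "64.3", "49.7", "38.2"],
    "comorbidity_index": ["2", "0", "3", "1", "4", "0", "2", "3"],
    "survival_90d": ["1", "1", "0", "1", "0", "1", "1", "0"],
}
inferred = {col: infer_type(col, cells) for col, cells in preview.items()}
```

A rule set like this is fragile (an integer-valued score under 20 would be labelled a count), which is one reason to confirm the encodings manually before modelling.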
Missingness
id · ✓
treatment_group · ✓
age · 1.2%
sex · ✓
baseline_score · 3.6%
site · ✓
comorbidity_index · ✓
survival_90d · ✓
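
The percentages above are per-column shares of missing cells. A minimal sketch of that calculation follows, assuming that empty strings and `None` both count as missing and that values are rounded to one decimal place; the function name `missing_percent` is illustrative, not part of the system.

```python
from typing import List, Optional


def missing_percent(values: List[Optional[str]]) -> float:
    """Return the percentage of missing cells in one column, to one decimal."""
    if not values:
        raise ValueError("column has no rows")
    # Assumption: None and the empty string are the only missing markers.
    missing = sum(1 for v in values if v is None or v == "")
    return round(100.0 * missing / len(values), 1)


# Toy column with two of four cells missing.
toy_share = missing_percent(["52", "", "61", None])
```

Real files often use other missing markers (for example `NA` or `-99`), so the set of markers would need to be agreed before trusting the percentages.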
Risk Flags
⚠ LOW_EVENTS_PER_VARIABLE: With 101 events and 6 candidate predictors (including an 8-level site variable), events per variable (EPV) may be borderline for complex models. The system will calculate the exact events-per-parameter (EPP) ratio once you confirm variable encodings in step 2.
ℹ MISSINGNESS_DETECTED: 2 predictors have missing values (age 1.2%, baseline_score 3.6%). Multiple imputation is available as a sensitivity analysis option.
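
To make the EPV and EPP distinction in the first flag concrete, here is a rough sketch of both ratios. It rests on several assumptions: the predictors are the six non-identifier, non-outcome columns in the preview; treatment_group and sex have two levels each (only A/B and M/F appear in the preview); site has 8 levels, as the flag states; categorical variables are dummy-coded with one reference level; numeric predictors enter as single linear terms; and there are no interactions. The function names are illustrative, and the figure the system reports after step 2 may differ.

```python
from typing import Dict, List


def n_parameters(categorical_levels: Dict[str, int], numeric: List[str]) -> int:
    """Count model parameters, excluding the intercept.

    A categorical variable with k levels contributes k - 1 dummy terms;
    each numeric variable contributes one linear term.
    """
    return sum(k - 1 for k in categorical_levels.values()) + len(numeric)


def ratio(events: int, denominator: int) -> float:
    """Events divided by a count of variables or parameters."""
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return events / denominator


events = 101
categorical = {"treatment_group": 2, "sex": 2, "site": 8}  # assumed level counts
numeric = ["age", "baseline_score", "comorbidity_index"]

n_vars = len(categorical) + len(numeric)      # 6 candidate predictors
n_params = n_parameters(categorical, numeric)  # 1 + 1 + 7 + 3 = 12

epv = ratio(events, n_vars)    # about 16.8 events per variable
epp = ratio(events, n_params)  # about 8.4 events per parameter
```

Under these assumptions the site variable alone accounts for 7 of the 12 parameters, which is why EPP can look much less comfortable than EPV for the same data.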
Variable Distributions
Automatic
age · continuous
baseline_score · continuous
comorbidity_index · count
treatment_group · categorical
sex · categorical
site · categorical
survival_90d · binary
Pairwise Correlations
Automatic
Pearson correlations across continuous and count variables. Strong correlations may indicate collinearity.

| | age | baseline_score | comorbidity |
|---|---|---|---|
| age | — | -0.24 | 0.38 |
| baseline_score | -0.24 | — | -0.31 |
| comorbidity | 0.38 | -0.31 | — |
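
For reference, a Pearson coefficient of this kind can be computed as sketched below. This is an illustration, not the system's implementation: it assumes pairwise deletion of rows where either value is missing, and the 0.7 cutoff used to call a correlation "strong" is a hypothetical threshold that the page does not state.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple


def pearson(x: Sequence[Optional[float]], y: Sequence[Optional[float]]) -> float:
    """Pearson correlation using only rows where both values are present."""
    pairs = [(a, b) for a, b in zip(x, y) if a is not None and b is not None]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two complete pairs")
    mean_x = sum(a for a, _ in pairs) / n
    mean_y = sum(b for _, b in pairs) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in pairs)
    sxx = sum((a - mean_x) ** 2 for a, _ in pairs)
    syy = sum((b - mean_y) ** 2 for _, b in pairs)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant column")
    return sxy / math.sqrt(sxx * syy)


def strong_pairs(
    corr: Dict[Tuple[str, str], float], threshold: float = 0.7
) -> List[Tuple[str, str]]:
    """Return variable pairs whose absolute correlation meets the threshold."""
    return [pair for pair, r in corr.items() if abs(r) >= threshold]


# Coefficients as reported in the matrix above.
reported = {
    ("age", "baseline_score"): -0.24,
    ("age", "comorbidity"): 0.38,
    ("baseline_score", "comorbidity"): -0.31,
}
flagged = strong_pairs(reported)  # empty: no pair reaches the assumed 0.7 cutoff
```

Pairwise correlations only catch two-variable relationships; collinearity involving three or more predictors would need a different check, such as variance inflation factors.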