Generate Healthcare Readmission ML Training Data
Generate publication-ready healthcare readmission datasets with clinical feature distributions, configurable readmission rates, and realistic class imbalance. No PHI, no HIPAA concerns.
Healthcare Readmission template configuration
Here's the pre-built template configuration. Customize everything after loading.
{
"templateName": "Healthcare Readmission",
"taskType": "classification",
"numSamples": 5000,
"features": [
{ "name": "age", "type": "numeric", "distribution": "normal", "mean": 65, "std": 12 },
{ "name": "num_procedures", "type": "numeric", "distribution": "poisson", "mean": 3 },
{ "name": "length_of_stay", "type": "numeric", "distribution": "log-normal", "mean": 5, "std": 3 },
{ "name": "num_medications", "type": "numeric", "distribution": "normal", "mean": 12, "std": 5 },
{ "name": "diagnosis_category", "type": "categorical", "categories": ["cardiac", "respiratory", "digestive", "musculoskeletal", "endocrine"] },
{ "name": "has_diabetes", "type": "boolean", "trueRatio": 0.3 }
],
"target": { "labels": ["not_readmitted", "readmitted"], "weights": [70, 30] },
"noise": 0.15
} Built for Healthcare
Every feature is configured with domain-appropriate distributions and realistic parameters.
Clinical Distributions
Age follows normal distributions centered on elderly populations. Length of stay uses log-normal for realistic right-skewed hospital stays. Procedure counts use Poisson distributions matching clinical patterns.
Diagnosis Category Encoding
Five diagnosis categories (cardiac, respiratory, digestive, musculoskeletal, endocrine) with weighted distributions reflecting real-world admission patterns.
Readmission Class Imbalance
Pre-configured 30/70 readmitted/not-readmitted split matching typical hospital readmission rates. Adjust the weights to test different imbalance scenarios.
Comorbidity Indicators
Boolean features like has_diabetes with configurable prevalence rates. Add multiple comorbidity flags to increase clinical realism.
Who uses Healthcare Readmission training data?
Hospital Quality Teams
Build readmission prediction models to identify high-risk patients before discharge. Test interventions with controlled synthetic data before deploying on real patient records.
Health Informatics Students
Learn clinical ML workflows with realistic healthcare data. No IRB approval needed, no HIPAA concerns. Focus on modeling, not data access.
Healthcare ML Engineers
Prototype and benchmark readmission models with known ground truth. Compare feature engineering strategies across controlled data variations.
Realistic Clinical Data Patterns
SynthForge IO generates healthcare data that mirrors real clinical distributions without exposing any protected health information.
Age-Appropriate Distributions
Patient age distributions centered on realistic ranges for hospital readmission populations, with configurable mean and standard deviation.
Right-Skewed Stay Duration
Length of stay follows log-normal distributions. Most stays are short, with a long tail of extended hospitalizations matching real-world patterns.
Configurable Readmission Rate
Set the readmission rate to match your hospital's baseline (typically 15-30%) or test extreme scenarios. The class balance is fully adjustable.
HIPAA-Safe by Construction
All data is generated from statistical distributions. No real patient records are used, referenced, or derivable. Safe for research, education, and development.
More ML use cases
Frequently asked questions
Is this data HIPAA-compliant?
How realistic are the clinical distributions?
Can I adjust the readmission rate?
What export formats are available?
Can I use this to evaluate baseline models?
Start Generating Healthcare Readmission Training Data
Load the Healthcare Readmission template, customize features and parameters, and export publication-ready datasets in seconds.