Healthcare Dataset Generator

The Best Way to Practice with Realistic, Anonymized Patient Data.

Advanced Options

Select your options and click "Generate Data".

Data Preview

Preview will appear here...

The Ultimate Healthcare Dataset Generator

Every healthcare data analyst knows the challenge: analyzing patient outcomes, tracking insurance claims, or understanding clinical workflows requires a complex, multi-faceted dataset. Finding realistic, anonymized practice data that connects patient demographics, clinical encounters, and billing information is nearly impossible.

That’s why we built the PNRao Healthcare Data Generator.

This powerful tool creates a complete, enterprise-grade healthcare dataset with a single click. Whether you’re a healthcare administrator analyzing readmission rates, a student learning about health informatics, or a data analyst building a provider performance dashboard, this generator provides the data you need. Stop working with simple patient lists and start analyzing the entire patient journey.

Why This Dataset is a Powerful Learning Tool

Learning healthcare analytics with abstract numbers can be challenging. This generator provides a realistic context that makes complex concepts easier to grasp.

  • Builds a Narrative: You can step into the role of a Data Analyst for the PNRao Healthcare System. Your task could be to identify which diagnoses are most common, analyze the cost of different procedures, or track patient vital signs across multiple encounters. This narrative-driven practice helps solidify your understanding.
  • Makes Concepts Concrete: When you see a Diagnoses table linked to Encounters and Patients, abstract concepts become tangible. This makes metrics like Average Length of Stay, Patient Readmission Rates, and Claim Denial Rates easier to understand because you can see how they are built from the ground up.
  • Encourages Realistic Analysis: The rich context inspires more creative and insightful analysis. You might build a dynamic provider scorecard in Power BI or create a report in Excel to analyze the frequency of different medical procedures. This allows you to practice not just the technical skills, but also the art of data storytelling.

Designed for Real-World Healthcare Analysis

This dataset isn’t just a random collection of tables. It has been carefully crafted to mirror the data structures you will find in actual Electronic Health Record (EHR) and billing systems, making it the perfect tool to develop practical, job-ready skills.

  • Reflects Core Healthcare Operations: Every hospital or clinic deals with patients, providers, diagnoses, and billing. The tables in this generator represent the core of what you will encounter professionally. By understanding how these tables relate to each other, you are learning the fundamental blueprint of healthcare data.
  • Focus on a High-Demand Skillset: Healthcare analytics is a critical and rapidly growing field. The skills you build here—analyzing clinical outcomes, tracking financial claims, and evaluating provider efficiency—are directly transferable to the tasks and challenges you will face in a real data analytics role in the healthcare sector.
  • Covers a Wide Range of Concepts: The structure of this data is intentionally designed to be a comprehensive playground for learning. With this single resource, you can practice:
    • Excel/Sheets Functions: Use VLOOKUP or XLOOKUP to connect patient IDs with names, SUMIFS and COUNTIFS to create summary reports, and PivotTables to analyze charges by department.
    • Statistical Analysis: Use the data to practice correlation analysis (e.g., specific procedures vs. patient outcomes) or to build simple predictive models.
    • Data Cleaning: Use the “messy data” option to practice cleaning and validating patient and claims data, a critical real-world skill.
    • Advanced Analysis: The interconnected tables are perfect for practicing relationship-building in Power BI or writing SQL JOIN queries to combine data from multiple sources for a complete analysis.

In short, learning with this dataset ensures you are not just practicing abstract concepts, but preparing yourself with the practical knowledge and experience valued by employers.

Tables Overview

The generator produces a wide array of datasets across different healthcare functions. Each dataset has a unique structure, as detailed below.

Patient & Demographics

Table Name Description Columns
Patients Master The master list of all patients in the system. PatientID, FullName, DateOfBirth, Gender, BloodType
Patient Contacts Contact and emergency information for each patient. ContactID, PatientID, Address, Phone, Email

Clinical & Encounters

Table Name Description Columns
Encounters Log A record of every patient visit or admission. EncounterID, PatientID, ProviderID, FacilityID, EncounterDate, EncounterType
Diagnoses Diagnoses recorded for each encounter. DiagnosisID, EncounterID, ICD10_Code, DiagnosisDescription
Procedures Procedures performed during each encounter. ProcedureID, EncounterID, CPT_Code, ProcedureDescription
Vitals Log A log of patient vital signs taken during encounters. VitalsID, EncounterID, SystolicBP, DiastolicBP, HeartRate, Temperature_F

Billing & Claims

Table Name Description Columns
Charges A list of all billable charges for procedures. ChargeID, EncounterID, CPT_Code, ChargeAmount
Insurance Claims Records of claims submitted to insurance providers. ClaimID, ChargeID, InsuranceProvider, AmountBilled, AmountPaid, Status

Providers & Facilities

Table Name Description Columns
Providers Master A list of all doctors and healthcare providers. ProviderID, FullName, Specialty
Facilities Master A list of all hospitals and clinics. FacilityID, Name, Type

Pharmacy & Medications

Table Name Description Columns
Prescriptions A log of all medications prescribed to patients. PrescriptionID, EncounterID, MedicationID, PharmacyID, DatePrescribed, Quantity, Refills
Medications Master A list of all available medications. MedicationID, Name
Pharmacies Master A list of pharmacies. PharmacyID, Name

How to Use This App

  1. Select a Table to Preview: Use the “Select Table to Preview” dropdown menu to choose the type of data you want to inspect.
  2. Specify the Number of Patients: Enter the desired number of patients. This is the main driver; a higher number will generate more related records in all other tables.
  3. Configure Advanced Options (Optional): Click on Advanced Options to expand the menu.
    • Date Range: Select a Start Date and End Date to constrain the generated data to a specific time period.
    • Data Quality: Choose the quality of your dataset (Clean, Slightly Messy, or Very Messy).
  4. Generate the Data: Click the “Generate All Datasets” button. The application will process your request and display a preview of your selected table.
  5. Download Your Data:
    • Preview: To quickly download just the data shown in the preview table, click “Download Preview (CSV)” or “Download Preview (Excel)”.
    • All Related Tables: Click “Download All Tables (Excel)” to download a single Excel file with every table on a separate, clearly named sheet. This is the best option for relational analysis.

How to Use This Data

  • For Data Analysts & BI Professionals:
    • Clinical Dashboards: Use the Encounters, Diagnoses, and Providers tables to build a dashboard in Tableau or Power BI that analyzes patient visit reasons and provider workloads.
    • Financial Analysis: Analyze the Charges and Claims tables to understand revenue cycles and claim denial rates.
    • Practice SQL: Load the tables into a database and practice writing complex JOIN queries to link patients to their encounters, diagnoses, and prescriptions.
  • For Students & Educators:
    • Create realistic case studies for health informatics, healthcare administration, and data science courses.
    • Use the data as a foundation for assignments on analyzing patient demographics, tracking clinical outcomes, and understanding healthcare billing.

Feedback & Suggestions

This tool is built for you. If you have an idea for a new healthcare table or an improvement, please leave your feedback in the comments section below.