Truveta Data
The most complete, timely, and clean EHR data
Complete EHR data, including labs, images, and clinician notes – linked with claims and SDOH to accelerate research across all diseases, drugs, and devicesÂ
Advancing patient care and outcomes has been limited by fragmented, inaccessible, and unstructured data – until now
Truveta Data comes from a growing collective of more than 30 health systems that provide 18% of the daily clinical care across all 50 states from 800 hospitals and 20,000 clinics, representing the full diversity of the US

patients and growing
clinical notes per patient
unique medical devices
years patient history
Discover the most complete picture of US health
EHR data is linked across health systems and integrated with social drivers of health (SDOH), mortality, and claims data for a complete view of patient journeys
Truveta Data includes the full medical record, unlike claims data which lacks lab values, symptoms, side effects, and clinical outcomes.
All age groups, infants, children, and elderly. All care settings, ambulatory out-patient and acute in-patient. All insurance, commercial, Medicare, and the uninsured.
Clinical data
Partner data

“We are thrilled to partner with Truveta to use their unprecedented access to EHR data for real-world data research and advance our understanding of epilepsy and seizure disorders. Through Truveta, we can uncover insights into patient care and outcomes that we’ve never seen before at this scale and breadth of data, including seizure frequency in clinical notes. Together, we believe we can drive meaningful improvements in the lives of patients living with epilepsy.”
Sean Stern
Director Health Economics Outcomes Research (HEOR), SK Life Science Inc.
Unprecedented access to notes and images
Unlock meaningful insights within clinical notes
Uncover previously hidden, deep clinical insights from more than 2 billion free text notes with an average of 75 clinical notes per patient.
Progress notes, discharge reports, lab reports, and transcribed telephone encounters reveal details like disease symptoms, clinical status measures or scores, and reasons for medication changes.
Learn from medical images
Gain access to full images, including MRI’s, CT scans, X-rays, ultrasounds and digital pathology to study digital images alongside the complete, longitudinal patient record to better understand diagnosis, treatment, and prognosis.
Learn from medical images
Gain access to full images, including MRI’s, CT scans, X-rays, ultrasounds and digital pathology to study digital images alongside the complete, longitudinal patient record to better understand diagnosis, treatment, and prognosis.
Daily updated data
Data from more than 30 health systems is updated daily, empowering researchers to discover insights on yesterday’s care, today
Billions of data points cleaned with unmatched accuracy
Truveta Language Model, a large-language, multi-modal AI model, transforms billions of data points with industry-leading normalization for unmatched accuracy, without the commercial bias found in claims data, which is normalized for revenue optimization.






























Every Truveta study can be a health equity study
Comprehensive social drivers of health data enable researchers to better understand the impact of lifestyle and social factors on clinical care and outcomes