Ehr Data+x: Expanding The Reach Of Ehr Through Data Integration

Loading...
Thumbnail Image
Degree type
Doctor of Philosophy (PhD)
Graduate group
Epidemiology & Biostatistics
Discipline
Subject
Data Integration
Distributed algorithm
Electronic health records
Evidence synthesis
High-dimensional regression
Multiview Learning
Biostatistics
Funder
Grant number
License
Copyright date
2021-08-31T20:20:00-07:00
Distributor
Related resources
Author
Duan, Rui
Contributor
Abstract

The growth of availability and variety of healthcare data sources has provided unique opportunities for data integration and evidence synthesis, which can potentially accelerate knowledge discovery and enable better clinical decision making. However, many practical and technical challenges, such as data privacy, high-dimensionality and heterogeneity across different datasets, remain to be addressed. In Chapters 1-3, we develop several methods for effective integration of electronic health records (EHRs) and other healthcare datasets. We develop communication-efficient distributed algorithms for joint analyses of multiple datasets without the need of sharing patient-level data. Our algorithms do not require iterative communication across sites, and are able to account for heterogeneity across different datasets. We provide theoretical guarantees for the performance of our algorithms, and examples of implementing the algorithms to real world clinical research networks, including the observational health data sciences and informatics (OHDSI) and the national patient-centered clinical research networks (PCORnet). In Chapter 4, we propose a novel bilinear regression model for linking EHR with genetic or imaging data, which incorporates the low-rank and sparse structure of the association between high-dimensional covariates and outcomes. We develop an iterative algorithm to solve the non-convex optimization in the parameter estimation, and a simultaneous hypothesis testing procedure with theoretical guarantees of false discovery rate control. Our method is applied to a multi-view brain network analysis for Parkinson's Disease.

Advisor
Yong Chen
Date of degree
2020-01-01
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation