Development of an open-source, flexible framework for complex inter-institutional disparate data sharing and collaboration

AMIA Jt Summits Transl Sci Proc. 2013 Mar 18:2013:103. eCollection 2013.

Abstract

Clinical information, "-omic" datasets, and tissue samples are difficult to harmonize and manage for data mining. We have developed a platform for storing clinical research data while providing access to associated data from other information stores. Data on 34 metrics from 11,000 neuroblastoma patients were instantiated into a database. The Django web framework was used to create a model for rapid development of tools and views with a front-end interface for generating complex queries. Working with Nationwide Children's Hospital, we can now consume their tissue inventory data through an API. The end-user sees the number of patients who both match their search and have tissue available. Since initial implementation, the current tasks revolve around developing a governance structure and the necessary data use agreements. Efforts now are to (1) update the data with 5000 more patients, and (2) link to genomic data stores, facilitating disparate data acquisition for research studies.