Migrating a Well-Established Longitudinal Cohort Database From Oracle SQL to Research Electronic Data Entry (REDCap): Data Management Research and Design Study.

Kusejko, Katharina; Smith, Daniel; Scherrer, Alexandra; Paioni, Paolo; Kohns Vasconcelos, Malte; Aebi-Popp, Karoline; Kouyos, Roger D; Günthard, Huldrych F; Kahlert, Christian R (2023). Migrating a Well-Established Longitudinal Cohort Database From Oracle SQL to Research Electronic Data Entry (REDCap): Data Management Research and Design Study. JMIR formative research, 7, e44567. JMIR Publications 10.2196/44567

[img]
Preview
Text
document.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (640kB) | Preview

BACKGROUND

Providing user-friendly electronic data collection tools for large multicenter studies is key for obtaining high-quality research data. Research Electronic Data Capture (REDCap) is a software solution developed for setting up research databases with integrated graphical user interfaces for electronic data entry. The Swiss Mother and Child HIV Cohort Study (MoCHiV) is a longitudinal cohort study with around 2 million data entries dating back to the early 1980s. Until 2022, data collection in MoCHiV was paper-based.

OBJECTIVE

The objective of this study was to provide a user-friendly graphical interface for electronic data entry for physicians and study nurses reporting MoCHiV data.

METHODS

MoCHiV collects information on obstetric events among women living with HIV and children born to mothers living with HIV. Until 2022, MoCHiV data were stored in an Oracle SQL relational database. In this project, R and REDCap were used to develop an electronic data entry platform for MoCHiV with migration of already collected data.

RESULTS

The key steps for providing an electronic data entry option for MoCHiV were (1) design, (2) data cleaning and formatting, (3) migration and compliance, and (4) add-on features. In the first step, the database structure was defined in REDCap, including the specification of primary and foreign keys, definition of study variables, and the hierarchy of questions (termed "branching logic"). In the second step, data stored in Oracle were cleaned and formatted to adhere to the defined database structure. Systematic data checks ensured compliance to all branching logic and levels of categorical variables. REDCap-specific variables and numbering of repeated events for enabling a relational data structure in REDCap were generated using R. In the third step, data were imported to REDCap and then systematically compared to the original data. In the last step, add-on features, such as data access groups, redirections, and summary reports, were integrated to facilitate data entry in the multicenter MoCHiV study.

CONCLUSIONS

By combining different software tools-Oracle SQL, R, and REDCap-and building a systematic pipeline for data cleaning, formatting, and comparing, we were able to migrate a multicenter longitudinal cohort study from Oracle SQL to REDCap. REDCap offers a flexible way for developing customized study designs, even in the case of longitudinal studies with different study arms (ie, obstetric events, women, and mother-child pairs). However, REDCap does not offer built-in tools for preprocessing large data sets before data import. Additional software is needed (eg, R) for data formatting and cleaning to achieve the predefined REDCap data structure.

Item Type:

Journal Article (Original Article)

Division/Institute:

04 Faculty of Medicine > Department of Haematology, Oncology, Infectious Diseases, Laboratory Medicine and Hospital Pharmacy (DOLS) > Clinic of Infectiology

UniBE Contributor:

Aebi-Popp, Karoline Lieselotte

Subjects:

600 Technology > 610 Medicine & health

ISSN:

2561-326X

Publisher:

JMIR Publications

Language:

English

Submitter:

Pubmed Import

Date Deposited:

01 Jun 2023 08:06

Last Modified:

04 Jun 2023 02:27

Publisher DOI:

10.2196/44567

PubMed ID:

37256686

Uncontrolled Keywords:

HIV REDCap cohort study data collection digital solution eCRF electronic case report forms electronic data entry software

BORIS DOI:

10.48350/183098

URI:

https://boris.unibe.ch/id/eprint/183098

Actions (login required)

Edit item Edit item
Provide Feedback