AUTHOR=Schink Tania , Wentzell Nadine , Dathe Katarina , Onken Marlies , Haug Ulrike TITLE=Estimating the Beginning of Pregnancy in German Claims Data: Development of an Algorithm With a Focus on the Expected Delivery Date JOURNAL=Frontiers in Public Health VOLUME=8 YEAR=2020 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2020.00350 DOI=10.3389/fpubh.2020.00350 ISSN=2296-2565 ABSTRACT=

Background: Estimating the beginning of pregnancy is crucial when studying drug safety in pregnancy, but important information in this regard, such as the last menstrual period (LMP), is generally not recorded in claims databases. The beginning of pregnancy is therefore usually estimated by subtracting a median length of pregnancy from the date of birth. Due to the variability in pregnancy lengths, this might result in non-negligible errors. German claims data may offer the possibility to estimate the beginning of pregnancy more precisely based on the expected delivery date (EDD) which can be coded once or more often during a pregnancy.

Purpose: To estimate the beginning of pregnancy in German claims data focusing on the potential of the expected delivery date (EDD).

Methods: We included data of all pregnancies in women aged 12–50 years ending in a live birth between 2006 and 2015 identified in the German Pharmacoepidemiological Research Database (GePaRD). We assessed the number of coded EDDs per pregnancy and the concordance if ≥ 2 EDDs were coded. We estimated the beginning of pregnancy by subtracting 280 days from the EDD or the most frequent EDD (in case of discordant EDDs). To examine plausibility, we determined the distribution of pregnancy lengths and assessed whether the gestational age at which prenatal examinations were coded was plausible. For pregnancies without EDD, the beginning was estimated by subtracting the respective observed median lengths of pregnancy for preterm births, term births, and births after due date from the actual dates of birth.

Results: In 82.4% of pregnancies, at least one EDD was available (thereof 6.1% with only one EDD and 80.9% with ≥ 2 EDDs that were all concordant). The maximal difference between discordant EDDs was in median 5 days (interquartile range: 3–7 days). Based on the EDD, the median length of pregnancy was 276 days for term births and in 84.7% of pregnancies the second antibody screening test was performed in the recommended interval ± 2 weeks. In pregnancies without EDD the respective proportion was 84.9%.

Conclusions: By using the EDD, the beginning of pregnancy can plausibly be estimated in German claims data.