Abstract
Aim: To identify differentially expressed proteins (DEPs) in 1st trimester maternal plasma between pregnant women at risk for gestational diabetes mellitus (GDM) and uncomplicated controls. Materials and Methods: First-trimester plasma from five women who developed GDM and five from non-diabetic ones were analyzed using isobaric tag for relative and absolute quantitation – labeled proteomics. Enzyme-linked immunosorbent assay was further applied in an independent cohort of 25 GDM cases and 25 controls for verification. Results: Prenylcysteine oxidase 1 (PCYOX1), beta-ala-his dipeptidase (CNDP1), extracellular matrix protein 1 (ECM1), basement membrane-specific heparan sulfate proteoglycan core protein (HSPG2), thrombospondin-4 (TSP-4) demonstrated significant differences in expression between the two groups (p<0.05). DEPs are mainly associated with complement and coagulation cascades. Conclusion: The reported plasma proteomic changes represent potential biomarkers for the early identification of women at risk for GDM. Future studies using larger and more diverse cohorts are necessary to assess the clinical utility of these findings.
- Biomarkers
- GDM
- gestational diabetes mellitus
- mass spectrometry
- TMT isobaric labeling
- nano-LC-MS/MS
- prenatal screening
- pregnancy complications
Gestational diabetes mellitus (GDM) is a common metabolic pregnancy-related complication defined as carbohydrate intolerance resulting in hyperglycemia with onset or first diagnosis during pregnancy (1). Depending on the population studied and the criteria employed for screening, the prevalence ranges between 1-14% with growing incidence, which is associated with the increased prevalence of obesity among women of reproductive age (2, 3). Although the pathogenetic mechanism leading to GDM is poorly understood, the cause of the condition is linked to insulin resistance and insulin deficiency.
GDM is associated with various adverse consequences for both, the mother and their offspring, including preeclampsia, cesarean delivery, preterm birth, macrosomia, fetal malformations, hyperbilirubinemia and neonatal metabolic disorders as well as an exceptionally high risk for developing cardiovascular disease and type 2 diabetes mellitus (T2DM) after the index pregnancy (4, 5).
Currently, in clinical practice GDM is diagnosed using the oral glucose tolerance test (OGTT) at 24-28 weeks of pregnancy (1). Even though this test is minimally invasive, it requires a hospital visit and multiple blood draws. Furthermore, it is offered at 24-28 weeks of pregnancy, when sufficient fetal adipose tissue has developed (6). Earlier detection, however, and more accurate diagnostic methods will be highly beneficial for patients as it will lead to increased diagnosis rates, improved perinatal outcomes through timely interventions and prevent maternal and offspring risk for future metabolic disease.
Mass spectrometry-based comparative proteomic profiling of maternal plasma could provide a deeper insight in the pathogenesis of the disease. Through identification of GDM-related non-invasive predictive and diagnostic biomarkers it can broaden the horizons on disease screening, diagnosis and treatment (7). The underlying rationale for such studies is that proteins secreted into maternal circulation reflect physiological or disease states and alterations in expression levels are indicative of pregnancies at risk. However, due to the complexity and the high dynamic range of the plasma proteome, identification of plasma biomarkers remains challenging and state-of-the-art quantitative approaches allowing precise assessment of low-abundance proteins in mixed samples are required.
The present study aims to identify proteins with significantly altered expression in the 1st trimester maternal plasma, in women subsequently diagnosed with GDM and their matched controls via employment of tandem mass tags (TMT) isobaric labeling followed by nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS). These proteins represent candidate biomarkers for prediction and diagnosis of GDM at an earlier stage and may provide important clues for the mechanisms of this pregnancy related complication.
Materials and Methods
Study participants. Peripheral blood samples for this case-control study were retrospectively selected for analysis from the maternity plasma bio-bank at the 3rd department of Obstetrics and Gynecology of Athens University School of Medicine. Patients were recruited between 2015 and 2017. Blood samples were collected from women undergoing 1st trimester (11-13 weeks of gestation) prenatal screening for fetal aneuploidies into EDTA- tubes and centrifuged at 2500×g for 10 min at 4°C. Plasma was aliquoted in sterile vacutainers supplemented with protease inhibitor and stored at -80°C for subsequent analysis.
Women's demographic and clinical data including results of the OGTT test, which is routinely offered to all pregnant women at our hospital, were recorded in the hospital records and reviewed by the obstetricians involved in the project for patient selection. A total of 60 samples were retrieved for analysis. Maternal clinical characteristics and pregnancy outcomes are shown in Table I.
During the first phase of the study, the discovery step, plasma samples from 5 women subsequently diagnosed with GDM and 5 non-diabetic participants (controls) were analyzed using TMT isobaric labeling coupled with n-LC-MS/MS in order to identify proteins showing significant differences in expression levels between GDM and control samples. GDM cases and controls were matched for maternal age, parity and pre-pregnancy body mass index (BMI).
At the second phase, the differential expression of selected proteins, identified during the discovery step was verified by performing enzyme-linked immunosorbent assay (ELISA) in an independent cohort of 25 GDM cases and an equal number of non-GDM controls randomly selected.
All women included in the study had a singleton pregnancy, were free from any medical disorders or pregnancy complications and delivered phenotypically normal neonates at >37 weeks' gestation. Pregnancies ending in termination or miscarriage were excluded as were smokers, women who underwent assisted reproductive treatment and those with a previous delivery of a large for gestational age neonate. None of the samples were previously thawed and refrozen.
In accordance with the guidelines of the American College of Obstetricians and Gynecologists women were considered to have GDM if venous plasma glucose levels reached or exceeded one or more of the following criteria: a fasting morning plasma glucose of 92 mg/dl (5.1 mM); a 1 h post-load 75 g oral glucose of 180 mg/dl (10 mM); a 2 h post-load glucose of 153 mg/dl (8.2 mM) (8). Control participants had a negative OGTT result and did not developed GDM during pregnancy.
The Scientific and Ethics Committee at the “Attikon” University Hospital Greece approved the study protocol (approval reference number: 27-268 ex 14/15). All women gave written informed consent to participate in the study, which was performed in accordance with the relevant guidelines and regulations.
Sample preparation and mass spectrometry analysis.
Depletion of high-abundance plasma proteins. To increase the accuracy of proteomic analysis, human plasma was depleted of abundant proteins (albumin and antibody components) using the Pierce™ Top 2 Abundant Protein Depletion Spin Columns (Thermo Fisher Scientific™, Waltham, MA, USA) as per the manufacturer's specifications. The total protein concentration was determined using the Pierce™ BCA Protein Assay Kit (Thermo Fisher Scientific™, Waltham, MA, USA).
Protein tryptic digestion and TMT reagent labeling. For digestion, 40 μg of depleted plasma proteins from each sample were reduced by 10mM Tris-(2-carboxy-ethyl) phosphine (TCEP) at 56°C for 1 h, alkylated by addition of 20 mM iodoacetamide for 1h at room temperature in the dark and treated by trypsin protease (Promega, Madison, WI, USA) at a ratio of 1:100 at 37°C over-night to generate proteolytic peptides (9). After trypsin digestion, peptides were lyophilized to near dryness. Peptide-containing solutions from each sample (GDM or control) were separately labeled using TMT10plex™ Isobaric Label Reagent Set (Thermo Fisher Scientific™, Waltham, MA, USA), according to the manufacturer's instructions. Labeled peptides were mixed together at equal amount before further analysis.
High-performance liquid chromatography (HPLC) fractionation. To reduce sample complexity the labeled peptides were loaded onto a C18 column (10 mm×250 mm, 5μm) (Thermo Fisher Scientific™, Waltham, MA, USA) for peptide fractionation. The mobile phase was a mixture of 0.1% formic acid in water (A) and 0.1% formic acid in acetonitrile (B) with a liquid chromatography (LC) linear gradient of 8-32% buffer B over 60 min. Each mixed sample was fractionated into 6 sections for subsequent mass spectrometry analysis.
nanoLC-MS/MS analysis. nano-LC-MS/MS was carried out on an Orbitrap Q Exactive™ HF mass spectrometer with an electro-spray ionization nano-spray source coupled on-line to a Ultimate-3000 system (Thermo Fisher Scientific™, Waltham, MA, USA).
Briefly, 5 μl of labeled peptide mixtures were injected onto a trapping column (PepMap C18, 100A, 100 μm×2 cm, 5 μm) and washed with buffer A (0.1% formic acid in water). Following valve switching, peptides were separated on an analytical column (PepMap C18, 100A, 75 μm×50 cm, 2 μm) at a constant flow of 250 nl/min and eluted with a linear gradient from 5 to 7% buffer B (0.1% formic acid in acetonitrile) in 2 min, from 7% to 20% buffer B in 80 min, from 20% to 40% buffer B in 35 min, then from 40% to 90% buffer B in 4 min. The instrument was operated in data-dependent acquisition mode (IDA). Data were collected in a positive ion mode over a broad mass to charge (m/z) precursor ion selection scan range of m/z 300-1650. The 15 most intense ions were isolated for fragmentation by collision-induced dissociation at 40% normalized collision energy (NCE) (10).
Data analysis. The raw data were analyzed using the Maxquant (version 1.5.1.0) software to identify proteins and peptides and searched against the human proteome of reference found at UNIPROT Knowledgebase v.2.16 (http://www.uniprot.org) (11). The parameters were set as follows: the protein modifications were carbamidomethylation (C) (fixed), oxidation (M) (variable); the enzyme specificity was set to trypsin; the maximum missed cleavages were set to 2; the precursor ion mass tolerance was set to 10 ppm and MS/MS tolerance was 0.6 Da. Only peptides/proteins identified at a False Detection Rate (FDR) <1% were accepted. For protein abundance ratios, we used fold change >1.5 or <0.83 as the threshold.
Gene ontology and pathway analysis. Gene ontology (GO) annotation of the differentially expressed proteins was performed in three aspects: biological process (BP), molecular function (MF) and cellular component (CC) via employment of the Webgestalt web-tool (12). KEGG pathway enrichment analysis was conducted using the Database for Annotation, Visualization and Integrated Discovery (DAVID) online tools with the cut-off criterion of p<0.05 (13).
ELISA analysis. ELISA analysis was performed to verify the quantitative proteomics results. First trimester maternal plasma samples from an independent cohort of pregnant women (25 with GDM cases and 25 controls) were analyzed using commercially available kits for thombospondin-4 (TSP-4) (#OKAG00254) and beta-ala-his dipeptidase (CNDP1) (#OKEH00851) (Aviva Systems Biology, Corp., San Diego, CA, USA) according to the manufacturer's instructions.
Statistical analysis. Statistical tests were performed using MATLAB® (The Mathworks, Inc, Natick, MA). Differences in clinical characteristics between the two groups were assessed using ANOVA for independent samples or the Mann-Whitney U-test for continuous variables.
Receiver operating characteristic (ROC) curves were carried out to generate the diagnostic potential of differentially expressed proteins with 95% standard error (SE) and 95% confidence intervals (CI). Significant differences were defined as two-tailed p-value<0.05.
Results
NanoLC-MS/MS comparative proteomic analysis. On the basis of data acquisition, a total of 238 DEPs demonstrating a >1.5 and <0.8 were identified: 198 (83,2%) up-regulated and 40 (16.8%) down-regulated. Among them, five proteins namely prenylcysteine oxidase 1 [PCYOX1 (Q9UHG3)], beta-ala-his dipeptidase [CNDP1 (Q96KN2)], extracellular matrix protein 1 [ECM1 (Q16610)], basement membrane-specific heparan sulfate proteoglycan core protein [HSPG2 (P98160)], thrombospondin-4 [TSP-4 (P49744)] demonstrated a statistically significant altered expression between the two groups (p<0.05). TSP-4 was the most up-regulated protein, and its levels were increased by 3.4-fold in GDM cases as compared to normoglycemic ones. No protein with significant down-regulation was detected.
Bioinformatics analysis of the differentially expressed proteins. By applying the gene ontology (GO) classification of the abundant molecules identified by n-LC-MS/MS, according to their biological processes, the following categories emerged: biological regulation (n=215), metabolic process (n=208), response to stimulus (n=205), localization (n=168), multicellular organismal processing (n=167), cellular component organization (n=116), development (n=115), cell communication (n=102) and other classes such as multi-organism process, cell proliferation, growth and reproduction in lower numbers (Figure 1A).
Clustering of identified proteins as differentially expressed in GDM cases, under the term “Cellular Components”, predicted that the majority are extracellular (n=226), followed by vesicles (n=190) and proteins localized in the endomembrane system (n=127) (Figure 1B). Other significantly enriched categories include the membrane-enclosed lumen, membranes, protein containing complexes and the extracellular matrix as well as proteins residing in the nucleus, cytosol and organelle compartmentalization such as mitochondria, endoplasmatic reticulum, Golgi apparatus and ribosomes.
Associations of the DEPs using their “Molecular function” as criterion are illustrated in Figure 1C. The top-5 over-represented terms include protein binding activity (n=181), ion binding (n=115), hydrolases (n=69), enzyme regulators (n=50) and lipid binding (n=38).
Through application of the Kyoto Encyclopedia Genes and Genomes (KEGG) bioinformatics software, proteins with altered expression levels in the first trimester maternal plasma of GDM cases were found mainly engaged in the complement and coagulation cascades, staphylococcus aureus infection, prion diseases and cholesterol metabolism (Figure 2).
Verification of protein expression by ELISA. Consistent with the n-LC-MS/MS data, ELISA analysis verified the overexpression of TSP-4 and CNDP1 in the first trimester maternal plasma in women who subsequently developed GDM (Figure 3). Both proteins were examined for their performance in differentiating between GDM and control samples. The area under the curve (AUC) obtained for TSP-4 was 0.94 and for CND1 0.98 at p<0.001 (Figure 4).
Discussion
The present study focused on the detection of differential protein expression patterns between GDM and normoglycemic pregnant women, to determine whether specific 1st trimester maternal plasma proteins might represent candidate early predictors for the complication. The comparative proteomic analysis was carried out using n-LC-MS/MS. In order to increase the coverage of protein identification and the confidence in the data generated, proteins were labeled with TMT isobaric labeling reagents.
To the best of our knowledge, only two studies have previously applied TMT isobaric labeling combined with n-LC-MS/MS on screening and identifying novel candidate plasma biomarkers for GDM, using well-defined clinical samples obtained early during gestation. Zhao et al., analyzed plasma collected from women in the early 2nd trimester of gestation (12-16 weeks) and reported 31 proteins as differentially expressed in the GDM group mainly involved in blood coagulation, inflammation, immune response and complement activation (14). Ravnsborg et al. using serum from obese women at 11-13 weeks of pregnancy reported afamin, serum amyloid P-component and vitronectin as potential biomarkers for the complication (15).
In the present study, a total of 238 DEPs were identified. Out of those, 198 were up-regulated and 40 down-regulated. KEGG pathway enrichment analysis revealed that several of these proteins are implicated in the coagulation and complement pathway. The up-regulation of coagulation factors II (prothrombin) (P00734), V (P12259), IX (P00740), X (P00742) and XII (P00748) observed in the GDM group as compared to uncomplicated pregnancies suggests an exaggeration of the existing hyper-coagulation state associated with pregnancy (16). The list was accompanied by the altered expression of all three fibrinogen- polypeptide chains [α (P02671), β (P02675) and γ (P02679)] probably due to a common thrombocytosis state in the GDM patients (17, 18). Complement C3 (P01024), an important player in the activation of the complement system, in both, classical and alternative pathways, was found moderately over-expressed in the present study emphasizing the essential role of the immune system activation in the pathogenesis of GDM. Conflicting evidence, however, exists regarding C3 levels in GDM cases. Increased levels of C3 have been associated with diabetes and insulin resistance (19-21). Shen et al., however, reported decreased complement C3 concentration in GDM samples (6). Another significant finding of the present study is the moderate up-regulation of vitronectin (P04004), which has been previously identified as an independent candidate biomarker for GDM. Additionally, over-expression of the complement factor H (P08603) was observed, which has been previously associated with various metabolic disorders and obesity (6, 22).
A major finding of the present study is the significantly altered expression (p<0.05) of CNDP1, ECM1, HSPG2, PCYOX1 and TSP-4, which is novel and has not been previously associated with the complication.
CNPD1 is synthesized in the brain and secreted into the systematic circulation (23). CNDP1 hydrolyzes L-carnosine and is essential for glucose metabolism and the transportation of activated fatty acids across the mitochondrial membrane (24, 25). A genetic polymorphism of CNDP1 has been associated with decreased serum levels of L-carnosine and diabetic nephropathy in patients with T1DM and T2DM suggesting a protective role of L-carnosine for late complications of diabetic disease (26). Sauerhofer et al., in a mouse model mimicking T2DM pathology, demonstrated that lowered endogenous L-carnosine serum levels were associated with glycosuria, higher fasting plasma glucose levels and glycosylated haemoglobin A1c levels as well as decreased body weight throughout life (25). In contrast, supplementation of L-carnosine resulted in a milder diabetes manifestation indicating an association between CNDP1 expression levels and the pathogenesis of T2DM.
PCYOX1 is broadly expressed in various embryonic and adult tissues, predominantly the liver (27). It interacts with various proteins and plays an important role in the respiratory chain, cell death, cell signaling, movement and transport, metabolism and protein degradation. Increased PCYOX1 mRNA expression levels have been detected in the visceral adipose tissue of mice that were receiving a high fat diet whereas decreased PCYOX1 expression was reported in a group of patients under coptis chinensis administration, a common treatment for diabetes in China (27, 28). These observations combined with the findings from the present study suggest that PCYOX1 should be further investigated in relation to the pathogenesis of diabetes.
According to our findings increased ECM1 1st trimester plasma expression levels may have a crucial role in the initiation of GDM. ECM1 is a main component of the extracellular matrix involved in bone formation, angiogenesis in cancer and endothelial cell proliferation (29-31). Oliva et al., identified altered expression levels of ECM1 in the adipose tissue of pregnant women who subsequently developed GDM (29). They hypothesized that this increase in ECM1 protein expression levels results in higher rigidity in the adipose tissue, promoting fibrosis. This hypothesis is supported by a recent study by Fan et al., who demonstrated that high expression levels of ECM1 inhibit transforming growth factor beta (TGFB) activation to prevent liver fibrosis (32). Lee et al. investigated the functional role of ECM1 and concluded that it interacts with the epidermal growth factor (EGF) domain of various proteins, including HSPG2, and induce heparin-binding growth factor responses (33). HSPG2 also known as perlecan, is a heparan sulfate proteoglycan involved in catabolism of lipids and glucose lipoprotein metabolism, obesity and the onset of metabolic syndrome (34).
In our initial proteomic study TSP-4 was significantly up-regulated in the GDM group, showing a >3-fold change, the largest difference in a protein's expression level observed in the present study. TSP-4 is a member of the thrombospondin family, implicated in the regulation of endothelial cell and smooth muscle cell proliferation, fibrosis, cardiac hypertrophy, inflammation, the production of heparin sulfate core proteins and the remodeling of the myocardium in rats (35). In contrast to the rest molecules of the thrombospondin family, TSP-4 promotes angiogenesis (36, 37). Recently, Zierfuss et al. found increased levels of TSP-4 in T2DM patients with peripheral arterial disease suggesting that microvascular angiogenesis promoted in diabetes is associated with the overexpression of TSP-4 (38).
The strength of the present study lies in the use of a sensitive and reproducible mass spectrometry-based technique followed by ELISA confirmatory experiments to analyze plasma samples obtained from well-characterized GDM cases. Notably, several of the proteins identified in the present study have been previously reported in association with the later development of GDM including sex hormone-binding globulin (SHBG), adiponectin and C-reactive protein (CRP) as well as afamin, serum amyloid P-component and vitronectin reported as candidate biomarkers (15, 39, 40). Even though, their altered expression was not statistically significant, their detection further supports a potential role as GDM predictors and adds strength to the findings of the present study. We acknowledge, however, that the limited number of cases and the weakness of the case-control study design applied may have led to an overestimation. For this reason, further studies using a larger population representative of various ethnic groups are required to validate the data obtained and determine their performance in early predicting GDM.
Using proteomic analyses we identified over-expression of CNDP1, ECM1, HSPG2, PCYOX1 and TSP-4 in the 1st trimester maternal plasma of women who were subsequently diagnosed with GDM followed by confirmatory ELISA. These proteins show great potential to be used as early predictive biomarkers for prenatal screening as well as guiding the development of new therapeutic interventions. Our research, however, is preliminary and future studies is using a larger and more diverse cohort are necessary to validate these initial findings, assess their clinical utility and uncover their role in the pathogenesis of GDM.
Acknowledgements
The Authors acknowledge the support of Dr. G. Lambrou in statistical and bioinformatic analyses.
Footnotes
Authors' Contributions
DM performed all experiments, participated in the interpretation of data analyses and drafted the manuscript. NE performed patients' diagnosis and treatment, sample and data collection and assisted in all experiments' performance. NP participated in the design and coordination of the study and supervised the study. AK conceived and designed the study, and drafted the manuscript. All Authors approved the final manuscript.
This article is freely accessible online.
Funding
The research was co-funded by the European Union and Greek national funds through the Operational Program “Human Resources Development, Education and Lifelong Learning” (project code: MIS 5007192).
Conflicts of Interest
The Authors declare that there are no conflicts of interest.
- Received December 13, 2019.
- Revision received December 27, 2019.
- Accepted December 27, 2019.
- Copyright© 2020, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved