This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (CC BY).
ORIGINAL RESEARCH
Comparative proteomic and genetic testing methods in miscarriage
1 Pirogov Russian National Research Medical University, Moscow, Russia
2 Institute of Biomedical Chemistry, Moscow, Russia
3 Institute of Biology of Aging and Healthy Longevity Medicine with a Clinic of Preventive Medicine, Petrovsky National Research Centre of Surgery, Moscow, Russia
4 Peoples' Friendship University of Russia, Moscow, Russia
Correspondence should be addressed: Pyotr A. Klimenko
Novatorov, 3, Moscow Москва, 119421, Russia; ur.umsr@ap_oknemilk
Funding: the study was conducted as part of the Program for Basic Research in the Russian Federation for the Long Term (2021–2030) (No. 122030100170-5).
Acknowledgements: mass spectrometry measurements were conducted using the equipment of the Human Proteome SRF of the Institute of Biomedical Chemistry (Russia).
Author contribution: equally.
Compliance with ethical standards: the study was approved by the Ethics Committee of the Pirogov Russian National Research Medical University (protocol No. 228 dated 17 April 2023). All the surveyed individuals submitted the informed consent to take part in the study.
According to some data, spontaneous abortion in the first trimester most often results from natural selection, the number of embryos with chromosome abnormalities reaches 60–80% [1]. The rate of structural karyotype abnormalities in patients with recurrent pregnancy loss is 10 times higher than in the population: 2.4%. The standard embryo karyotyping involves testing for the Down syndrome, Patau syndrome, Edwards syndrome, Turner syndrome, Klinefelter syndrome. The patients having a history of two or more spontaneous abortions receive medical genetic counseling including the partners’ cytogenetic testing with explanation of the genealogical and cytogenetic findings, assessment of the risk of giving birth to children with developmental abnormalities, clarification of the need for prenatal diagnosis (including testing of the chorion), and the use of donor cells in severe cases. Cytogenetic testing of the embryo, chorion, determination of translocations in parents are necessary in all stillbirth cases.
The chorionic villi sampling for prenatal testing involving collection of a tissue sample from the placenta for karyotyping and detection of specific genetic or biochemical abnormalities in an unborn child is most often used in obstetric practice [2]. According to the transcriptome analysis, 65% (n = 13,074) of all human proteins (n = 20,090) are expressed in the placenta, and 288 of these genes show expression that is increased relative to other tissue types. In contrast to conventional biochemical approaches involving tracing one specific protein or more, proteomics, specifically LC-MS/MS, represents an effective method to detect the altered protein expression, as well as proteins involved in the disease pathogenesis. In recent years, there have been great advances in identification of the differentially regulated proteins, biomarkers, protein modifications and polymorphisms in various human tissues, and the so-called missing proteins by mass spectrometry [3].
A missing protein is an unconfirmed genetic sequence, for which a protein has not yet been discovered [4]. According to the international Human Proteome Project (HUPO), there are currently 1343 missing proteins without any annotated functions predicted by the bioinformatics analysis or studied experimentally. The missing protein detection complexity can be due to not only their low abundance in many tissues, but also expression confined to several cell types within the human body.
The ribosomal genes that encode ribosomal RNAs (18S, 28S, and 5,8S rRNA) being part of ribosomes, the proteinsynthesizing cytoplasmic organelles, are clustered in the ribosomal repeat (rDNA) represented in the genomes of eukaryotes by a large number of copies. The number of rDNA repeats in diploid human genomes varies between 200–711 copies [5]. Clusters of the rDNA tandem repeats of various size that are localized in short arms of five acrocentric chromosome pairs form the nucleolar organiser regions (NORs) of chromosomes. In humans, the rDNA copy number (R parameter) is determined by the combination of five pairs of parental acrocentric chromosomes. All rDNA copies in the cell are differentiated into potentially active and inactive. The number of active rDNA copies that is usually proportional to the number of inactive copies accounts for about 30–40% of the total copy number. The more rDNA copies there are in the genome, the more active copies there are, which are transcribed to ensure the rRNA quantity that is essential for ribosome biogenesis [6].
It has previously been shown that the ribosome biogenesis level, which depends on the number of rDNA copies in the genome, can affect conception and the course of pregnancy. The analysis of the share of non-viable zygotes in the samples of married couples with normal fertility, infertility, and miscarriage has shown that the loss of zygotes in the sample of healthy couples is significantly lower, than in the samples having reproductive problems. Therefore, the zygotic selection based on the dose of active rDNA copies in the genome can be one of the factors determining reproductive problems in some couples. In other words, very low and very high rDNA content in the embryonic genome can potentially hinder normal embryogenesis [7]. Another study has shown that the in vitro fertilization (IVF) procedure success depends on the total rDNA copy number in the woman’s genome [8]. Women with the lower rDNA copy number had a lower chance of getting pregnant through IVF. One of the hypotheses explaining this fact assumes that women with the low rDNA copy number transmit the lower number to the embryo, which is insufficient for successful embryogenesis.
Thus, the level and presence or absence of specific proteomes in the chorion must correspond to a certain number of different DNA forms in the extracellular DNA.
The study aimed to identify uncommon proteins and determine rDNA in the maternal and fetal genomes during embryogenesis.
METHODS
The continuous prospective study of patients with miscarriage was conducted in May–July, 2023 at the Family Planning and Reproduction Center. Observations, in which we failed to collect al three biological media within the first 3 h, were excluded. Inclusion criteria: frozen pregnancy of unknown origin; teratogenic effect; abortion by the patient’s free will. Exclusion criteria: other obstetric complications.
The study involved 45 patients, who were divided into three groups. A total of 25 patients (group I) were admitted to the hospital due to frozen pregnancy (5–13 weeks). Anembryonic pregnancy was reported in five of them, another 20 had frozen pregnancy of unknown origin. Furthermore, ultrasonography revealed discrepancies between the dead embryos’ anthropometric data and gestational age (in early pregnancy there was no heartbeat, and at pregnancy periods over 7 weeks a decrease in the crownrump length by 3–5 weeks was detected) in these patients. Five patients (group II) underwent pregnancy termination for medical reasons (teratogenic effects) after genetic counseling and the council at 13–21 weeks of pregnancy. The control group (group III) was represented by 15 healthy patients having no problems with their reproductive function, who were through artificial abortion by their own will at 8–11 weeks of pregnancy.
The chorionic and/or embryonic tissue (sample E) and decidual tissue (sample D) were collected from all the surveyed individuals during surgery. Moreover, blood was collected from the women’s cubital vein before surgery (sample C).
Proteins of the chorion of patients with frozen pregnancy and patients post artificial abortion were tested by electrospray ionization tandem mass spectrometry (LS-MS/MS) as previously reported [9]. Preparation of chorionic villi samples for further proteomic analysis involved protein extraction using the lysis buffer based on the 2% SDS (sodium dodecyl sulfate), ultrasonic treatment, 1DE-gel concentration procedure for SDS removal [9]. Protein levels in the chorionic villi extracts were determined using bicinchoninic acid as a reference sample [10].
Reduction, alkylation with iodoacetamide, and in-gel tryptic digestion were performed as previously reported [11]. The peptide mixture was analyzed using the Ultimate 3000 nanoflow HPLC system (Dionex, USA) integrated with the Orbitrap Q Exactive HF mass spectrometer (Thermo Scientific, USA) and the Nanospray Flex ion source (Thermo Scientific, USA) [9].
Mass spectra in the “.raw” format were converted into appropriate mgf files using the ProteoWizard MS Convert v. 3.0.6867 software tool (http://proteowizard.sourceforge.net). Files were imported to the SearchGUI platform (v. 3.3.17) [12] and analyzed using the X!Tandem and MS-GF+ search algorithms applied to the SwissProt database (v. 2.22.2022, FASTA format) for the species Homo sapiens. The search was performed in the database of inverted and random amino acid sequences (decoy). The PeptideShaker integrator [13] was used to produce the Excel spreadsheet files with the protein identification results.
To determine the relative content of proteins identified in the chorion, the normalized spectral abundance factor (NSAF) showing high reproducibility was used [14].
DNA was isolated from the samples C, E, D by the extraction method involving the use of organic solvents. The solution containing 0.04 M EDTA, 2% sodium lauryl sarcosylate, and 150 µm/mL RNAse A (Sigma, USA) was added to the samples for 45 min at 37 °C, treated with proteinase K (200 µm/mL, Promega, USA) for 24 h at 37 °C, extracted using the equal volumes of the phenol/chloroform/isoamyl alcohol mixture (25 : 24 : 1), phenol and the chloroform/isoamyl alcohol mixture (24 : 1). DNA was precipitated by adding 1/10 volume of 3 M sodium acetate (pH 5.2) and 2.5 volumes of ice-cold ethanol. Phenol was stabilized with 8-hydroxyquinoline. DNA was collected by centrifugation at 10,000 G for 15 min at 4 °C, washed with 70% ethanol (v/v), dried, and dissolved in water.
Ribosomal gene copy number determination: the number of rDNA copies in DNA was determined by non-radioactive quantitative hybridization (NQH) [5]. To detect human rDNA (GenBank sample No. U13369), the mixture of probes for the rDNA oligo(18S) biotin-CTGTAATGATCCTTCCGCAGGTTCACCTAC and oligo(18S) biotin-TATCGGTCTCGTGCCGGTATTTAGCCTTAG was used.
Denatured DNA was applied onto the filter (Optitran BA-S85, GE Healthcare, USA), 4–6 spots per sample. Standard genomic DNA samples (50 ng/ml) with the known rDNA content were applied onto the same filter to construct a calibration curve of signal intensity vs. rDNA copy number. The lambda phage DNA (50 ng/mL) was also applied onto the same filter to control the noise level. Then the filter was vacuum heated at 80 °C for 1.5 h. After the hybridization completion, the membrane filter was treated with the streptavidin-alkaline phosphatase conjugate (Sigma, USA) and placed in a solution of alkaline phosphatase substrates (bromo-chloro-indolyl-phosphate/ nitro blue tetrazolium, BCIP/NBТ). Then the filter was washed with water, dried in the dark, and scanned. The Imager 6 software allowing one to calculate the integrated intensity of signals from each point was used for rDNA quantification. Signals from all points corresponding to the same sample were summed up to calculate the mean and standard error for each sample.
No sample size calculation principles were used. Statistical analysis methods: descriptive statistics for quantitative variables are presented as the mean, median, and range of values. Pairwise comparison of samples was performed using the Mann–Whitney U-test (р). This test is best suited for our task due to small size of the samples used in the study. The test can be used to compare two groups, when there are at least three different trait values in each group. The StatPlus2007 software tool (http://www.analystsoft.com/) was used for calculation. The differences were considered significant at p < 0.05. The number of rDNA copies in the embryo’s genome was used to predict frozen pregnancy in patients. Our results are preliminary and descriptive due to the fact that the sample was small.
RESULTS
Infertility has become a global health problem, with the number of people suffering from this condition growing every year. The IVF procedure holds great promise for infertility treatment. However, the early embryogenesis is complex, since a number of processes take place in this phase, including the transition from mother to zygote. In humans, the early embryonic development can be complicated by genomic errors that occur after fertilization. The nuclear abnormalities found in human embryos, especially those resulting from IVF, are due to DNA damage, aneuploidy, and decreased developmental potential. Transcription and expression of certain genes in the embryo are through a number of changes in the early embryonic development phase [15].
Comparative proteomic analysis
The multicopy genes that encode ribosomal rRNAs (rDNA) determine the ribosome biogenesis and, therefore, protein biosynthesis levels in the body, especially in early embryogenesis [16, 17]. The comparative proteomic analysis can provide new insights into biological pathways underlying the spontaneous abortion pathogenesis. That is why in the first phase of the study we assessed the human chorionic protein profile alterations associated with frozen pregnancy by panoramic mass spectrometry. Among proteins identified, pregnancy-specific glycoproteins were determined (PSG; tab. 1). Human PSGs represent a group of molecules that are almost exclusively expressed by placental trophoblasts (chorionic villi) in pregnancy. Ten protein-encoding closely linked human PSG genes (PSG1–PSG9 and PSG11) form a subgroup of the carcinoembryonic antigen (CEA) gene family [18] (https://www.proteinatlas.org/humanproteome/tissue/placenta). CEA is an important tumor marker of colorectal and some other carcinomas [19]. We have also managed to identify all the CEA family subgroup members (tab. 1). Furthermore, in individuals with frozen pregnancy, there was a significant decrease in the levels (estimated based on NSAF values) of such glycoproteins, as PSG3 and PSG2; glycoproteins PSG7 and PSG4 were not detected in the chorionic tissue. The decrease in PSG7 levels during fetal development can result in pregnancy loss [20, 21]. Thus, our data showing that low PSG levels are associated with poor pregnancy outcomes are consistent with the results of other authors [18].
Moreover, alpha-L-fucosidase, which plays an important role in cell adhesion during attachment and detachment of fetal membranes, was detected in samples of the chorion after abortion [22]. We also detected the decrease in expression of the reticulon-4 protein (RTN4), which is involved in apoptosis (GO: 0006915), in cases of embryo loss. The RTN4 deficiency can result in such phenotypes, as “abnormal trophoblast layer morphology”, “fetal growth restriction”, “reduced fetal size”, and “embryonic lethality” [23].
In cases of embryo loss, PSG8, PSG7, PSG4 proteins were not determined in samples of the chorion; the pregnancyspecific beta-1-glycoprotein 6 (Q00889, PSG6) levels were low relative to the control (tab. 1).
Furthermore, in cases of frozen pregnancy, we revealed the decreased levels of such proteins, as coactosin-like protein (COTL1), protein canopy homolog 2 (CNPY2), sideroflexin-3 (SFXN3), prohibitin-2 (PHB2), and hyaluronan and proteoglycan link protein 1 (HAPLN1), in the chorionic villi compared to the control samples of the chorion. For example, HAPLN1 is essential for production of the cartilaginous proteoglycan aggregates having a broad spectrum of biological functions. The lack of HAPLN1 results in perinatal mortality associated with severe chondrodysplasia [24] and heart malformations [25].
Thus, comparative proteomic analysis allowed us to determine the low abundance proteins specific for pregnancy development that are characterized by considerable alteration of their content (decrease and/or lack) during early embryogenesis in cases of spontaneous abortion.
Genetic testing
As is known, the body’s level of protein biosynthesis is determined by ribosome biogenesis, the multicopy genes encoding ribosomal rRNA (rDNA), which represents one of the characteristics of adaptive capacity in humans. In this regard, in the next phase we determined rDNA in the maternal and fetal genomes during embryogenesis. Table tab. 2 presents experimental data that reflect the number of rDNA copies (R parameters) in DNA of blood leukocytes, decidua and the embryos’ DNA.
Group I (figure) — frozen pregnancy (No. 1–25) or suspected (visually) lack of the embryo (No. 20–25). Samples of the DNA isolated from blood cells and D tissue of the same women showed no differences in rDNA content (p > 0.05). This fact confirms the earlier reported data on equal rDNA levels in different tissues within the same body [26].
Embryonic genomes (tissue E) were dramatically different from maternal genomes (tissues C and D) based on the rDNA copy number. It is interesting to note that the differences were reported for five cases (No. 20–25), when the embryo could not be visually detected. It is likely that the cell division arrest occurred in early pregnancy. Among the maternal cells, there are most likely also cells from another organism. It is possible that the rDNA content in these cells is even lower.
As for the ratio of rDNA content in the embryo’s genome and maternal cells, all embryos in group I split into two subgroups. In subgroup Ia (n = 20), the R (E) parameter was 1.4–3.3 times lower (mean — 1.7 times; p < 0.001), than R (C or D). In subgroup Ib (n = 5), the R (E) parameter was 1.7–2 times higher (mean — 1.8 times; p = 0.02), than R (C). Thus, the embryo development arrest is associated with either too low, or too high rDNA content compared to the woman’s genome.
Group II — termination of pregnancy due to medical reasons (encephalocele, signs of congenital limb deformity, congenital CNS abnormality acrania, Edwards syndrome, and fusion of the pulmonary artery to aorta in the fetus). In this group, genomes of 4 embryos comprised more rDNA copies, than in group Ia (р = 0.01). The differences between groups IIC and IIE were non-significant. Only one embryonic DNA sample had lower rDNA content compared to the woman’s leukocyte DNA.
As for groups with healthy and frozen pregnancy, the most significant results were obtained for the embryos only (figure), and the threshold number of rDNA repeats was 322. In our study, lower rDNA content values were associated with the embryos’ death.
The analysis of the data reported allows us to draw the following conclusions
In cases of embryo loss, there are severe abnormalities of protein composition and the genes involved in pregnancy development in tissues of the chorion. Thus, low PSG levels were associated with poor pregnancy outcomes. In particular, in cases of frozen pregnancy no pregnancy-specific beta-1glycoprotein 7 (PSG7), the decrease in the levels of which during fetal development can result in pregnancy loss, was found in the chorion.
We also detected downregulation of the reticulon-4 protein (RTN4) involved in apoptosis (GO: 0006915) in cases of embryo loss. The RTN4 deficiency can result in such phenotypes, as “abnormal trophoblast layer morphology”, “fetal growth restriction”, “reduced fetal size”, and “embryonic lethality” [23].
The rDNA content is the same in blood cells and cells of the decidua of the same female body.
The threshold values of rDNA repeats in embryonic tissues (322) were determined, after the decrease of which pregnancy was terminated.
DISCUSSION
Frozen pregnancy is associated with severe imbalance of rDNA content in the embryonic and maternal genomes. In most cases, the embryo’s genome comprises significantly less rDNA copies, than the maternal genome and genomes of other embryos, the development of which has not been not spontaneously interrupted. Very low rDNA content in the genome is likely to be associated with the low number of ribosomes, which is incapable of ensuring the protein synthesis level appropriate for the development of a particular embryo. It has been previously shown that the low rDNA copy number in the human genome (less than 300 copies) is associated with the lower life expectancy and dementia in the elderly [7, 27]. Cystic fibrosis, a monogenic disorder caused by the CFTR gene mutation, is associated with the larger rDNA copy number in the affected individual’s genome. Schizophrenia, being a multifactorial disorder, is also associated with the increased rDNA content in the affected individual’s genome [27, 28]. It is likely that low amounts of rDNA in the genome do not allow the genome with genetic abnormalities to be realized, and embryogenesis is interrupted at an early stage. Only five chromosomes in the human genome contain the rDNA encoding ribosomal RNA, of which the ribosome is assembled with the help of certain proteins. The ribosome provides molecular machinery for the synthesis of all proteins in our body [17, 29]. The very high rDNA content in the embryonic genome compared to maternal one also negatively affects embryogenesis. There are two possible explanations for this fact. First, high ribosome biogenesis levels resulting from the larger number of rDNA copies in the embryo’s genome require a large amount of nutrients from the mother’s body. If the maternal genome contains a low number of rDNA copies, it will not be able to fulfill the needs of the embryo. Second, genomes with the larger number of rDNA copies can contain mutations that block embryogenesis at the later stages, but allow for early embryo development. Genomes with low rDNA amounts and this genetic abnormality are rejected in early embryogenesis.
CONCLUSIONS
Determination of the PSG8 (pregnancy-specific beta-1glycoprotein 8), PSG7 (putative pregnancy-specific beta-1glycoprotein 7), PSG6 (pregnancy-specific beta-1-glycoprotein 6), and PSG4 (pregnancy-specific beta-1-glycoprotein 4) content be useful for prenatal prediction of pregnancy course. In most cases (16 surveyed individuals out of 20), frozen pregnancy is associated with the low rDNA content in the embryo’s DNA relative to the control group (normal pregnancy). The low R parameter value shows that the cells are unable to ensure the ribosome biogenesis level appropriate for embryogenesis. Determination of specific proteins in the chorionic villi and rDNA copy number in the potential parents’ genomes with subsequent modeling of the rDNA copy number in the embryo can help determine the cause of infertility in married couples and predict the ongoing pregnancy course.