
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (CC BY).
ORIGINAL RESEARCH
Genetic polymorphism of the NF-kB1 р105/р50 processing region in pulmonary tuberculosis
Kemerovo State Medical University, Kemerovo, Russia
Correspondence should be addressed: Alina V. Meyer
Voroshilov, 22а, Kemerovo, 650056, Russia; ur.xednay@anila-opahs
Funding: the study was conducted within the framework of the basic budgetary funding source for project of the State Assignment of the Ministry of Health of the Russian Federation (agreement No. 056-03-2023-050 dated 17.01.2023).
Author contribution: Meyer AV, Lavryashina MB — developing the study concept and design, manuscript writing and final approval; Thorenko BA — genotyping, statistical analysis, working with the database; Imekina DO — genotyping, statistical analysis, technical editing; Dutchenko AP — search for literature, manuscript writing; Pyanzova TV — research management, clinical and biological material collection, final approval of the manuscript; Karabchukov KB — clinical history taking.
Compliance with ethical standards: the study was approved by the Ethics Committee of the Kemerovo State Medical University (protocol No. 301 dated 08 February 2023) and conducted in accordance with the ethical principles stated in the WMA Declaraction of Helsinki; the written informed consent to participation in the study was obtained from all patients.
Pulmonary tuberculosis (TB) is a pressing global issue of theoretical medicine and practical healthcare [1]. High incidence and mortality rates persist in a number of countries, which are caused by drug resistance of the pathogen, high prevalence of HIV infection among tuberculosis patients, low patient adherence to treatment, and the genetically determined features of the response to infection and treatment [2–5].
Diverse studies, including the analysis of genomes of Mycobacterium tuberculosis (M. tuberculosis) and the host, are conducted in order to investigate the genetic component value. One avenue in this field is the search for associations between the fact of carrying certain genotypic and allelic variants of genes and the specifics of human body’s response to M. tuberculosis infection and anti-tuberculosis therapy. Despite a significant amount of data accumulated [6–9], there is still ambiguity and even some inconsistency of the results obtained, which determines the need for further research in this field.
Considering current ideas about the key immune mechanisms ensuring recognition of M. tuberculosis and subsequent destruction of the pathogen [10, 11], intracellular signaling pathways and molecular cascades involved in differential expression of the genes engaged in immune responses and ensuring regulation of inflammation as the body’s systemic protective response are of great interest for scientific community [12–14]. These include the NF-kB signaling pathway, activation of which results in stimulation of inflammation via enhanced biosynthesis of TNFα, IFNγ, IL6, IL8 pro-inflammatory factors and other cytokines [15].
The NF-kB1 transcription factor (nuclear factor kappalight-chain-enhancer of activated B cells) is represented in the cell by the full-length precursor protein (р105) and its processed form (р50). NF-kB1/р50 as part of the complex with р65 (RelA) is a transcription activator, while NF-kB1/р105 in the form of homodimers (or together with IkB, the NF-kB inhibitor) functions as a suppressor of this process. Therefore, the р105→р50 processing modification can affect the NF-kB pathway direction and effectiveness, while intracellular р105/р50 balance can determine adequacy of the cellular response to activation signals, thereby contributing to the tuberculosis pathogenesis.
The Ub-independent р105→р50 posttranslational processing involving the 20S proteosome is currently considered as the main mechanism underlying generation of NF-kB1/р50 [16]. The endoproteolysis region is long enough and includes amino acid bases (AA) 430–530. At the gene level this region covers the exon region (Е) Е13 (403–433 AA) — Е15 (499–546 AA) with the overall length of 2771 bp. The study aimed to assess associations with tuberculosis of the light NFKB1 gene allelic variants localized within the NF-kB1 р105→р50 processing zone based on the SNP panel.
METHODS
The study involved total DNA extracted from blood samples of the patients of the Kopylova Kuzbass Clinical Phthisiopulmonology Medical Center (Kemerovo) with pulmonary tuberculosis (TB group, n = 93) and the population control group (PC group, n = 96) represented by the sample of residents of the city of Kemerovo and Kemerovo Region. The group of patients with pulmonary tuberculosis included 74 males and 19 females aged 26–88 years diagnosed for the first time (n = 78) and having relapses (n = 15). Clinical forms were represented by infiltrative (n = 49), disseminated (n = 23), focal (n = 6), fibrous-cavernous (n = 5) pulmonary tuberculosis, tuberculoma (n = 7), pleurisy (n = 3). Both groups were formed considering ethnicity (based on self-determination), demographic and medical history data.
Inclusion criteria for the TB group: established diagnosis of incident tuberculosis or tuberculosis relapse; age over 18 years. Exclusion criteria: HIV infection; fact of detecting antibodies against hepatitis C virus (HCVAg) and hepatitis В virus (HBsAg) in blood serum; refusal of participation in the study. All the patients were assessed in accordance with the standard regulated by current clinical guidelines. A complex of clinical, laboratory, and instrumental testing data was used to verify the diagnosis. The diagnosis of tuberculosis was established by the central medical board of the Kopylova Kuzbass Clinical Phthisiopulmonology Medical Center. DNA was extracted from biological samples by the phenol-chloroform extraction method. Genotyping based on rs4648050, rs4648051, rs4648055, rs4648058, rs4648068, rs1609993 was performed by real-time PCR in the Applied Biosystems QuantStudio 5 Real-Time PCR System (Thermo Fisher Scientific, USA) using the commercially available kits (DNA-Synthesis, Russia). According to the manufacturer’s instruction, the amplification programs had the following settings: initial denaturation for 3 min at a temperature of 95 °C; 40 cycles of primer annealing at a temperature specific for each polymorphism (54–59 °C); chain elongation at a temperature of 72 °C, and denaturation at a temperature of 95 °С. The amplification reagent mixture composition was as follows: DNA of the studied sample — 1 µL, Taq DNA polymerase — 0.5 µL, 10× buffer for Taq DNA polymerase — 2.5 µL, primer mixture (forward F, reverse R) — 2.5 µL, solution of four dNTP — 1 µL, fluorescent labeling probes TaqMan (FAM, VIC) — 1 µL each, deionized water — to the total mixture volume of 25 µL. Primary results were subjected to standard analysis using the resources of the Statictica, SNPStats, Arlequin software packages. Genotypic and allelic frequencies were calculated. The Hardy–Weinberg equilibrium was assessed using the Pearson’s chi-squared (χ2) test (χ2X-W). The analysis of associations of the candidate genes’ polymorphic variants was conducted based on the odds ratio (OR) considering the confidence interval (CI) for the odds ratio (95% CI). The null hypothesis was rejected when p-value was below 0.05.
RESULTS
The NFKB1 gene (HGNC:7794) located in 4q24 with the length of 115.973 kbp (GRCh38: CM000666.2 – 4: 102,501,330102, 617,302) contains 27 exons (Е). The SNP (Single Nucleotide Polymorphism) panel for the study was generated considering the following: 1) SNP location within the processing zone — the region of exons Е13-Е15 was used as a target, together with the adjacent introns (I) — I12, I15; 2) minor allele frequency (MAF) in the population of at least 0.1. Information was taken from the Ensembl genome browser (http://www.ensembl.org), and the NCBI data were used (https://www.ncbi.nlm.nih.gov/).
The total number of SNPs in the NFKB1 gene is 41,781. The number of SNPs found in the populations with the minor allele frequency (MAF) exceeding 0.1 was 152. After selecting polymorphic variants in the target region I12-E13-I13-E14-I14E15-I15 with the length of 7325 bp (102,593,569–102,600,894 bp) a total of five variants located in introns were identified, along with one exonic variant. The characteristics of the SPN panel generated are provided in tab. 1.
The data characterizing frequencies of the alternative allele based on the panel of six SNPs (rs4648050, rs4648051, rs4648055, rs4648058, rs4648068, rs1609993) in the gene NFKB1 in patients with TB and in the PC group, along with the indicators of the genotypic frequency equilibrium (χ2X-W), as well as the data on the alternative allele frequencies in the global (Global) and European (EUR) populations are provided in tab. 2.
Determination of the state of genotype frequency equilibrium in the studied samples has shown the following. Values of the χ2X-W parameter in the PC group suggest that there was no significant deviation from the Hardy–Weinberg equilibrium throughout the entire studied SNP panel in the gene NFKB1. In the sample of TB patients, genotypic frequency deviation (p < 0.05) was reported based on two SNPs (rs4648068 and rs1609993): χ2X-W was 5.06 and 9.15, respectively.
To analyze specifics of the gene pool of Russians of Siberia, we performed pairwise comparison of the rs4648050, rs4648051, rs4648055, rs4648058, rs4648068, rs1609993 allele frequencies in the gene NFKB1 in the studied sample of Russians of the Kemerovo Region (Kuzbass; PC group) with the available data on the global and European populations. The results obtain reflect the features of the genetic profile of the Russian population of Siberia compared to global frequencies and the frequencies typical for populations of Europe based on rs4648050 (p < 0.05), as well as in terms of matching with the global population based on rs4648051 (p < 0.05). As for other studied SNPs, there were no significant differences in allelic frequencies in the PC group.
Comparison of the nature of allelic frequency distribution in the TB sample and PC group revealed a significant difference (p < 0.05) based on rs4648068, for which the χ2 value was 3.86. The fact attracts attention that comparison of allelic frequencies in the sample of TB patients with the frequencies in the global and European populations demonstrates specifics throughout the entire SNP complex, except for rs1609993. This suggests that the increase in sample size with make it possible to reveal a broader range of associations between the studied SNPs and TB in the future.
Frequencies of the rs4648050, rs4648051, rs4648055, rs4648058, rs4648068, rs1609993 genotypes in the gene NFKB1 and the results of assessing the association of the generated SNP panel with TB are provided in tab. 3.
Comparison of genotypic frequencies revealed significant differences for two SNPs: rs4648068 (p = 0.03) and rs4648055 (p = 0.05). As for rs4648068, as mentioned above, we have shown significant differences based on the data of comparing the allelic frequencies as well. The study demonstrates the increased frequency of the homozygous GG variant comprising the alternative allele. As for rs4648055, in this case the sample of TB patients also shows higher frequency of the homozygous genotype comprising the alternative allele, АА. Both genotypes, GG *rs4648068 and АА*rs4648055, are considered as the etiological fraction genotypes, i.e. the genotype carrier state is associated with the increased susceptibility to developing TB in case of mycobacterial infection. The study has determined a significant correlation with TB of the genotypic variants АА*rs4648055 (OR = 2.51; p = 0.05) and GG*rs4648068 (OR = 2.16; p = 0.03).
DISCUSSION
The results of matching the alternative allele frequencies to the cumulative data by projects (ALFA dataset, https://ncbiinsights.ncbi.nlm.nih.gov/2020/03/26/alfa/) (tab. 2) indicate the differences between the values obtained in this study for the population groups of the Siberian region relative to both global values (Global) and Caucasoid populations (EUR) throughout the range of studied SNPs, except for rs1609993. This suggests the features of the “genetic portrait” of the Russian population of Siberia related to the studied complex, which should be considered when conducting meta-analysis, arranging association studies, and determining the range of informative TB biomarkers.
In this study, significant associations with the risk of developing TB were reported for intronic variants rs4648055, rs4648068 (tab. 3). In case of variant rs4648055, the alternative allele A in homozygous state increases the risk of disease 2.5-fold, while the 2-fold increased disease risk reported for rs4648068 results from the presence of alternative variant of allele G in homozygous state in the genotype. The trend towards statistical significance for protective effects of the major genotypes GG*rs4648055 (OR = 0.55; p = 0.07) and АА*rs4648068 (OR = 0.54; p = 0.07) can also be noted.
It should be noted that to date the literature data on the rs4648055 and rs4648068 contribution to the development of disorders are scarce, and the results are ambiguous. A number of papers report that there are no significant associations of rs4648055 and rs4648068 with the development of such disorders, as lung cancer [17] and coronary artery disease [18]. At the same time, when studying head and neck cancer in residents of Pakistan infected with HPV, the role of allele G in heterozygous and homozygous states in the increased risk of developing cancer was reported for rs4648068 [19]. When studying the risk factors of developing gastric cancer in the population of Han Chinese, significant results were reported for allele G of rs4648068 (OR = 1.43; p = 0.0001) [20]. Frequencies of genotype distribution in the control group were as follows: АА — 27.71%, AG — 53.58%, GG — 18.71%; in the group of patients — 22.94%, 45.64%, 31.42%. The alternative allele G frequency in the comparison group was 0.455, and in the cancer group it was 0.542, which is similar to the data obtained in our study for the TB group (0.538). Similar results were reported when studying the rs4648055 and rs4648068 contribution to ovarian cancer development in the Chinese population, and significant associations were also reported only for allele G of rs4648068 (OR = 1.38; p = 0.001), frequency of which in the group of patients was 0.532, and in the comparison group it was 0.454 [21].
CONCLUSIONS
The following aspects can be considered as the main findings of the study. First, new research data on the frequencies of rs4648050, rs4648051, rs4648055, rs4648058, rs4648068, rs1609993 in the gene NFKB1 in Russians of Siberia were obtained. These data make it possible to draw a conclusion about the specifics of genetic structure of the Russian population that should be considered when conducting association studies. Second, the statistically significant differences of the rs4648050, rs4648051, rs4648055, rs4648058, rs4648068 allelic frequencies in the sample of tuberculosis patients from the global frequencies and frequencies typical for populations of Europe suggest potential informational value of the generated SNP panel, which requires further research. Third, the detection of associations between susceptibility to TB and homozygous genotypes based on the alternative allele for rs4648055 and rs4648068 is indirect evidence in favor of modifying effects of genetic polymorphism, SNPs located within the processing zone in the gene NFKB1, and its possible contribution to the р105→р50 processing effectiveness, balance of the NF-kB1 gene products (р105/р50), and regulation of target gene expression. This assumption requires further research based on the analysis of the р105 and р50 levels with parallel assessment of transcription activity of the target genes of this transcription factor. Identification and analysis of the features of molecular mechanisms at the population and individual levels not only provide detailed understanding of molecular mechanisms underlying the TB pathogenesis, but also contribute to improvement of diagnostic procedures, including those involving prediction of disease progression, as well as to improvement of therapeutic strategies and the search for the ways to develop new drugs.