OPINION
Strategies of RT-PCR-based assay design and surveillance of SARS-CoV-2
1 N.F. Gamaleya Research Institute of Epidemiology and Microbiology, Moscow, Russia
2 Lomonosov Moscow State University, Moscow, Russia
Correspondence should be addressed: Nadezhda A. Kuznetsova
Gamalei, 16, str. 1, Moscow, 123098; moc.liamg@0avostenzukaydan
Author contribution: the autors equally contributed to the manuscript.
The emergence and rapid spread of the novel coronavirus SARS-CoV-2 has sparked the pandemic of COVID-19 [1]. Over 2 billion confirmed cases and more than 150, 000 deaths were reported within less than 3 months after the infection found its way out of China [2]. In the absence of specific treatments and vaccines, quarantine and lockdown are seen as the only available containment measure [3]. Since the virus is capable of presymptomatic transmission, molecular diagnostic techniques based on the amplification of nucleic acids have become the primary tool for monitoring its spread [4]. China, Singapore, South Korea, and Germany succeeded in implementing the broad testing strategy using reverse-transcription polymerase chain reaction (RT-PCR) kits and thus were able to timely identify individuals infected with SARS-CoV-2; these countries seem to have gained control of the epidemic and saved medical resources for treating critically ill patients. By contrast, Italy, Sweden and USA, who were late to adopt the broad testing strategy or initially denied its efficacy, are now facing a truly grave situation.
Molecular methods based on the amplification of nucleic acids boast high sensitivity and high specificity; they can detect viral RNA in both severely ill patients and asymptomatic individuals and thus significantly contribute to stemming the spread of the virus [5, 6]. Immediately after the first complete SARS-CoV-2 genome sequences were obtained, Chinese researchers and WHO released primer and probe sequences for PCR-based virus detection; in the months that followed, thousands of SARS-CoV-2 whole genomes were sequenced and new RT-PCR assays were developed. In this article, we talk about the main approaches to designing PCR assays and some of their specific characteristics.
In diagnostic tests, proper sample collection techniques and adequate, informative samples are essential to valid and reliable results. Knowledge of viral tropism is critical to choosing the type of the specimen to be collected. The efficacy of RT-PCR for SARS-CoV-2 detection depends on the specimen source and the applied sampling technique. It is reported that bronchoalveolar lavage fluids have the highest diagnostic value in terms of SARS-CoV-2 detection, followed by sputum, nasal swabs, fibrobronchoscope brush biopsy, pharyngeal swabs, feces, and blood (1%) [7]. Still, nasal and oropharyngeal swabs remain the most available and informative specimen type used in screening tests for SARS-CoV-2 [8, 9]. When properly performed, swabbing allows obtaining good quality samples and is safe for the medical staff [10].
Real-time PCR with fluorescent hybridization probes (real-time PCR) is the primary molecular genetic technique for SARS-CoV-2 detection
[6, 11, 12]. It is widely available, highly sensitive and specific. PCR assay kits are instrumental in implementing mass screening aimed at detecting infected individuals and quantifying viral loads in each patient.
Approaches RT-PCR assay design
So far, the oligonucleotides and real-time RT-PCR kits for SARS-CoV-2 detection have been described in a few dozens of publications by international authors. In those studies, several different approaches can be identified to designing real-time RT-PCR assays for SARS-CoV-2 detection. Singleplex PCR assays, in which oligonucleotides are selected to target only one specific gene, are the simplest and the most available. Multiplex assays are more advanced and allow targeting a number of different genes simultaneously. Primers and probes for multiplex assays can have different specificity or enable discrimination between SARS-CoV-2 and the related coronaviruses or other respiratory infections. For SARS-CoV-2 detection, primers and probes are usually selected to target the nucleocapsid genes N1 and N2, the RNA-dependent RNA polymerase gene (RdRP) and the E protein gene of the viral envelope. For example, CDC (Centers for Disease Control and Prevention) recommends that identification of COVID-19 patients should start with a screening test for the E protein gene, whose nucleotide sequence does not differ from that of SARS, and then proceed to differentiating SARS from SARS-CoV-2 using oligonucleotides for the RdRP target [13]. According to the WHO protocol, the collected samples should be screened for N and Orf1b. However, the proposed oligonucleotides do not help in discriminating between SARS-CoV-2 and SARS; therefore, sequencing is advised to finalize the identification procedure [14]. tab. 1 features publicly accessible primer and probe sequences for SARS-CoV-2 detection recommended by WHO and CDC.
Characteristics of existing RT-PCR assays for SARS-CoV-2 detection
Due to high demand, over 10 different commercial kits for SARS-CoV-2 detection have been launched on the Russian market; some of them have already received a medical device registration certificate (:tab_2);. Singleplex kits have higher sensitivity (up to 500 GE/ml) than their multiplex counterparts (1,000 to 10,000 GE/ml), whereas multiplex kits targeting several SARS-CoV-2 genes are more specific and help to avoid false-negative results associated with the variability of the virus resulting from mutations at the oligonucleotide binding site. Of note, results generated by multiplex kits are sometimes difficult to interpret due to the insufficient optimization of the oligonucleotide sequence. Nevertheless, all Russian manufacturers claim the sensitivity of their kits to be 1,000 GE/ml.
Importantly, an internal control should be included in the kit, regardless of the number of specific targets. The internal control can be endogenous (human DNA) or exogenous (e.g., an RNA phage). It is used to control all stages of the protocol, from nuclear acid extraction to amplification.
Although the internal control is necessary, not every assay has it. For example, it is not found in the kits based on isothermal amplification, including loop-mediated isothermal amplification (LAMP). Such assays are advantageously fast (they take no longer than 40 min), do not require sophisticated instrumentation and can be used as point-of-care tests outside the lab, as no thermocycler is needed. LAMP-based assays boast a sensitivity of up to 1–3 RNA copies per reaction [15]. However, the actual sensitivity of currently available commercial LAMP assays is lower than claimed (tab. 2).
Whole-genome sequencing
Among all techniques for molecular genetic analysis, sequencing still has the highest informative value. In the current pandemic caused by SARS-CoV-2, the number of complete genomic sequences of the virus obtained within very short time is record-breaking. Whole-genome sequencing has never been so close to adoption in the clinical setting as it is now. There are a few approaches to whole-genome sequencing of SARS-CoV-2. The classic approach consists in the extraction of nucleic acids from nasopharyngeal and/or oropharyngeal swabs, subsequent depletion of the host’s ribosomal RNA for library preparation and sequencing itself carried out according to the protocols supplied by the manufacturer. However, this approach requires a fair amount of viral RNA and good read depth. With low viral loads, the virus can be replicated using cell cultures. For that, serial passages are performed in Vero V, Vero E6, LLC-MK2, and some other cell lines. This approach has been successfully used in some laboratories, including the Reference center for coronavirus infection (GISAID ID: EPI_ ISL_421275).
Whole-genome amplification is an alternative to cell cultures. So far, a few panels have been designed for sequencing the entire genome of SARS-CoV-2. Among them is the Ion AmpliSeq SARS-CoV-2 Research Panel (Thermo Fisher Scientific; USA). It consists of two primer pools for the amplification of 125–275 bp-long fragments [16].
Another panel was developed by Paragon Genomics Inc (USA). It is a multiplex PCR research panel with two primer pools and an average amplicon size of 99 bp [17]. The panel can potentially detect 1.15 viral copies at 95% probability. Using two overlapping pools of primers will ensure full coverage of the entire viral genome, with a calculated detection limit of 0.29 copies at 95% probability. So far, there is no data on the actual SARS-CoV-2 detection limit for the AmpliSeq SARS-CoV-2 Research Panel.
A new protocol for sample preparation and bioinformatic analysis was proposed by the ARTIC network [18]. It was developed for Oxford Nanopore sequencing platform and generates results within 8 h.
CONCLUSIONS
Methods based on the molecular genetic analysis of nucleic acids are instrumental in the surveillance and monitoring of SARS-CoV-2 spread and help to contain the COVID-19 pandemic. Their primary advantage over thermometry or evaluation of symptoms is the ability to detect asymptomatic carriers or infected presymptomatic individuals. In spite of a plethora of designs, classic RT-PCR is still the preferred detection technique. Refinement of isothermal amplification tools will make molecular analytical techniques more accessible in the future and improve their efficacy in monitoring and controlling biological threats. Sequencing allows accumulating more data about changes occurring in the viral genome and using it for RT-PCR primer optimization, vaccine development, study of the evolution of the virus, and reconstruction of epidemiological processes that drive the epidemic. Sequencing platforms make it possible to analyze collected samples in the clinical setting, outside the lab, thereby reducing the turnaround time.