SVOC takes an unannotated VCF file as input and automatically invokes ANNOVAR to generate necessary annotations, including refGene, dbnsfp47a, and dbscsnv11. Concurrently, SVOC automatically employs TransVar to derive annotations such as transcript, genomic coordinates, transcript-dependent cDNA, protein coordinates, and variant consequence. It utilizes MANE v1.4 (Matched Annotation from NCBI and EMBL-EBI) to annotate canonical transcripts, specifically the MANE Select transcript. However, for certain genes, the MANE Select transcript alone may not be sufficient to report all known variants. This is particularly relevant in genes with mutually exclusive exons, where both exons harbor clinically significant variants. To ensure comprehensive representation, a MANE Plus Clinical transcript is added alongside the MANE Select transcript.
We expect that the versions of annotation databases will be updated in a regular basis, given the continuous development of various databases. SVOC will then take into account the information as well as some criteria listed below to make the final classification.
According to the 2022 ClinGen-CGC-VICC guidelines, there are five types of evidence to classify the oncogenicity of somatic variants, including population data, functional data, predictive data, cancer hotspots and computational evidence.
Based on these evidence, variants are classified into five categories: Oncogenic, Likely Oncogenic, Variants of Uncertain Significance (VUS), Likely Benign and Benign. There are a total of 17 criteria: 3 criteria for the population data (SBVS1, SBS1 and OP4), 2 criteria for the functional data (OS2 and SBS2) , 7 criteria for the predictive data (OVS1, OS1, OM1, OM2, OM4, OP2, SBP2), 3 criteria for the cancer hotspots (OS3, OM3 and OP3), and 2 criteria for the computational evidence (OP1 and SBP1). The prefix "OVS" represents "Oncogenic Very Strong", corresponding to 8 points; "OS" represents "Oncogenic Strong", corresponding to 4 points; "OM" represents "Oncogenic Moderate", corresponding to 2 points; "OP" represents "Oncogenic Supporting", corresponding to 1 point; "SBVS" represents "Somatic Benign Very Strong", corresponding to -8 points; "SBS" represents "Somatic Benign Strong", corresponding to -4 points; and "SBP" represents "Somatic Benign Supporting", corresponding to -1 point. SVOC assigns the corresponding points when a criterion is met; otherwise, 0 points are assigned.
SVOC automatically scores all 17 criteria and calculates the total score of the variant based on the applicable rules defined in the guidelines (e.g., OM1 cannot be used if OS1 or OS3 is applicable). The oncogenicity category is then assigned according to the total score: ≥10 points for Oncogenic, 6–9 points for Likely Oncogenic, 0–5 points for VUS, -1 to -6 points for Likely Benign, and ≤-7 points for Benign.
Currently, Single Nucleotide Variants (SNVs) and Insertion/Deletion variants (Indels).
We acknowledge that inversions, gene fusions, and gene expression alterations may play important roles in cancer development and progression. Therefore, when specific guidelines for these types of mutations become available, we will promptly incorporate the corresponding functionalities.
Classification Summary page displays the classification results for user-queried variants.
The Statistics module at the top provides an overview: Donut Plot visually represents the proportion of variants across different oncogenicity classifications; Bar Plot details the variant count per gene and their oncogenicity distribution.
The Table below lists all variants with their location and classification information. The STATUS indicates whether the classification is 'Automated' (system-generated) or 'Adjusted' (manually modified by the user). Use the search box in the table's top-left corner to filter table content globally. To review or adjust how a specific variant meets the 17 classification criteria, click its corresponding Details & Adjust link. This will navigate you to the Classification Details page.
For a detailed description of the evidence codes, please refer to the publication of ClinGen/CGC/VICC guidelines.
SVOC utilizes the representative transcript from MANE v1.4 (Matched Annotation from NCBI and EMBL-EBI), specifically the MANE Select transcript. However, for certain genes, the MANE Select transcript alone may not be sufficient to report all known variants. This is particularly relevant in genes with mutually exclusive exons, where both exons harbor clinically significant variants. To ensure comprehensive representation, a MANE Plus Clinical transcript is added alongside the MANE Select transcript. For further details, refer to the following resources:
MANE v1.4 with MANE Select for Non-Coding GenesClassification Details page displays the fulfillment status of the 17 criteria for a specific variant, along with reviewable text evidence for each met criterion. It supports users in interactively refining results using their own internal databases.
Users can adjust which criteria a variant meets by selecting or deselecting the radio buttons or checkboxes next to each standard. When changes are made, the system will prompt for supporting information. After the user provides and submits this information, the classification result will update automatically.
Or, users can directly modify the final classification conclusion using the dropdown menu at the bottom of the page. If the selected classification contradicts the variant's classification based on scoring criteria, the system will prompt the user to either reselect or confirm the change. Upon confirmation, the system will automatically update and apply the new classification.
Yes. You can download results from SVOC through clicking the button 'download results to CSV' on the 'Classification Summary' page.
No. Users' personalized adjusted results are solely accessible to the account holder and not available to other users. This ensures both data security and preserves the integrity of each user's research.
No. All files uploaded by users along with any resultant output files are retained for a period of 24 hours. Upon expiration of this retention period, these files will be automatically purged from the server.
SVOC's classifications and evidentiary supports are generated entirely through automated pipelines. This automation eliminates human error and bias, ensuring efficient and reproducible classification across the human genome. Therefore, no manual work is required during the annotation process. While fully automated, SVOC also provides reviewable text evidence for each criterion met by any given variant and supports users in interactively refining results using their own internal databases.
SVOC stands out due to its genome-wide coverage powered by expert-defined parameters, enabling systematic classification of both known and novel variants. It outperforms competitors and offers user-friendly features like high-throughput queries and interactive criteria adjustment, bridging the gap between theoretical frameworks and cancer research applications.
In a benchmarking analysis involving 94 variants with expert-curated classifications, SVOC achieved a Matthews Correlation Coefficient (MCC) of 0.70—outperforming ClinVar-germline (0.46), OncoKB (0.38), InterVar (0.34), VIC (0.06), and CancerVar (0.04). For variants reported in OncoKB and ClinVar, SVOC demonstrated high concordance with OncoKB (88.1%) and ClinVar somatic submissions (84.2%), confirming its accuracy for known variants. Additionally, SVOC’s classifications showed strong alignment with results from saturation genome editing (SGE), achieving over 96% consistency for oncogenic calls and 97% for benign calls, underscoring the algorithm’s robustness for both established and novel variants.
SVOC adheres to consensus parameters from prominent expert panels. These include ClinGen/CGC/VICC Guidelines Working Group, ClinGen’s Variant Curation Expert Panels (VCEP), the Somatic Variant Interpretation (SVI) Working Group, and Chinese expert panels.
Currently, users are not able to modify the parameters of SVOC online. Instead, since SVOC is open-source, advanced users with relevant technical capabilities can obtain the source code and deploy the SVOC tool on their own computing resources if they wish to explore custom configurations. Source code avaliable at: https://github.com/leqingsang/SVOC.
No. Currently, SVOC is specifically designed for and can only be used in the human genome. Our development efforts are focused on enhancing its performance and coverage for human somatic variants.
The thresholds, parameters, and functional data utilized in SVOC are tailored specifically for somatic variant classification. As such, SVOC is not designed for germline annotation, and we do not recommend its use for this purpose.
SVOC has not been validated in clinical scenarios. Therefore, we do not recommend using SVOC directly in clinical settings. If users intend to reference SVOC results for clinical decision-making, we highly recommend that they conduct a thorough manual review with the help of clinical and genomic experts.