The pipeline contains a database for proteome searches, a SAAV quality checker, a query builder based on clinical, biological, and pharmacological information, and a retrieval and interpretation system. The platform consists of four analysis modules, available through the SAAVpedia web, stand-alone, and REST applications.
The SAAVidentifier module automatically detects, quality-controls, and assesses large numbers of SAAVs from peptide sequencing data in a single pass. Results are reported in the SAAV-calling format (SCF), a text file format that we designed to facilitate information exchange; it contains meta-information including the amino acid sequence and the protein and genomic positions of each SAAV.
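To make the kind of per-SAAV meta-information described above concrete, the following minimal sketch models one record as a tab-delimited text line. It is purely illustrative and is not the SCF specification: the class name `SaavRecord`, the field names, the column order, and the example values are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SaavRecord:
    """Hypothetical SAAV record; fields and column order are assumptions,
    not the actual SCF layout."""
    peptide: str       # amino acid sequence of the peptide carrying the variant
    protein_id: str    # protein identifier (placeholder naming)
    protein_pos: int   # 1-based residue position within the protein
    chrom: str         # chromosome name
    genomic_pos: int   # 1-based genomic coordinate

    def to_line(self) -> str:
        """Serialize the record as one tab-delimited text line."""
        return "\t".join([
            self.peptide,
            self.protein_id,
            str(self.protein_pos),
            self.chrom,
            str(self.genomic_pos),
        ])

    @classmethod
    def from_line(cls, line: str) -> "SaavRecord":
        """Parse one tab-delimited line back into a record."""
        peptide, protein_id, protein_pos, chrom, genomic_pos = (
            line.rstrip("\n").split("\t")
        )
        return cls(peptide, protein_id, int(protein_pos), chrom, int(genomic_pos))


# Made-up placeholder values, used only to show the round trip.
example = SaavRecord("PEPTIDEK", "PROT0001", 42, "1", 12345)
assert SaavRecord.from_line(example.to_line()) == example
```

Keeping each SAAV on a single delimited line, in the spirit of a plain text file format, lets records be streamed and filtered with ordinary text tools.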
The SAAVannotator module performs pharmacological, clinical, and biological annotation based on the genomic and proteomic locations of the identified SAAVs, e.g., annotating phenotype-variant relationships, post-translational modifications (PTMs), and variant or gene database identifiers (IDs) such as those from dbSNP and Ensembl.
The SNVretriever module searches for SAAVs located at the same positions as the input genomic nsSNVs and returns the variant list along with associated information such as diseases and drugs. In contrast, the SAAVretriever module enables querying and navigating SAAVs that were identified from proteomic data sets across various phenotype conditions.
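The location-based lookup described for the SNVretriever module can be sketched as a simple index keyed by genomic coordinate. This is a minimal illustration of the general idea, not the SAAVpedia implementation: the function names `index_by_locus` and `retrieve_for_snvs`, the dictionary keys, and the toy records are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Locus = Tuple[str, int]  # (chromosome, 1-based genomic position)


def index_by_locus(saavs: Iterable[dict]) -> Dict[Locus, List[dict]]:
    """Group SAAV records by genomic locus (hypothetical 'chrom'/'pos' keys)."""
    index: Dict[Locus, List[dict]] = defaultdict(list)
    for saav in saavs:
        index[(saav["chrom"], saav["pos"])].append(saav)
    return dict(index)  # plain dict, so lookups never create empty entries


def retrieve_for_snvs(
    snvs: Iterable[Locus], index: Dict[Locus, List[dict]]
) -> Dict[Locus, List[dict]]:
    """Return the SAAVs that share a locus with each input nsSNV."""
    return {locus: index[locus] for locus in snvs if locus in index}


# Toy, made-up records; the annotation values are placeholders.
saavs = [
    {"chrom": "1", "pos": 100, "disease": "disease_A", "drug": "drug_X"},
    {"chrom": "2", "pos": 250, "disease": "disease_B", "drug": "drug_Y"},
]
hits = retrieve_for_snvs([("1", 100), ("3", 999)], index_by_locus(saavs))
assert list(hits) == [("1", 100)]
```

Building the index once and querying it per variant keeps each lookup constant-time, which matters when the input nsSNV list is large.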
The SAAVvisualizer module provides statistical summary plots based on the results of the SAAVannotator, SNVretriever, and SAAVretriever modules.