Resources - MSCypher

Resources - MSCypher



MSCypher is a command-line workflow currently running on Microsoft Windows OS platforms. The input to MSCypher are MGF files which can be generated by Maxquant and the APLToMGFConverter program.

MSCypher’s novel contribution to the field is better support for identifying chimeric (co-fragmented) peptide features which are common in complex datasets.

The workflow currently supports the following steps:

  1. Customised sequence database creation using the MSPnr100 sequence database as input. Any FASTA sequence database can be created by specifying multiple taxonomic identifiers by species. These sequence databases are written to disk for future use.
  2. DeNovo peptide sequencing using the PepNovo program. All denovo sequences are written into the combined MSCypher table for comparison with database searching output.
  3. Database searching using the Digger search algorithm which is fast and sensitive.
  4. Retention time prediction using the program Elude.
  5. Machine learning to classify peptide identifications. Current tools are RandomForest, Percolator and Newbee (a newly developed tool in our laboratory). The RandomForest classifier is the default tool in this category.
  6. Protein Inference at 1%FDR.
  7. Quantitative proteomics at the peptide and protein level supporting spectral counting via NSAF and intensity based quant and statistics using MSstats and Limma.
  8. Visualisation and support for other tools such as Protter.

Download MSCypher and associated example data. Please e-mail Eugene Kapp for assistance.