./dripTrain.py [options] --psm-library <PSM library>
--spectra <spectra file>
DripTrain learns the model parameters for DRIP via the expectation-maximization algorithm utilizing a library of high-confidence PSMs (such as this PSM library and the corresponding set of spectra). The learned parameters may then be used in dripSearch. If you use DRIP trained parameters in your research, please cite:
John T. Halloran, Jeff A. Bilmes, and William S. Noble. "Learning Peptide-Spectrum Alignment Models for Tandem Mass Spectrometry". Thirtieth Conference on Uncertainty in Artificial Intelligence (UAI 2014). AUAI, Quebic City, Quebec Canada, July 2014.
--psm-library <PSM library>– Collection of high-confidence peptide-spectrum matches (PSMs). File must be in tab-delimited format with fields
--spectra <spectra file>– Corresponding ms2 spectra for the PSM library.
The program writes the learned parameters
dripLearned.params by default. The name of the
output file can be set by the user using
--dripTrain-file <string>– Name of GMTK output file. Default =
--output-mean-file <string>– Name of output file for learned Gaussian means. Default =
--output-covar-file <string>– Name of output file for learned Gaussian covariances. Default =
--high-res-ms2 <T|F>– boolean, whether the search is over high-res ms2 (high-high) spectra. When this parameter is true, DRIP uses the real valued masses of candidate peptides as its Gaussian means; for low-res ms2 (low-low or high-low), the observed m/z measurements are much less accurate so these Gaussian means are learned using training data. Default =
--mods-spec <string>– The general form of a modification specification has three components, as exemplified by 1STY+79.966331.
--cterm-peptide-mods-spec <string>– Specify peptide c-terminal modifications. See nterm-peptide-mods-spec for syntax. Default =
--nterm-peptide-mods-spec <string>– Specify peptide n-terminal modifications. Like --mods-spec, this specification has three components, but with a slightly different syntax. The max_per_peptide can be either "1", in which case it defines a variable terminal modification, or missing, in which case the modification is static. The residues field indicates which amino acids are subject to the modification, with the reside X corresponding to any amino acid. Finally, added_mass is defined as before. Default =
output.params, run in the download directory: