DataGrid information in QuiXoT

From PROTEOMICA
Jump to: navigation, search

The DataGrid of QuiXoT contains different columns, according to the quantification strategy used, along with some user-defined options. Advanced users can modify the .xsd files in the configuration folder to change these columns according to their needs.

Here is a list of the standard columns used in most quantification methods:

FileName: MSF or SRF file containing SEQUEST identifications
RAWFileName: RAW file containing the mass spectra
Index: Identification number of the scan in increasing order of FDR (error rate of peptide identification)
spectrumIndex: Identification number of a spectrum to find its data in the binStack folder
FirstScan/LastScan: Number of the first and last scans in each raw file where the peptide was identified
Charge: Peptide’s ionization charge
FDR: False Discovery Rate with which a peptide was identified in the database from its MS/MS spectrum
FASTAshort: Truncated form of FASTAProteinDescription
FASTAProteinDescription: Description of the protein in FASTA database
Sequence: Amino acid sequence of the peptide (including all kind of modifications)
EqSequence: Amino acid sequence of the peptide taking into account that the different labeled forms belongs to the same peptide
pI: Theoretical Isoelectric Point of the peptide
PrecursorMass: Experimentally-determined molecular mass of the peptide (non-ionized form)
q_peptide_Mass: Theoretical molecular mass of the non-labeled form of the peptide (this is calculated during the quantification)
XC1D: Sequest Xcorr score of the best match
XC2D: Sequest Xcorr score of the second best match
deltaCn: Normalized difference between the first and second best scores
Sp: Sequest Sp parameter
SpRank: Sequest Sp-based ranking
Proteinswithpeptide: Number of additional proteins containing the peptide (and shown within the redundance table)
q_A: Amount of peptide in sample A
q_SD_A: Standard deviation of q_A
q_B: Amount of peptide in sample B
q_SD_B: Standard deviation of q_B
q_log2Ratio: Base 2 logarithm of A/B ratio
q_f: Labeling efficiency (18O labeling only)
q_SD_f: Standard deviation of f
q_Calibration: Mass shift (allows user to manually compensate for a systematic error in mass calibration)
q_DeltaMZ: Difference between theoretical and fitted m/z
q_Alpha: Leptokurtosis’ coefficient for peak fitting
q_SD_Alpha: Standard deviation of q_Alpha
q_Sigma: Peak half-width at 50% intensity
q_SD_Sigma: Standard deviation of q_Sigma
q_DeltaR: Mass increase introduced by labeling
q_Background: Fitted background signal
q_SQWindowsLeft: Mean squared deviation between experimental and theoretical spectra in a window at the left of the isotopic cluster
q_SQWindowsRight: Mean squared deviation between experimental and theoretical spectra in a window at the right of the isotopic cluster
q_SQPeptide: Mean squared deviation between experimental and theoretical spectra in a window spanning the isotopic cluster
q_windowRight: ??
q_SQTotal: Total Squared deviation between experimental and theoretical spectra
numLabel1: Allows the user to manually reject the spectra for statistics (1: used in statistics / 0: not used in statistics)
protLabel: Allows the user to consider the peptide as belonging to a different protein when performing the statistics
peptLabel: Allows the user to consider the scan as belonging to a different peptide when performing the statistics
DoubleFree1/2/3: Allows the user to introduce free text, numerical labels or to store the result of mathematical operations
Label4: Free field
Label5: Contains information about the reliability of Newton-Gauss fit method during the quantification process (Pedro: explicar las opciones)
dp_deployment:
st_Meth: Variable which informs whether a peptide contains methionines (1), oxidized methionines (2) or no methionines (0)
st_Cterm: Variable which informs if the last the peptide is tryptic (st_Cterm = 0) or not (st_Cterm = 1). Note that although originally this variable was intended to inform about C-terminus peptides, this is not always true (as C-terminus peptides ending with lysine or arginine will have st_Cterm = 0). However the name and the current definition is kept for practical reasons.
st_PartialDig: Variable which informs if the peptide has been partially digested (0: no partial digestion / 1: partially-digested peptide without a subpeptide identified / 2: subpeptide from a partially-digested peptide/ 3: partially-digested peptide with subpeptide identified)
st_excluded: Indicates that the scan has been excluded from statistics
Vs: Fitting weight of the scan
Xs: Base 2 logarithm of A/B ratio (corrected for 18O labeling efficiency)
SD_Xs: Standard Deviation of Xs
Xs_NoCorrf: Base 2 logarithm of A/B ratio (not corrected for 18O labeling efficiency)
Ws: Statistical weight of the scan
Zs: Standard normal value at scan level (Xs-Xp normalized to have a SD of one)
FDRs: False discovery rate at the scan level: proportion of scans expected to deviate by chance alone from peptide mean within the population of observed scan outliers (used to detect scan outliers)
Xp: Averaged log2ratio of the peptide
SD_Xp: Standard Deviation of Xp
Wp: Statistical weight of the peptide
Zp: Standard normal value at peptide level (Xp-Xq normalized to have a SD of one)
FDRp: False discovery rate at peptide level: proportion of peptides expected to deviate by chance alone from protein mean within the population of observed peptide outliers (used to detect peptide outliers)
Xq: Averaged log2ratio of the protein
SD_Xq: Standard Deviation of Xq
Wq: Statistical weight of the protein
Zq: Standard normal value at protein level (Xq-X normalized to have a SD of one)
Pq: Probability that a protein is outlier with respect to the corresponding average
FDRq: False discovery rate at the protein level: proportion of proteins expected to deviate by chance alone from the grand protein mean within the population of observed protein outliers (used to detect significant protein expression changes)
q_index: Identification number of each quantified protein
p_index: Identification number of each quantified peptide within its corresponding protein
s_index: Identification number of each quantified scan within its corresponding peptide
scan_per_peptide: Number of scans with which the peptide was quantified
pep_per_prot: Number of peptides with which the protein was quantified