A systematic and thorough method for identifying the significant links in biochemical pathways with high sensitivity.

Data management

FEvER software requires that the input dataset is a list of accessions (only UniProt accessions are supported right now) together with relative abundance ratios and p-values for these ratios, calculated from replicates. The input datafile can be either tab-separated text based or Microsoft Excel files, with or without a header row. In case there is a header row the software will attempt to parse the columns right, however if it fails it will assume that the first three columns of the file are accessions, ratios and p-values, respectively. For frequent users of the software MaxQuant, simply importing the resulting text file into Microsoft Excel should be enough.

For several different system files, FEvER software will use a system directory which will be created the first time the software is run. It is strongly recommended to note where this directory is created and re-use the files there, as this will improve the performance of the software dramatically. To minimize the idle time during web queries, FEvER caches results of previous queries. These are stored in a particular format (*.fsf) within the system directory, and updated when the relevant databases are updated.

Parameters

The first two parameters are used to define the paths to the input file, and where the output should be saved, respectively. The next two parameters, SIG and REG, are used to define threshold values for what constitutes an interesting, or significantly regulated, protein in the in the experimental data. The last parameter in this portion of the parameter file is where the accession types for proteins in the input file is given. Currently only UniProt and IPI accessions are supported by the software, however future releases will add further support for different databases.

INPUT - path to the input file
HEADERROW - flag whether or not the input file has a header row
SEPCHAR - the character used to separate multiple identities from one another
OUTPUT - path to an output folder (optional)
SYSDIR - path to the FEvER system folder
ID - the type of accessions in the datafile (UniProt is expected)
SIG - ROI threshold value for p-values (default value = 0.05)
REG - ROI threshold for absolute fold change (default value = 2)
ALPHA1 - alpha1 coefficient (default value = 0.75)
ALPHA2 - alpha2 coefficient (default value = 0.15)
ALPHA3 - alpha3 coefficient (default value = 0.10)
K1 - k1 variable (default value = 0.75)
K2 - k2 variable (default value = 8.0)
SORT - Sorting method for the non-parametric model (Def: fold change)
RAND - Sampling method for the parametric model (Def: empirical dist.)

The two parameters, SIG and REG, are used to define threshold values for defining the borders of the region of interest (ROI) for proteins in the in the experimental data. Proteins that end up in the ROI have much more contribution towards the enrichment score of a pathway.

ALPHA coefficients are used for weighing of three functions in a linear combination, these functions are explained in detail in the supplementary material to the original article. It is important to note that ALPHA coefficients have to sum up to 1.0, in order to normalize the total effect of the weights, hence valid values for these coefficients are decimal values in the range of (0,1).

The last two parameters K1 and K2 are used for fine-tune non-linearity of the first and third function, respectively. Note that the non-linearity property is optional, both values can be set to 1.0 to eliminate non-linearity otherwise positive rational values are expected for both values, given in decimal notation.