WeSA Help

Help and documentation

What is WeSA?

WeSA, (weighted socioaffinity), is a statistical score which can rank results from protein experiments of the bait-prey type, e.g. affinity purification, immunoprecipitation or proximity labelling.

You can enter lists of interacting proteins according to the results of your experiment. Input your raw data before any filtering is done.

As a result you will get a re-ordered list of scores which allows one to understand which interactions are more likely than others based on what previous experiments can add to your data.

Input

WeSA calculations require protein pair data to compute the scores of interactions. There are two possibilities: calculating the score using user input data or displaying the scores as they are computed from databases. Users can provide their input in different ways but always using gene names.

You can input a list of proteins (one protein per line) and it will look for all interactions involving any protein on the list and display the scores as calculated from the data in the selected database (IntAct, BioGRID, BioPlex or any combination of the three).
If you want to re-calculate the scores while including your own experimental results, then enter a list of protein pairs. In case you are adding results from a single experiment, you can add a simple protein pair list (two proteins in the same line space-separated). Alternatively, for results from multiple experiments, each line should contain a protein pair and the experiment ID in which it was identified.
In all cases you should select a database, the data in which most closely resembles your experimental design. IntAct is selected as default as it has more accurate records than the other two, but it is missing a lot of information which can be found elsewhere.

Currently, it supports human proteins only.

Output

The output is a table of protein pairs together with their calculated WeSA scores. The table is interactive and the user is encouraged to manually explore their results using the search and sort buttons.

The resulting table can be exported in several file formats (SVG, Excel, PDF).

In addition to the overall WeSA score, the output table contains the observed numbers for each protein pair, meaning, the total count of experiments in the data, in which the pair was found (separately for the three cases: 1. spoke A - when the first protein was bait in the experiment and retrieved the other protein, 2. spoke B - when the second protein was bait and retrieved the first, 3. matrix - when the two proteins were captured in an experiment involving another protein distinct from the two proteins of interest.

Submission options explained through examples

Option 1: Query the existing data

Input: Single-column list
You can input a list of proteins (one protein per line) and the webtool will output all interactions involving those proteins and their pre-computed WeSA scores.

Sample input:
IFT20
BBS4
LZTFL1

If you do not select a dataset from the dropdown menu, the default is to display interactions discovered and recorded within IntAct (and WeSA scores calculated from IntAct data).

Sample output

Protein A	Protein B	WeSA	SA	Count: A retrieved B	Count: B retrieved A	Count: matrix
BBS4	BBS1	71.6	16.9	5	1	12
IFT172	IFT20	65.8	11.5	3	0	14
BBS4	BBS2	63.6	16	3	2	13
LZTFL1	BBS9	46.2	0	4	2	0
IFT88	IFT20	58.8	10.9	5	0	9
...

Option 2: Re-calculate the scores to rank the results from your own single experiment

If you performed a single experiment (for more than one experiment refer to option 3) and are wondering how to filter the list of your results, you can use this tool to re-calculate the WeSA scores to rank your results. To do this, enter a list of protein pairs (two proteins in the same line space-separated).

Input: Two-column space-separated list
The first column in every line should be the tagged bait protein and that should be separated via space from the second protein. In this case, the second column contains all proteins which were identified in the precipitate.

Sample input:
IFT20 IFT72
IFT20 IFT81
IFT20 IFT74

If you do not select a dataset from the dropdown menu, the default is to use interactions discovered and recorded within IntAct (and WeSA scores calculated from IntAct data enriched with the submitted data).

Sample output

Protein A	Protein B	WeSA	SA	Count: A retrieved B
IFT20	IFT72	10.6	10.6	1
IFT20	IFT74	8.5	8.5	1
IFT20	IFT81	6.9	6.9	1

Important note: The results only show the scores of the inputted pairs as long as evidence supporting the respective interaction exists in the chosen background dataset used for computations.

Option 3: Re-calculate the scores to rank the results from your own experiments

If you performed multiple experiments and are wondering how to filter the list of your results, you can also use this tool to re-calculate the WeSA scores and rank your results. To do this, enter a list of protein pairs alongside unique experiment identifiers.

Input: Three-column space-separated list
The first column in every line should be the tagged bait protein and that should be separated via space from the second protein. In this case, the second column contains proteins which were identified in the precipitate of the respective bait. The third entry in every line should be the unique experiment identifier. This identifier can be anything as long as it is unique for every separate experiment.

Sample input:
IFT20 IFT80 E1
IFT20 IFT81 E1
IFT20 IFT74 E2
IFT20 IFT80 E2
IFT80 IFT81 E3
IFT80 IFT172 E3
IFT80 IFT57 E3
IFT80 IFT46 E3

If you do not select a dataset from the dropdown menu, the default is to use interactions discovered and recorded within IntAct (and WeSA scores calculated from IntAct data enriched with the submitted data).

Sample output

Protein A	Protein B	WeSA	SA	Count: A retrieved B	Count: B retrieved A	Count: matrix
IFT80	IFT20	64	18.8	2	1	14
IFT20	IFT80	8.1	8.1	1	0	0
IFT80	IFT172	8	8	1	0	0
IFT80	IFT46	7.9	7.9	1	0	0
IFT80	IFT57	7.9	7.9	1	0	0
IFT20	IFT81	7.6	7.6	1	0	0
IFT20	IFT74	7.4	7.4	1	0	0
IFT80	IFT81	7.3	7.3	1	0	0

Important note: The results only show the scores of the inputted pairs as long as evidence supporting the respective interaction exists in the chosen background dataset used for computations.

Output table columns

Protein A, Protein B: Two proteins which have some evidence supporting their interaction;

WeSA: WeSA score for the respective protein pair, computed from the selected dataset: IntAct, BioGRID, BioPlex or a combination. Higher WeSA scores correspond to interactions which are better supported by the evidence. The visualisation distinguishes red and green interactions: the red interactions have scores below an optimal threshold which is specific, unique and pre-computed for each dataset based on the optimal balance between true positives and false positives; the green edges correspond to WeSA scores above the optimal threshold.;

SA: Socioaffinity score for the respective protein pair;

Count: A retrieved B: Count of recorded experiments in which protein A was the tagged bait protein and protein B was retrieved;

Count: B retrieved A: (Reverse situation in comparison to previous) Count of recorded experiments in which protein B was the tagged bait protein and protein A was retrieved;

Count: matrix: Count of recorded experiments in which some protein X was the tagged bait protein and proteins A and B were retrieved simultaneously (in the same experiment).