Searches closest gene to each feature and returns associated distance and inclusion informations
From a BAM file, creates a track file of the read count/density along the whole genome, in the chosen format.
Read counts are divided by 10^-7 times the normalization factor (which is total number of reads by default). Positive and negative strand densities are generated and optionally merged (averaged) if a shift value >=0 is given. The read extension is the number of basepairs a read will cover, starting from its most 5’ position (e.g. with a read extension of 1, only the starting position of each alignment will be considered, default is read length).
Gets the score associated to each genomic feature in each sample and runs DESeq for differential analysis within them. It returns a tab-delimited file with the following fields:
Name, MeanA, MeanB, fold change, adjusted p-value.
The input can be of two different types:
Run a domainogram analysis (ref article) on a set of fragments scores (e.g., such as the one obtained with the 4C-seq pipeline)
Converts a file to another equivalent format (examples: wig to bedgraph, gff to bed). Recognised input formats are bed, bedGraph, bedgraph, bigWig, bigwig, bw, db, fps, gff, gtf, sam, sga, sql, text, txt, wig.
Calculates diverse statistics from a track file, such as a distribution of scores and feature lengths, and prints them to the output file.
Generates a whole genome overview of several signal and/or feature tracks.
Returns the elements that are common to a set of text files, for instance the list of genes common to several lists of genes or annotation files.
In the case when more that two files are given, all possible combinations of intersections are performed (2-by-2, 3-by-3, etc.), in the manner of a Venn diagram. If the elements to intersect are not in the first column, one can specify the column to consider by its index (first column is 1).
Since the number of comparisons is approximately 2^(number of files), it is unadvised to compare more that a dozen of files (10 input files -> 2^10-11=1013 comparisons).
The output is a compressed folder containing a summary file and a sub-folder with all the possible intersections, i.e. for each intersection one text file with the list of common elements.
Create a fully annotated track file from a features type or a subset of Ensembl IDs.
Either upload a raw text file with one Ensembl ID on each line, or choose a feature type to fetch them all.
Creates an MA-plot to compare levels of expression of genomic features across two samples.
The input can be of two different types:
Shift and average scores from forward and reverse strand densities.
Typically built to merge ChIP-seq signals from both DNA strands, it can also be used to add (average) several numeric genomic tracks, replicates for instance.
The output is the average of all the input signals, position by position.
Search over-represented motifs in a set of a genomic regions using MEME
Normalize the columns of a tab-delimited file using a specified method and returns a normalized tab-delimited file.
Apply a numeric transformation to the track scores - such as logarithm or square root.
Returns only the regions of the first input file that overlap (or contain) some feature from the second (‘filter’).
Computes statistics and genome-wide distribution of fragment sizes from mapped paired-end reads.
Quantify signal tracks on a set of regions.
Given a set of signal tracks, and a bed-like track containing intervals (e.g. genes), builds a table of the score of each signal in each of the intervals. That is, each cell of the output table is the score given by one of the tracks to a specific interval.
Scores can be the sum/mean/median/min/max of the tag count in the interval.
Divides the scores of the first track by the scores of the second, and returns a single track with the ratios as new scores.
Applies a moving average transformation to smooth the signal of a quantitative track. <br /><br />
Generates signal tracks from a tab-delimited table.
Makes a GO analysis on a list of Ensembl IDs.
Given a file with one Ensembl ID on each line, it returns a summary table (.txt) and GO networks in a pdf.
The first regroups the most significant terms concerning Biological Processes (BP), Cellular Components (CC) and Molecular Function (MF). One can choose the maximum number of each of these terms to include in the output, with a threshold on the p-value.
For input as a set of tracks, creates a Venn diagram of the proportions of total coverage/total score attributed to each track.
If the parameter ‘type’ has the value ‘intervals’, the diagram will show the percent of the genome covered by each possible combination of the input tracks. For instance, If tracks A and B are given, it will show the portions covered by A only, B only, or A and B.
If it has the value ‘score’, the diagram will show the percent of the total score due to each combination of the input tracks, as above.
The output includes the figure of the Venn diagram and a text summary of the different statistics. If more than 4 samples are given, no graph is produced, but the text summary still contains all the information.
For input as a table of numeric value, logical rules will be applied to selected columns, and the Venn diagram will be based on the number of rows passing the rulein each combination of columns. Rules (possibly empty) must be specified in the same order as column numbers. ‘