COLOM

"G-banding is technique used in cytogenetics to produce differently stained regions on condensed chromosomes. The metaphase chromosomes are treated with trypsin (to partially digest the protein) and stained with Giemsa. Dark bands that take up the stain are strongly A,T rich (gene poor). (...) Banding can be used to identity chromosomal abnormalies, such as translocations, because there is a unique pattern of light and dark bands for each chromosome."
(from Wikipedia)

Biological role:
Ideogram

List of Chromosomal bands.
COLOM expects a tab-delimited ideogram data file containing:

one chromosomal band per line
1. column: Chromosome number (e.g. 1..22,x,y)
2. column: arm (e.g. p,q)
3. column: band (ter,cen,1,11,11.1,11.1a, ...)

Such a file may be downloaded from NCBI's FTP site: e.g. ideogram.gz (right mouse-click, Save as):

#chromosome	arm	band	iscn_start	iscn_stop	bp_start	bp_stop	stain	density	bases
1	p	ter	0	1	0	0			1
1	p		0	7335	1	124300000			124300000
1	p	3	0	4852	1	84700000			84700000
1	p	36	0	1521	1	27800000			27800000

Gene index:
Genes-to-ideogram

Association: which gene is located on a certain chromosomal band.

COLOM expects a tab delimited data file containing

one gene per line
one column containing chromosomal band where gene is located
one column containing gene name
one column containing gene description

Such a file may be downloaded from NCBI's FTP site E.g. cyto_gene.md.gz for homo sapiens. Data files from other organisms may be downloaded from respective directories.

#tax_id	chromosome	iscn_start	iscn_end	orientation	featureName	featureId	featureType	printLocation	units
9606	1	1839	1895	na	A3GALT2	GeneID:127550	GENE	1p35.1a	BandsAsInt
9606	1	660	710	na	AADACL3	GeneID:126767	GENE	1p36.21d	BandsAsInt
9606	1	660	710	na	AADACL4	GeneID:343066	GENE	1p36.21d	BandsAsInt
9606	1	5682	5741	na	ABCA4	GeneID:24	GENE	1p22.1a	BandsAsInt
9606	1	13847	13906	na	ABCB10	GeneID:23456	GENE	1q42.13e	BandsAsInt
9606	1	5741	5795	na	ABCD3	GeneID:5825	GENE	1p21.3d	BandsAsInt

Theme		Group
		in	out
	in	8	2
	out	4	26

a	b	a+b
c	d	c+d
a+c	c+d

1. List of biological roles	GO-Terms Chromosomal bands Pathways ...
2. Gene index	Maps genes to the above biological roles
3. Reference gene list	All genes on your microarray (filter,...)
4. Selection gene list	Result gene list of your statistical analysis

Up-regulated geneset:	Down-regulated geneset:
Geneset with Gaussian regulation distribution:	Geneset with (artificial) random disribution:

a) Loaded SUMO analysis	Most easy way, in case you opend SUMO analysis contains the full list of genes analyzed on the Mircor-array/filter/... . This option shold give the most accurate p-values.
b) External data file	Load the complete list of anaylzed genes from an external data file. Typically this could be the array's ADF file, or an expression matrix, ... This shold give correct p-values, too
c) Genes mapped to biological theme	Simply take all genes which are assigned to the loaded bological themes. It may happen, that not all genes associated to the biological themes are found on the microarray. Thus computed p-values might be slightly incorrect. But with genome wide analyses this should not generate massively wrong data.

CO-LOcalisation Mapper (COLOM )

>Basic Idea and mathematical concepts

Over-representation of significant genes - Hypergeometric distribution

Some mathematical concepts

Gene Set Enrichement Analysis (GSEA) - Kolmogorov-Smirnov statistics

Simple Enrichement Analysis (SEA) - Chi² statistics

Load required data files

Data View

Filter

Search

Copy

Paste

Preferences

Data files and file formats

GO-Terms

Chromosomal bands

KEGG metabolic pathways

miRNA

TransFac

MSig-DB

Wiki pathways

Reactome

Reference gene lists

Selection gene lists

Left panel	Brows able hierarchical tree of biological themes
Right panel	Sort able table of populated biological themes
Bottom panel	Info window. Content depends an what was selected in above panels

Column	Content
1	Name of biological role / gene	Icons indicate: N: this is a node=biological theme, Click +/- sign to expand collapse this node G: this is a gene Text colours indicate (Default colours) Gray: no gene in this biological theme or this gene was not found in reference gene list Black: Reference genes for this biological theme found Blue: Selection genes found for this node, no nett regulation Red: Selection genes found for this node, netto up-regulated Green: Selection genes found for this node, netto down regulated
2	Number of genes in the biological theme role in reference gene list (e.g. ADF file) or Number of replicates for this gene found in reference list
3	Number of genes in the biological role found in selection list or Number of replicates for this gene found in selection list
4	Number of up-regulated genes in the biological role found in selection list or Number of up-regulated replicates for this gene found in selection list
5	Number of down-regulated genes in the biological role found in selection list or Number of down-regulated replicates for this gene found in selection list
6	Net regulation of biological role / gene	Arithmetic mean from regulation of all genes populating the respective Biological role / gene
7	Significance indicator	A coloured disk indicates: this biological role is significantly over/under populated (set critical p-value on preferences tab-sheet). Colours indicate: Blue: whole selection Red: up-regulated Green: Down-regulated Yellow: both Up-/Down-regulated
8	Number of expected gene for biological role
9	p-value: significance of populated for the biological role with all selection genes
10	p-value: significance of populated for the biological role with all up-regulated genes
11	p-value: significance of populated for the biological role with all down-regulated genes

Hide nodes without genes	Remove all biological-role nodes where no index genes were found.
Hide nodes without reference genes	Remove all biological-role nodes where no genes from reference list were found. Also remove all index genes from tree which were not found in reference-list.
Hide nodes without selection genes	Remove all biological-role nodes where no genes from selection list were found. Also remove all genes from tree which were not found in selection-list.
Hide Genes	Remove all genes from tree. Useful when to browse terms only
Show all	Remove all filters => show all nodes.

Filter Min
Filter Max	Define maximum value for filtering, i.e. all items with a value larger Max are removed
Filter All	Disable filtering for this column
Remove all filter	Disable filter for all columns

Suppress underrepresented values	There may be various reasons why too few genes may have passed your statistical test or clustering. Therefore you might see too many under-populated nodes. Click this field to mask underrepresented nodes, i.e. those nodes where number of mapped genes is lower compared to expected number. Or more easily: only show over-populated nodes.
Alpha	Critical p-value to mark nodes. Only those nodes with p-values smaller Alpha are marked with coloured buttons.
p-Computation
Biological role ID (here: GO-ID)	Display additionally systematic Node ID in tree and table (e.g. "GO:0044464 - cell part")
View biological role description	Display additionally Node's description in tree and table (e.g. "cell part - "Any constituent part of a cell, the basic structural and functional unit of all organisms." [GOC:jl]")
View gene counts	Display columns containing gene counts in tree and table panel
View p-values	Display columns containing p-values in tree and table panel
Exponential format	View p-values in exponential format: Instead of 0.00341 => 3.41E-003. Most interesting for p-values is the order of magnitude (E-003). Everything below E-002 is interesting. As smaller as better (E-006,E-007,...)

Data files	GO-Term column	Gene-Name column	GO/Gene-Description column	Gene aliases (Alternate gene names)	Alias divider
Biological roles: GO-Tree gene_ontology.obo
Gene index => GO-terms ENSEMBLE	4	3	2
NCBI homo sapiens mus musculus rattus norwegicus	18	3	9	5	\| ("Pipe", ASCII=124)

	MicroRNA	miR	GeneID	Gene
	MI0000266	hsa-miR-10a	23054	NCOA6
	MI0000102	hsa-miR-100	2475	FRAP1
	MI0000109	hsa-miR-103	2746	GLUD1
	MI0000109	hsa-miR-103	9493	KIF23
....

File	Number of algorithms	Number of predicitions	Number of miRNAs	Number of genes
MirToGene_LE1_5641K_652miR_16359G.txt	1 or more	5640798	652	16359
MirToGene_LE1_5641K_652miR_16359G.txt	2 or more	3553248	651	16243
MirToGene_LE3_2087K_619miR_16109G.txt	3 or more	2086987	619	16109
MirToGene_LE4_1259k_568miR_15826G.txt	4 or more	1259129	568	15826
MirToGene_LE5_715K_547miR_14866G.txt	5 or more	715864	547	14866
MirToGene_LE6_171K_489miR_11745G.txt	6 or more	171095	489	11745
MirToGene_LE7_29K_343miR_5563G.txt	7 or more	29249	343	5563
MirToGene_LE8_8K_181miR_2090G.txt	8 or more	8125	181	2050
MirToGene_LE9_2k_104miR_652G.txt	9 or more	2043	104	652
MirToGene_LE10_0,2K_56miR_156G.txt	10	239	56	105

TRANSITION_METAL_ION_TRANSMEMBRANE_TRANSPORTER_ACTIVITY	http://www.broadinstitute.org/gsea/msigdb/cards/TRANSITION_METAL_ION_TRANSMEMBRANE_TRANSPORTER_ACTIVITY	SLC11A2	FXN	SLC30A4	SLC30A5	CCS	SLC30A3	SLC31A2	SLC31A1	...
CYCLASE_ACTIVITY	http://www.broadinstitute.org/gsea/msigdb/cards/CYCLASE_ACTIVITY	GUCY2F	ADCY9	ADCY7	ADCY8	RTCD1	GUCY1A2	GUCY1A3	GUCY1B3	...
LOW_DENSITY_LIPOPROTEIN_BINDING	http://www.broadinstitute.org/gsea/msigdb/cards/LOW_DENSITY_LIPOPROTEIN_BINDING	APOA4	CDH13	LDLR	ANKRA2	STAB1	CXCL16	SORL1	LRP6	...
MAP_KINASE_KINASE_KINASE_ACTIVITY	http://www.broadinstitute.org/gsea/msigdb/cards/MAP_KINASE_KINASE_KINASE_ACTIVITY	MAP3K7	MAP3K6	MAP3K5	MAP3K4	ZAK	MAP3K3	MAP3K9	MAP3K10	...
...