SeedScan - Diagnostic plots

Diagnostic plots let you assess the consistency of an analysis.
After only a few million sequences (i.e. a few minutes of analysis), the plots already reveal unexpected count distributions between barcodes/pairs/identified seeds.

Such problems are in most cases due to:


From SeedScan's Main menu | View select:











Overall counts

While SeedScan is running and evaluating a pair of NGS sequence files, the overall count numbers are displayed graphically and updated continuously:


The bar chart visualizes:
The numbers on top of the bars indicate:


The example shows a preliminary counting snapshot taken after ~9 million sequences were analyzed (~2:30 min execution time).
This snapshot indicates a particularly bad analysis result, which is due to several problems/experimental settings:











Barcodes / pairs

From SeedScan's main menu select:
Main menu | View | BC_Count sums



The graph indicates the count numbers for each

The graph indicates:


Change the scaling of the y-axis (counts) to logarithmic.
Click the Tool button to open the tool box and check the Y-log field:



Now you can see that the barcodes BC11..BC14 and BC28, BC29 have only small numbers of counts (a few hundred).
These are probably contaminations or matches to the random sequences.
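
The same check can be repeated outside the GUI on exported count numbers. The following is a minimal sketch, not part of SeedScan: it assumes the per-barcode counts are available as a Python dict, and both the example values and the 1% threshold are hypothetical choices for illustration.

```python
from statistics import median

def flag_low_count_barcodes(counts, fraction=0.01):
    """Return the barcodes whose count is below `fraction` of the median count.

    `fraction` is an arbitrary, hypothetical threshold (not a SeedScan setting).
    """
    cutoff = median(counts.values()) * fraction
    return sorted(bc for bc, n in counts.items() if n < cutoff)

# Hypothetical counts, for illustration only (not taken from a real run).
counts = {"BC1": 250_000, "BC2": 310_000, "BC3": 280_000, "BC11": 300, "BC12": 450}
low = flag_low_count_barcodes(counts)
print(low)
```

Barcodes flagged this way are the ones that only become visible in the plot after switching to the logarithmic scale.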











Pairs / Seeds














Barcodes pairings

In a perfect run, you would see only the barcode pairs that were defined.
In the example, these are pairs like BC1<=>BC15, BC2<=>BC16, ...
There should not be any other combinations.
For the first barcode of each defined pair, SeedScan counts the detected pairings with all other barcodes.



Each line represents the first barcode from a defined pair, showing the counts for all possible pairs.
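
The idea behind this plot can be sketched in a few lines. This is an illustration under stated assumptions, not SeedScan's implementation: the function name `pairing_table`, the input format (one `(first, second)` barcode tuple per read) and the example reads are all hypothetical.

```python
from collections import Counter

def pairing_table(observed_pairs, defined_pairs):
    """Count every observed (first, second) barcode pair and split the
    counts into expected (defined) and unexpected ("not allowed") pairings."""
    counts = Counter(observed_pairs)
    expected, unexpected = {}, {}
    for (first, second), n in counts.items():
        target = expected if defined_pairs.get(first) == second else unexpected
        target[(first, second)] = n
    return expected, unexpected

# Defined pairs as in the example above; the reads are made up for illustration.
defined = {"BC1": "BC15", "BC2": "BC16"}
reads = [("BC1", "BC15")] * 5 + [("BC2", "BC16")] * 4 + [("BC1", "BC16")]
expected, unexpected = pairing_table(reads, defined)
print(expected)
print(unexpected)
```

In a perfect run the `unexpected` dict would stay empty; any entry in it corresponds to a combination that should not occur.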

Click 3D to show a 3D projection of the data lines:



If you change the y-axis to log scale (click the Tool button, then check Y-log), low-abundance "not allowed" pairs become much easier to recognize:















Seed shift

This graph visualizes the abundance of identified seeds that are shifted relative to their expected position within the sequenced constructs.

In the example, a seed shift of +/- 3 bp was enabled.
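
To make the notion of a seed shift concrete, here is a minimal sketch of one way such a search could work. It is an assumption for illustration only and does not describe SeedScan's actual algorithm: the function `find_seed_shift`, the exact-match comparison, and the example read and seed are all hypothetical.

```python
def find_seed_shift(read, seed, expected_pos, max_shift=3):
    """Return the offset (in bp) at which `seed` occurs relative to
    `expected_pos`, trying 0, -1, +1, -2, +2, ... up to +/- max_shift.
    Returns None if the seed is not found within that window."""
    offsets = [0]
    for d in range(1, max_shift + 1):
        offsets += [-d, d]
    for off in offsets:
        start = expected_pos + off
        # Exact string match; a slice running past the read end cannot match.
        if start >= 0 and read[start:start + len(seed)] == seed:
            return off
    return None

# Hypothetical read and seed, for illustration only.
read = "ACGTACGTTTGACCAGT"
print(find_seed_shift(read, "GACC", expected_pos=8))
print(find_seed_shift(read, "GACC", expected_pos=10))
print(find_seed_shift(read, "GACC", expected_pos=4))
```

Tallying the returned offsets over all reads would give the kind of distribution that the seed shift graph displays.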



The graph indicates:












Sample / Seed profiles

The graph shows the abundance of all seeds (x-axis) in the analyzed barcode-pairs (samples).



The example shows:












Sample reads

For deeper problem analysis, SeedScan stores a small set of reads (the first 1000) as plain text files.
These files are generated in the location specified for the Result file.
The different file names are composed of the base file name, as defined by the user, and an extension specific to the data type: