# NGI Visualizations
A collection of next-gen sequencing visualisation scripts. Click a script's
name to go to it's subdirectory which will contain a detailed `README.md`
file with examples and instructions.
---
> #### Depreciation notice
> Please Note that many of these scripts have been superseded by
> MultiQC (http://multiqc.info). if you're not familiar with it we
> recommend having a look there first before using the tools below.
---
### Introduction
Most of these scripts are written in Python. Those within the `stand_alone`
are generally run on the command line. The rest can either be run on the command
line or imported as part of the `ngi_visualizations` package. See
[below](#usage) for instructions on how to use the python package.
* [Count Biotypes](ngi_visualizations/count_biotypes/)
* Uses HTSeq to plot read overlaps with different feature biotype flags
* [preseq Complexity Curves](ngi_visualizations/preseq_complexity_curves/)
* [Subsampled Gene Observations](stand_alone/subsampled_gene_observations/)
* Group of scripts to plot the number of observed genes at varying sample
subsampling proportions. Can give an impression of library complexity on
a biological level.
* [Qualimap Plots](ngi_visualizations/qualimap/)
* Scripts to generate Coverage and Insert Size histograms. Both of these
plots are already produced by Qualimap. These just look nicer in our
reports and have some extra plotting options.
* [snpEff Effect Type Plot](ngi_visualizations/snpEff/)
* Script to create bar charts of SNP Effect counts, generated by snpEff.
* [Gene Body Coverage](ngi_visualizations/gene_body_coverage/)
* [Alignment Summaries](stand_alone/alignment_summaries/)
* Two scripts to parse log files containing alignment stats from bowtie,
bowtie 2 or tophat and generate overview HTML reports
* [Bismark Summary Report](stand_alone/bismark_summary/)
* Script to parse lots of bismark output reports and generate a single HTML summary report.
* [Bismark Coverage Analysis](stand_alone/bismark_coverage/)
* [Bismark Coverage Curves](stand_alone/bismark_coverage/#bismark-coverage-curves) - Plots the proportion of cytosines meeting increasing coverage thresholds
* [Bismark Window Sizes](stand_alone/bismark_coverage/#bismark-window-sizes) - Plots the proportion of windows passing observation thresholds with increasing window sizes
## Examples
See below for example outputs. Click an image to go to that script.
## Usage
For using stand alone packages see the `README.md` file in that package's subdirectory.
To use the `ngi_visulaizations` package, download or clone the repository.
Then, to install the package, run:
```
python setup.py install
```
If you intend to make any changes to the package, swap `install` for `develop`,
else you will have to reinstall the package each time you change the source code.
Once installed, you can import the script from the relevant subdirectory. For instance,
to use the [Qualimap Insert Size histogram](ngi_visualizations/qualimap/) you would use:
```python
from ngi_visulaizations.qualimap import insert_size
```
The functions within this script are then available in that namespace. For instance,
you could now generate the histograms by running:
```python
insert_size.plot_insert_size_histogram(input_fn)
```
## Contributing
If you would like to add a visualization script to this repository, please
read the [contributing notes](CONTRIBUTING.md) first. These describe the
steps required in adding your script to the repository.
## Credits
These scripts were written for use at the
[National Genomics Infrastructure](https://portal.scilifelab.se/genomics/)
at [SciLifeLab](http://www.scilifelab.se/) in Stockholm, Sweden.
For more information, please get in touch with
[Phil Ewels](https://github.com/ewels).
