{
"metadata": {
"name": "gen_workflows",
"signature": "sha256:e8c52b5439c9cc261d7c3fd32a97393e0a048826fe5b42d15e0e5d2c364e6790"
},
"nbformat": 3,
"nbformat_minor": 0,
"worksheets": [
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"#General Workflows for Handling Biggish Data"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"##Sequence Data \n",
"\n",
"Files from the core facility are downloaded locally to my computer; \"NGS Drive\". \n",
"\n",
" \n",
"\n",
"\n",
"\n",
"This drive is backuped in multiple places including \n",
"\n",
"a)\n",
"\n",
"\n",
"---\n",
"\n",
"### Primary Analysis\n",
"\n",
" \n",
"\n",
"\n",
"---\n",
"###Documentation via IPython Notebook \n",
"I have been using for a few months and just started hosting on Github . IPython is great for so many reasons. \n",
"\n",
"_demo_ \n",
"\n",
"GitHub is nice given the iterative nature of going back to a notebook, thus having version control? \n",
"\n",
"Here is a recent entry acessible via \n",
"\n",
"\n",
" \n",
" \n",
"A screencast from early on\n",
" \n",
" \n",
" \n",
"---\n",
" \n",
"### Secondary Analysis\n",
"Most of the \"secondary\" analysis (which I consider playing with large text files) I try to do within my IPython Notebook or [SQLShare](https://sqlshare.escience.washington.edu/sqlshare/). I put BLAST in this category and this is accomplished locally on 16 core machine (hummingbird) for big jobs or 4 core machine (greenbird) for light jobs. See example above for how I try to go from IPython to SQLShare. \n",
" \n",
" \n",
"These files are housed on a Synology NAS. My folder is public [here](http://eagle.fish.washington.edu/cnidarian/index.php?dir=&sort=date&order=desc). This is where I write and read working files to and from. \n",
" \n",
"\n",
" \n",
" \n",
"---\n",
"### Pretty Pictures\n",
"After the secondary analysis, pretty pictures need to be made. Some examples of how this might be done...\n",
"\n",
"- [DAVID](http://david.abcc.ncifcrf.gov/) > [Revigo](http://revigo.irb.hr/) \n",
"- GO table from SQLShare > [GO Categorizer](http://www.animalgenome.org/tools/catego/) \n",
"- [Many Eyes](http://www-958.ibm.com/software/analytics/manyeyes/) \n",
"- R \n",
" \n",
" \n",
"---\n",
"### Essentials\n",
"A few things I can't do without (besides whats listed above) \n",
" \n",
"- Clipboard History \n",
"- Skitch \n",
"- SnagPath \n",
" \n",
" "
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"---\n",
"\n",
""
]
},
{
"cell_type": "code",
"collapsed": false,
"input": [],
"language": "python",
"metadata": {},
"outputs": []
}
],
"metadata": {}
}
]
}