{ "metadata": { "name": "gen_workflows", "signature": "sha256:e8c52b5439c9cc261d7c3fd32a97393e0a048826fe5b42d15e0e5d2c364e6790" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# General Workflows for Handling Biggish Data" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Sequence Data \n", "\n", "Files from the core facility are downloaded locally to my computer, to the \"NGS Drive\". \n", "\n", "(Image: NGS_Raw_Data_-_837.8_GB_17A0A4B5.png) \n", "\n", "\n", "\n", "This drive is backed up in multiple places, including \n", "\n", "a)\n", "\n", "\n", "---\n", "\n", "### Primary Analysis\n", "\n", "(Image: SimpleMind_Free_17A0A687.png) \n", "\n", "\n", "---\n", "### Documentation via IPython Notebook \n", "I have been using the IPython Notebook for a few months and just started hosting my notebooks on GitHub. IPython is great for so many reasons. \n", "\n", "_demo_ \n", "\n", "GitHub is nice given the iterative nature of going back to a notebook, since that calls for version control. \n", "\n", "Here is a recent entry accessible via \n", "\n", "\n", " \n", " \n", "A screencast from early on\n", " \n", " \n", " \n", "---\n", " \n", "### Secondary Analysis\n", "I try to do most of the \"secondary\" analysis (which I consider to be playing with large text files) within my IPython Notebook or [SQLShare](https://sqlshare.escience.washington.edu/sqlshare/). I put BLAST in this category; it runs locally on a 16-core machine (hummingbird) for big jobs or a 4-core machine (greenbird) for light jobs. See the example above for how I try to go from IPython to SQLShare. \n", " \n", " \n", "These files are housed on a Synology NAS. My folder is public [here](http://eagle.fish.washington.edu/cnidarian/index.php?dir=&sort=date&order=desc). This is where I read and write working files. \n", " \n", "(Image: the_eagle_17A0B185.png)\n", " \n", " \n", "---\n", "### Pretty Pictures\n", "After the secondary analysis, pretty pictures need to be made. 
Some examples of how this might be done...\n", "\n", "- [DAVID](http://david.abcc.ncifcrf.gov/) > [Revigo](http://revigo.irb.hr/) \n", "- GO table from SQLShare > [GO Categorizer](http://www.animalgenome.org/tools/catego/) \n", "- [Many Eyes](http://www-958.ibm.com/software/analytics/manyeyes/) \n", "- R \n", " \n", " \n", "---\n", "### Essentials\n", "A few things I can't do without (besides what's listed above) \n", " \n", "- Clipboard History \n", "- Skitch \n", "- SnagPath \n", " \n", " " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "---\n", "\n", "" ] }, { "cell_type": "code", "collapsed": false, "input": [], "language": "python", "metadata": {}, "outputs": [] } ], "metadata": {} } ] }