{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Intermine-Python: Tutorial 00- Interaction with Registry" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The following functions can be used to obtain some useful information about a mine or an organism. Assuming you are inside intermine-ws-python folder: " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "`getMines(organism)` fetches the mine(s) of a particular organism, say 'D. melanogaster'.\n", "Before running following modules, run pip install intermine in your terminal first." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "FawMine\n", "FlyMine\n", "LocustMine\n", "XenMine\n" ] } ], "source": [ "from intermine import registry\n", "registry.getMines(\"D. melanogaster\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "FawMine
\n", "FlyMine
\n", "LocustMine
\n", "XenMine" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now that we have the mines, we can use `getInfo(mine)` to fetch all the information about a particular mine ie its description, version, organisms associated etc. Suppose we want to know more about flymine, here's what we do: " ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Description: An integrated database for Drosophila genomics\n", "URL: https://www.flymine.org/flymine\n", "API Version: 30\n", "Release Version: 49 2020 February\n", "InterMine Version: 4.1.3\n", "Organisms: \n", "D. melanogaster\n", "Neighbours: \n", "Animals\n" ] } ], "source": [ "registry.getInfo(\"flymine\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Description: An integrated database for Drosophila genomics
\n", "URL: http://www.flymine.org/flymine
\n", "API Version: 30
\n", "Release Version: 49 2020 February
\n", "InterMine Version: 4.1.3
\n", "Organisms:
\n", "D. melanogaster
\n", "Neighbours:
\n", "Animals
\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now that we are very interested after viewing information about 'flymine', `getData(mine)` can be used to extract the data sets corresponding to it, the way is as follows: " ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Name: Affymetrix array: GeneChip Drosophila Genome 2.0 Array\n", "Name: Affymetrix array: GeneChip Drosophila Genome Array\n", "Name: Alleles and phenotypes\n", "Name: Artificial deletions\n", "Name: BDGP cDNA clone data set\n", "Name: BDGP in situ data set\n", "Name: BioGRID interaction data set\n", "Name: Disease Ontology\n", "Name: Disease models\n", "Name: Drosophila species orthologues and paralogues\n", "Name: E-FLYC-6\n", "Name: Fly Anatomy Ontology\n", "Name: Fly Development Ontology\n", "Name: FlyAtlas\n", "Name: FlyBase Contolled Vocabulary\n", "Name: FlyBase Controlled Vocabulary\n", "Name: FlyBase PubMed to gene mapping\n", "Name: FlyBase data set for Drosophila ananassae\n", "Name: FlyBase data set for Drosophila erecta\n", "Name: FlyBase data set for Drosophila grimshawi\n", "Name: FlyBase data set for Drosophila melanogaster\n", "Name: FlyBase data set for Drosophila mojavensis\n", "Name: FlyBase data set for Drosophila persimilis\n", "Name: FlyBase data set for Drosophila pseudoobscura pseudoobscura\n", "Name: FlyBase data set for Drosophila sechellia\n", "Name: FlyBase data set for Drosophila simulans\n", "Name: FlyBase data set for Drosophila virilis\n", "Name: FlyBase data set for Drosophila willistoni\n", "Name: FlyBase data set for Drosophila yakuba\n", "Name: FlyBase fasta data set for Drosophila ananassae\n", "Name: FlyBase fasta data set for Drosophila melanogaster\n", "Name: FlyBase fasta data set for Drosophila pseudoobscura pseudoobscura\n", "Name: FlyBase fasta data set for Drosophila simulans\n", "Name: FlyBase fasta data set for Drosophila virilis\n", "Name: FlyMine intergenic regions\n", "Name: GO\n", "Name: GO Annotation data set\n", "Name: GO Annotation for Drosophila melanogaster\n", "Name: HGNC identifiers\n", "Name: HomoloGene homology predictions\n", "Name: Human gene identifiers\n", "Name: INDAC long oligo data set\n", "Name: IntAct molecular interactions\n", "Name: InterPro data set\n", "Name: InterPro domain GO annotations\n", "Name: KEGG pathways data set\n", "Name: Mouse gene identifiers\n", "Name: Mouse identifiers\n", "Name: NCBI Entrez Gene identifiers\n", "Name: NCBI PubMed to gene mapping\n", "Name: OMIM diseases\n", "Name: PSI Molecular Interactions\n", "Name: Panther orthologue and paralogue predictions\n", "Name: PubMed to gene mapping\n", "Name: REDfly Drosophila transcription factor binding sites\n", "Name: REDfly Drosophila transcriptional cis-regulatory modules\n", "Name: RNAi screen phenotypes\n", "Name: Rat gene identifiers\n", "Name: Reactome pathways data set\n", "Name: Sequence Ontology\n", "Name: Swiss-Prot data set\n", "Name: The Gene Ontology\n", "Name: The Sequence Ontology\n", "Name: The developmental transcriptome of Drosophila melanogaster.\n", "Name: TrEMBL data set\n", "Name: TreeFam data set\n", "Name: UniProt data set\n", "Name: Uniprot data set\n", "Name: WormBase gene identifiers\n", "Name: Wormbase gene identifiers\n", "Name: fly-Fish data set\n", "Name: miRBase Targets\n", "Name: microRNA Targets\n" ] } ], "source": [ "registry.getData(\"flymine\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Name: Affymetrix array: GeneChip Drosophila Genome 2.0 Array
\n", "Name: Affymetrix array: GeneChip Drosophila Genome Array
\n", "Name: Alleles and phenotypes
\n", "Name: Artificial deletions
\n", "Name: BDGP cDNA clone data set
\n", "Name: BDGP in situ data set
\n", "Name: BioGRID interaction data set
\n", "Name: Disease Ontology
\n", "Name: Disease models
\n", "Name: Drosophila species orthologues and paralogues
\n", "Name: E-FLYC-6
\n", "Name: Fly Anatomy Ontology
\n", "Name: Fly Development Ontology
\n", "Name: FlyAtlas
\n", "Name: FlyBase Contolled Vocabulary
\n", "Name: FlyBase Controlled Vocabulary
\n", "Name: FlyBase PubMed to gene mapping
\n", "Name: FlyBase data set for Drosophila ananassae
\n", "Name: FlyBase data set for Drosophila erecta
\n", "Name: FlyBase data set for Drosophila grimshawi
\n", "Name: FlyBase data set for Drosophila melanogaster
\n", "Name: FlyBase data set for Drosophila mojavensis
\n", "Name: FlyBase data set for Drosophila persimilis
\n", "Name: FlyBase data set for Drosophila pseudoobscura pseudoobscura
\n", "Name: FlyBase data set for Drosophila sechellia
\n", "Name: FlyBase data set for Drosophila simulans
\n", "Name: FlyBase data set for Drosophila virilis
\n", "Name: FlyBase data set for Drosophila willistoni
\n", "Name: FlyBase data set for Drosophila yakuba
\n", "Name: FlyBase fasta data set for Drosophila ananassae
\n", "Name: FlyBase fasta data set for Drosophila melanogaster
\n", "Name: FlyBase fasta data set for Drosophila pseudoobscura pseudoobscura
\n", "Name: FlyBase fasta data set for Drosophila simulans
\n", "Name: FlyBase fasta data set for Drosophila virilis
\n", "Name: FlyMine intergenic regions
\n", "Name: GO
\n", "Name: GO Annotation data set
\n", "Name: GO Annotation for Drosophila melanogaster
\n", "Name: HGNC identifiers
\n", "Name: HomoloGene homology predictions
\n", "Name: Human gene identifiers
\n", "Name: INDAC long oligo data set
\n", "Name: IntAct molecular interactions
\n", "Name: InterPro data set
\n", "Name: InterPro domain GO annotations
\n", "Name: KEGG pathways data set
\n", "Name: Mouse gene identifiers
\n", "Name: Mouse identifiers
\n", "Name: NCBI Entrez Gene identifiers
\n", "Name: NCBI PubMed to gene mapping
\n", "Name: OMIM diseases
\n", "Name: PSI Molecular Interactions
\n", "Name: Panther orthologue and paralogue predictions
\n", "Name: PubMed to gene mapping
\n", "Name: REDfly Drosophila transcription factor binding sites
\n", "Name: REDfly Drosophila transcriptional cis-regulatory modules
\n", "Name: RNAi screen phenotypes
\n", "Name: Rat gene identifiers
\n", "Name: Reactome pathways data set
\n", "Name: Sequence Ontology
\n", "Name: Swiss-Prot data set
\n", "Name: The Gene Ontology
\n", "Name: The Sequence Ontology
\n", "Name: The developmental transcriptome of Drosophila melanogaster.
\n", "Name: TrEMBL data set
\n", "Name: TreeFam data set
\n", "Name: UniProt data set
\n", "Name: Uniprot data set
\n", "Name: WormBase gene identifiers
\n", "Name: Wormbase gene identifiers
\n", "Name: fly-Fish data set
\n", "Name: miRBase Targets
\n", "Name: microRNA Targets\n" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.4" }, "nbdime-conflicts": { "local_diff": [ { "key": "language_info", "op": "remove" } ], "remote_diff": [ { "diff": [ { "diff": [ { "key": 0, "op": "addrange", "valuelist": [ "3.8.1-final" ] }, { "key": 0, "length": 1, "op": "removerange" } ], "key": "version", "op": "patch" } ], "key": "language_info", "op": "patch" } ] } }, "nbformat": 4, "nbformat_minor": 2 }