{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Semalytics walkthrough\n", "\n", "Welcome to the Semalytics demo!\n", "\n", "\n", "\n", "## Introduction\n", "\n", "This is an extended computational narrative focusing on the platform **Semalytics**, a semantic-based tool for analyzing hierarchical data in translational cancer research. This demo is bundled with the paper:\n", "\n", "> _**Semalytics: a semantic analytics platform for the exploration of distributed and heterogenous cancer data (in translational research)**_\n", "\n", "Biological annotations are modeled in a new **Semantic Web** fashion and are connected to **Wikidata** for knowledge expansion. Note that Semalytics explores annotations that are highly scattered across hierarchical data.\n", "\n", "In this notebook, we use Semalytics to analyze a test dataset and investigate gene alteration-drug interactions. In particular, we focus on the response to **Cetuximab**, an epidermal growth factor receptor (EGFR) inhibitor used for the treatment of several cancer types, such as colorectal cancer. Each cancer is a complex and variable system with unique characteristics at the molecular level, which may determine drug performance. In this demo, we match drug response data with annotations related to a set of 4 genes:\n", "\n", "* BRAF\n", "* EGFR\n", "* HER2 (gene symbol `ERBB2`)\n", "* KRAS\n", "\n", "which are known to be relevant to Cetuximab response in colorectal cancer.\n", "\n", "* In [Chapter 1](#insights), we show how the platform can be used for getting basic data insights about genomic landscapes and drug responses. In particular, we use Semalytics to identify an investigation set (i.e., data trees with both genomic and pharmacological annotations).\n", "\n", "* In [Chapter 2](#inside), we explore the data in the investigation set. First, we get the list of variants for the genes in the panel. 
Then, we explore the co-occurrence of genomic variants and responses to Cetuximab.\n", "\n", "* In [Chapter 3](#wikidata), we use Semalytics to analyze local data while harnessing the extended information in Wikidata, thus gaining new analytical options on our local database. For example, we use federated queries to explore data about drugs other than Cetuximab, which we do not store and maintain locally.\n", "\n", "* Finally, in the [Appendix](#appendix), we list computational references to figures used in the proof-of-concept (PoC) of the paper.\n", "\n", "See the aforementioned article for further details.\n", "\n", "\n", "## Table of contents\n", "\n", "* [Introduction](#intro)\n", "* [General settings](#settings)\n", "* [Chapter 1 - Basic insights](#insights)\n", " - [Annotated nodes](#annotated)\n", " - [Genomic annotations](#genes)\n", " - [Response annotations](#mice) \n", " - [Investigation set: genomic and responses annotations](#miceanddrug)\n", "* [Chapter 2 - Inside the investigation set](#inside)\n", " - [Variants of non-responders](#variantsnonresp)\n", " - [Getting basic data for co-occurrence analysis](#matchinggetdata)\n", " - [Matching annotations in the investigation set](#mset)\n", " - [Matching `feature_amplification` only](#mamplsubset)\n", " - [Matching `sequence_alteration` only](#mseqsubset)\n", "* [Chapter 3 - Querying data with knowledge and Wikidata](#wikidata)\n", " - [Drugs targeting gene products](#drugsrole)\n", " - [Drug information: dabrafenib](#dabrafenib)\n", " - [Querying variants](#queryinnvar)\n", " - [Positive therapeutic predictors](#pos)\n", " - [Negative therapeutic predictors](#neg)\n", " - [Drugs predictions for a specific case](#case)\n", "* [Appendix - PoC figures](#appendix)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## General settings\n", "\n", "General imports and variables" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import utils\n", "import 
pandas as pd\n", "from IPython.display import SVG, display\n", "\n", "# SPARQL endpoints\n", "\n", "# Semalytics (i.e., local data)\n", "# 14,281,125 explicit triples\n", "# 2,391,980 inferred triples\n", "SEMALYTICS_ENDPOINT = 'http://semalytics:7200/repositories/annotationDB'\n", "\n", "# Remote knowledge base\n", "WIKIDATA_ENDPOINT = 'https://query.wikidata.org/sparql'\n", "\n", "# gene panel (HER2 is queried by its gene symbol, ERBB2)\n", "PANEL = {'BRAF','EGFR','ERBB2','KRAS'}\n", "\n", "# investigated variants\n", "VARIANTS = ':sequence_alteration :feature_amplification'\n", "\n", "# enable inline plotting\n", "%matplotlib inline\n", "\n", "# do not truncate data in tables (on pandas >= 1.0, pass None instead of -1)\n", "pd.set_option('display.max_colwidth', -1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Chapter 1 - Basic insights\n", "\n", "We query Semalytics to get basic insights into the data. Semalytics immediately returns analytics on scattered annotations.\n", "\n", "\n", "### Annotated nodes\n", "\n", "We use the following query to retrieve **annotated nodes** (samples for genes or mice for drug responses)." 
] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "there are 3917 annotated nodes in trees\n" ] } ], "source": [ "# the query\n", "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX onto: \n", "select (count(distinct ?node) as ?nodes)\n", "from onto:disable-sameAs\n", "where {\n", " ?case a :Case ;\n", " :hasDescendant ?node .\n", " ?node a :Bioentity ;\n", " :has_annotation ?ann .\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go!\n", "result = result_table['nodes.value'][0] \n", "print(f'there are {result} annotated nodes in trees')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Genomic annotations\n", "\n", "Let $\\mathcal{G}$ be the set of data trees with annotations about `:sequence_alteration` or `:feature_amplification` for **genes in the panel**. We build $\\mathcal{G}$ with the following query:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "there are 354 cases annotated with 1+ variant(s) in the panel (KRAS, EGFR, BRAF, HER2)\n" ] } ], "source": [ "# Cases with annotations in the genes panel\n", "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX onto: \n", "select (count(distinct ?case) as ?cases) \n", "from onto:disable-sameAs\n", "where { \n", " ?case a :Case ;\n", " :hasDescendant ?node .\n", " ?node :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?gene :has_variant ?ref.\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?ref a ?annotation_Type .\n", " VALUES ?annotation_Type { \"\"\"+VARIANTS+\"\"\" }\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go!\n", "result = result_table['cases.value'][0] \n", "print(f'there are {result} cases annotated with 1+ 
variant(s) in the panel (KRAS, EGFR, BRAF, HER2)')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Response annotations\n", "\n", "Let $\\mathcal{D}$ be the set of data trees with annotated mice about **pharmacological responses**. We build $\\mathcal{D}$ with the following query:" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "there are 238 cases annotated with 1+ pharmacological response(s) for the CETUXIMAB\n" ] } ], "source": [ "# Cases with pharmacological annotations\n", "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX onto: \n", "select ?drugName (count(distinct ?case) as ?cases)\n", "from onto:disable-sameAs\n", "where {\n", " ?case a :Case ;\n", " :hasDescendant ?mouse .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :drug_response .\n", " ?drug :has_drug_response ?ref;\n", " :name ?drugName .\n", "}\n", "GROUP BY ?drugName\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go!\n", "drug,cases = result_table['drugName.value'][0],result_table['cases.value'][0]\n", "print(f'there are {cases} cases annotated with 1+ pharmacological response(s) for the {drug}')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Investigation set: genomic and responses annotations\n", "\n", "Let $\\mathcal{S} = (\\mathcal{G} \\cap \\mathcal{D})$ be the investigation scope (i.e., data trees with both genomic and pharmacological annotations). 
We get it through this query:" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "there are 113 cases annotated with 1+ pharmacological response(s) AND 1+ variant(s)\n" ] } ], "source": [ "# Cases with pharmacological and genomic annotations\n", "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX onto: \n", "select (count(distinct ?case) as ?cases)\n", "from onto:disable-sameAs\n", "where {\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :drug_response .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2.\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type { \"\"\"+VARIANTS+\"\"\" }\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go!\n", "investigation_scope = result_table['cases.value'][0]\n", "print(f'there are {investigation_scope} cases annotated with 1+ pharmacological response(s) AND 1+ variant(s)')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Chapter 2 - Inside the investigation set\n", "\n", "In this section, we analyze annotation types for cases in the investigation set. Moreover, we use Semalytics to match variants against responses to Cetuximab.\n", "\n", "\n", "### Variants of non-responders\n", "\n", "With the following query, we get the **list of variants** of non-responder cases (i.e., cases with a `:DRCl_PD` response annotation). The column `alt_p.value` reports the amino-acid change of each `point_mutation` (e.g., G13D) and is empty for `feature_amplification` rows." ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
<table>\n",
" <thead>\n",
"  <tr><th></th><th>case.value</th><th>geneSymbol.value</th><th>type.value</th><th>alt_p.value</th></tr>\n",
" </thead>\n",
" <tbody>\n",
"  <tr><th>0</th><td>CRC0019</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>1</th><td>CRC0021</td><td>KRAS</td><td>point_mutation</td><td>G12C</td></tr>\n",
"  <tr><th>2</th><td>CRC0021</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>3</th><td>CRC0024</td><td>KRAS</td><td>point_mutation</td><td>G13C</td></tr>\n",
"  <tr><th>4</th><td>CRC0027</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>5</th><td>CRC0028</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>6</th><td>CRC0031</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>7</th><td>CRC0053</td><td>KRAS</td><td>point_mutation</td><td>A146T</td></tr>\n",
"  <tr><th>8</th><td>CRC0055</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>9</th><td>CRC0058</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>10</th><td>CRC0060</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>11</th><td>CRC0063</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>12</th><td>CRC0064</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>13</th><td>CRC0067</td><td>KRAS</td><td>point_mutation</td><td>I36M</td></tr>\n",
"  <tr><th>14</th><td>CRC0068</td><td>KRAS</td><td>point_mutation</td><td>G12C</td></tr>\n",
"  <tr><th>15</th><td>CRC0070</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>16</th><td>CRC0073</td><td>KRAS</td><td>point_mutation</td><td>G12C</td></tr>\n",
"  <tr><th>17</th><td>CRC0077</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>18</th><td>CRC0079</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>19</th><td>CRC0080</td><td>ERBB2</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>20</th><td>CRC0082</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>21</th><td>CRC0085</td><td>KRAS</td><td>point_mutation</td><td>A146T</td></tr>\n",
"  <tr><th>22</th><td>CRC0094</td><td>KRAS</td><td>point_mutation</td><td>G12C</td></tr>\n",
"  <tr><th>23</th><td>CRC0105</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>24</th><td>CRC0106</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>25</th><td>CRC0112</td><td>ERBB2</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>26</th><td>CRC0118</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>27</th><td>CRC0124</td><td>ERBB2</td><td>point_mutation</td><td>H878Y</td></tr>\n",
"  <tr><th>28</th><td>CRC0124</td><td>ERBB2</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>29</th><td>CRC0125</td><td>KRAS</td><td>point_mutation</td><td>K117N</td></tr>\n",
"  <tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td></tr>\n",
"  <tr><th>61</th><td>CRC0315</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>62</th><td>CRC0323</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>63</th><td>CRC0324</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>64</th><td>CRC0348</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>65</th><td>CRC0349</td><td>KRAS</td><td>point_mutation</td><td>G12D</td></tr>\n",
"  <tr><th>66</th><td>CRC0382</td><td>KRAS</td><td>point_mutation</td><td>G12C</td></tr>\n",
"  <tr><th>67</th><td>CRC0438</td><td>KRAS</td><td>point_mutation</td><td>Q61K</td></tr>\n",
"  <tr><th>68</th><td>CRC0468</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>69</th><td>CRC0479</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>70</th><td>CRC0480</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>71</th><td>CRC0481</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>72</th><td>CRC0481</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>73</th><td>CRC0484</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>74</th><td>CRC0504</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>75</th><td>CRC0504</td><td>ERBB2</td><td>point_mutation</td><td>R678Q</td></tr>\n",
"  <tr><th>76</th><td>CRC0508</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>77</th><td>CRC0527</td><td>BRAF</td><td>point_mutation</td><td>K601N</td></tr>\n",
"  <tr><th>78</th><td>CRC0527</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>79</th><td>CRC0528</td><td>BRAF</td><td>point_mutation</td><td>V600E</td></tr>\n",
"  <tr><th>80</th><td>CRC0610</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>81</th><td>CRC0626</td><td>KRAS</td><td>point_mutation</td><td>A146V</td></tr>\n",
"  <tr><th>82</th><td>CRC0714</td><td>KRAS</td><td>point_mutation</td><td>G13D</td></tr>\n",
"  <tr><th>83</th><td>CRC0729</td><td>ERBB2</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>84</th><td>CRC0753</td><td>KRAS</td><td>point_mutation</td><td>G12V</td></tr>\n",
"  <tr><th>85</th><td>CRC1063</td><td>BRAF</td><td>point_mutation</td><td>K601E</td></tr>\n",
"  <tr><th>86</th><td>CRC1063</td><td>BRAF</td><td>point_mutation</td><td>T241M</td></tr>\n",
"  <tr><th>87</th><td>CRC1138</td><td>BRAF</td><td>point_mutation</td><td>K601E</td></tr>\n",
"  <tr><th>88</th><td>CRC1169</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
"  <tr><th>89</th><td>CRC1182</td><td>KRAS</td><td>point_mutation</td><td>G12A</td></tr>\n",
"  <tr><th>90</th><td>CRC1278</td><td>EGFR</td><td>feature_amplification</td><td></td></tr>\n",
" </tbody>\n",
"</table>\n",
"<p>91 rows × 4 columns</p>\n",
"
" ], "text/plain": [ " case.value geneSymbol.value type.value alt_p.value\n", "0 CRC0019 KRAS point_mutation G13D \n", "1 CRC0021 KRAS point_mutation G12C \n", "2 CRC0021 KRAS point_mutation G12V \n", "3 CRC0024 KRAS point_mutation G13C \n", "4 CRC0027 KRAS point_mutation G13D \n", "5 CRC0028 KRAS point_mutation G13D \n", "6 CRC0031 KRAS point_mutation G12D \n", "7 CRC0053 KRAS point_mutation A146T \n", "8 CRC0055 KRAS point_mutation G12V \n", "9 CRC0058 KRAS point_mutation G12V \n", "10 CRC0060 KRAS point_mutation G12V \n", "11 CRC0063 KRAS point_mutation G12V \n", "12 CRC0064 KRAS point_mutation G12D \n", "13 CRC0067 KRAS point_mutation I36M \n", "14 CRC0068 KRAS point_mutation G12C \n", "15 CRC0070 KRAS point_mutation G12D \n", "16 CRC0073 KRAS point_mutation G12C \n", "17 CRC0077 KRAS point_mutation G12D \n", "18 CRC0079 BRAF point_mutation V600E \n", "19 CRC0080 ERBB2 feature_amplification \n", "20 CRC0082 KRAS point_mutation G13D \n", "21 CRC0085 KRAS point_mutation A146T \n", "22 CRC0094 KRAS point_mutation G12C \n", "23 CRC0105 EGFR feature_amplification \n", "24 CRC0106 BRAF point_mutation V600E \n", "25 CRC0112 ERBB2 feature_amplification \n", "26 CRC0118 BRAF point_mutation V600E \n", "27 CRC0124 ERBB2 point_mutation H878Y \n", "28 CRC0124 ERBB2 feature_amplification \n", "29 CRC0125 KRAS point_mutation K117N \n", ".. ... ... ... ... 
\n", "61 CRC0315 KRAS point_mutation G13D \n", "62 CRC0323 BRAF point_mutation V600E \n", "63 CRC0324 KRAS point_mutation G12V \n", "64 CRC0348 KRAS point_mutation G12D \n", "65 CRC0349 KRAS point_mutation G12D \n", "66 CRC0382 KRAS point_mutation G12C \n", "67 CRC0438 KRAS point_mutation Q61K \n", "68 CRC0468 KRAS point_mutation G12V \n", "69 CRC0479 KRAS point_mutation G13D \n", "70 CRC0480 BRAF point_mutation V600E \n", "71 CRC0481 KRAS point_mutation G13D \n", "72 CRC0481 EGFR feature_amplification \n", "73 CRC0484 BRAF point_mutation V600E \n", "74 CRC0504 KRAS point_mutation G13D \n", "75 CRC0504 ERBB2 point_mutation R678Q \n", "76 CRC0508 EGFR feature_amplification \n", "77 CRC0527 BRAF point_mutation K601N \n", "78 CRC0527 EGFR feature_amplification \n", "79 CRC0528 BRAF point_mutation V600E \n", "80 CRC0610 EGFR feature_amplification \n", "81 CRC0626 KRAS point_mutation A146V \n", "82 CRC0714 KRAS point_mutation G13D \n", "83 CRC0729 ERBB2 feature_amplification \n", "84 CRC0753 KRAS point_mutation G12V \n", "85 CRC1063 BRAF point_mutation K601E \n", "86 CRC1063 BRAF point_mutation T241M \n", "87 CRC1138 BRAF point_mutation K601E \n", "88 CRC1169 EGFR feature_amplification \n", "89 CRC1182 KRAS point_mutation G12A \n", "90 CRC1278 EGFR feature_amplification \n", "\n", "[91 rows x 4 columns]" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX sesame: \n", "PREFIX onto: \n", "select distinct ?case ?geneSymbol ?type ?alt_p\n", "from onto:disable-sameAs\n", "where {\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", " \n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :DRCl_PD .\n", " \n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2 ;\n", " :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?ref2 a 
?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", " ?ref2 sesame:directType ?type.\n", " OPTIONAL {?ref2 :alt_p ?alt_p }\n", "}\n", "ORDER BY ?case\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# strip URI prefixes\n", "utils.filter_prefixes(result_table)\n", "\n", "\n", "# there you go!\n", "result_table[['case.value', 'geneSymbol.value', 'type.value', 'alt_p.value']].fillna(\"\")\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Getting basic data for co-occurrence analysis\n", "\n", "We first build the basic data collections used to match gene variants against drug responses." ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": [ "# data collections\n", "cases_per_gene = dict()\n", "cases_per_variant = dict()\n", "cases_per_variant_per_gene = dict()\n", "cases_per_response = dict()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We get **cases harboring 1+ variants for each gene in the panel**.\n", "\n", "Note that we count distinct cases per gene. Therefore, a case harboring multiple variants in the same gene is counted only once." 
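, "\n", "\n", "The distinct-case counting described above boils down to collecting case identifiers into one set per gene. A minimal sketch, assuming query rows shaped as (case, gene, variant) tuples; the rows and the helper `distinct_cases_per_gene` are illustrative and not part of `utils`:\n", "\n", "
```python
from collections import defaultdict

# Illustrative (case, gene, variant) rows; hypothetical input, not a real query result.
rows = [
    ('CRC0021', 'KRAS', 'G12C'),
    ('CRC0021', 'KRAS', 'G12V'),  # same case, same gene, second variant
    ('CRC0124', 'ERBB2', 'H878Y'),
    ('CRC0124', 'ERBB2', 'feature_amplification'),
    ('CRC0079', 'BRAF', 'V600E'),
]


def distinct_cases_per_gene(rows):
    # Map each gene symbol to the set of cases harboring 1+ variants in that gene.
    cases = defaultdict(set)
    for case, gene, _variant in rows:
        cases[gene].add(case)
    return dict(cases)


# A set keeps one entry per case, so repeated variants do not inflate the count.
counts = {gene: len(cases) for gene, cases in distinct_cases_per_gene(rows).items()}
print(counts)
```
"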
] }, { "cell_type": "code", "execution_count": 8, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Querying EGFR\n", "Querying BRAF\n", "Querying KRAS\n", "Querying ERBB2\n", "Cases outline:\n", "EGFR: 29\n", "BRAF: 13\n", "KRAS: 70\n", "ERBB2: 11\n" ] } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX sesame: \n", "PREFIX onto: \n", "select distinct ?case\n", "from onto:disable-sameAs\n", "where {{\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", "\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", "\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2 ;\n", " :symbol ?geneSymbol\n", " VALUES ?geneSymbol {{'{}'}}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type {{ :sequence_alteration :feature_amplification }}\n", " ?ref2 sesame:directType ?type. \n", "}}\"\"\"\n", "\n", "for gene in PANEL:\n", " print(f'Querying {gene}')\n", " result_table = utils.query(SEMALYTICS_ENDPOINT, my_query.format(gene))\n", " cases_per_gene[gene] = set(result_table['case.value'])\n", "\n", "print('Cases outline:')\n", "for key in cases_per_gene:\n", " print(f'{key}: {len(cases_per_gene[key])}')\n", " " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Then, we get cases **harboring 1+ `:sequence_alteration` or `:feature_amplification`**.\n", "\n", "Again, note that we count distinct cases, this time per variant type. Therefore, a case harboring multiple variants of the same type is counted only once." 
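, "\n", "\n", "The query cells in this chapter are Python format strings: SPARQL braces are doubled so that `str.format` leaves them in place, while the single `{}` placeholder receives the gene symbol or variant type. A minimal sketch with a shortened, hypothetical template (`TEMPLATE` and `build_query` are illustrative names, and the real queries are longer):\n", "\n", "
```python
# Shortened, hypothetical template; it only illustrates the brace escaping.
TEMPLATE = '''
select distinct ?case
where {{
  ?gene :symbol ?geneSymbol
  VALUES ?geneSymbol {{'{}'}}
}}'''


def build_query(gene_symbol):
    # str.format fills the single {} and turns each doubled brace into one literal brace.
    return TEMPLATE.format(gene_symbol)


query = build_query('KRAS')
print(query)
```
"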
] }, { "cell_type": "code", "execution_count": 9, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Querying sequence_alteration\n", "Querying feature_amplification\n", "Cases outline:\n", "\tsequence_alteration 88\n", "\tfeature_amplification 33\n" ] } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX sesame: \n", "PREFIX onto: \n", "select distinct ?case \n", "from onto:disable-sameAs\n", "where {{\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", "\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", "\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2 ;\n", " :symbol ?geneSymbol\n", " VALUES ?geneSymbol {{'KRAS' 'EGFR' 'BRAF' 'ERBB2'}}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type {{ :{} }}\n", " ?ref2 sesame:directType ?type. \n", "}}\"\"\"\n", "\n", "for variant in ['sequence_alteration', 'feature_amplification']:\n", " print (f'Querying {variant}')\n", " result_table = utils.query(SEMALYTICS_ENDPOINT, my_query.format(variant))\n", " cases_per_variant[variant] = set(result_table['case.value'])\n", " \n", "\n", "print ('Cases outline:')\n", "for variant in cases_per_variant:\n", " print(f'\\t{variant} {len(cases_per_variant[variant])}')\n", " " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Besides, we get cases **harboring 1+ `:sequence_alteration` or `:feature_amplification` for each gene in the panel**.\n" ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Querying sequence_alteration\n", "Querying feature_amplification\n", "no data for feature_amplification - BRAF\n", "no data for feature_amplification - KRAS\n", "Cases outline:\n", "sequence_alteration\n", "\tEGFR 4\n", "\tBRAF 13\n", "\tKRAS 70\n", "\tERBB2 5\n", 
"feature_amplification\n", "\tEGFR 26\n", "\tERBB2 7\n" ] } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX sesame: \n", "PREFIX onto: \n", "select distinct ?case \n", "from onto:disable-sameAs\n", "where {{\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", "\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", "\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2 ;\n", " :symbol ?geneSymbol\n", " VALUES ?geneSymbol {{'{}'}}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type {{ :{} }}\n", " ?ref2 sesame:directType ?type. \n", "}}\"\"\"\n", "\n", "for variant in ['sequence_alteration', 'feature_amplification']:\n", " print (f'Querying {variant}')\n", " cases_per_variant_per_gene[variant] = dict()\n", " for gene in PANEL:\n", " result_table = utils.query(SEMALYTICS_ENDPOINT, my_query.format(gene, variant))\n", " try:\n", " cases_per_variant_per_gene[variant][gene] = set(result_table['case.value'])\n", " except KeyError:\n", " print (f'no data for {variant} - {gene}')\n", "\n", "print ('Cases outline:')\n", "for variant in cases_per_variant_per_gene:\n", " print(f'{variant}')\n", " for gene in cases_per_variant_per_gene[variant]:\n", " print(f'\\t{gene} {len(cases_per_variant_per_gene[variant][gene])}')\n", " " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Variants summary:\n", "\n", "\n", "| Gene | All variant types | :feature_amplification | :sequence_alteration |\n", "|-------------|-----|------------|-----|\n", "| Annotated | 113 | 33 | 88 |\n", "| BRAF | 13 | 0 | 13 |\n", "| EGFR | 29 | 26 | 4 |\n", "| ERBB2 | 11 | 7 | 5 |\n", "| KRAS | 70 | 0 | 70 |" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Finally, we get **cases per response type**." 
] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Querying DRCl_OR\n", "Querying DRCl_SD\n", "Querying DRCl_PD\n", "Cases outline:\n", "DRCl_OR: 7\n", "DRCl_SD: 26\n", "DRCl_PD: 80\n" ] } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX sesame: \n", "PREFIX onto: \n", "select distinct ?case \n", "from onto:disable-sameAs\n", "where {{\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", "\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref sesame:directType :{} .\n", "\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2 ;\n", " :symbol ?geneSymbol\n", " VALUES ?geneSymbol {{'KRAS' 'EGFR' 'BRAF' 'ERBB2'}}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type {{ :sequence_alteration :feature_amplification }}\n", " ?ref2 sesame:directType ?type. \n", "}}\"\"\"\n", "\n", "for response in ['DRCl_OR', 'DRCl_SD', 'DRCl_PD']:\n", " print (f'Querying {response}')\n", " result_table = utils.query(SEMALYTICS_ENDPOINT, my_query.format(response))\n", " cases_per_response[response] = set(result_table['case.value'])\n", "\n", "print ('Cases outline:')\n", "for key in cases_per_response:\n", " print(f'{key}: {len(cases_per_response[key])}')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Since we are also interested in analyzing **variants co-occurrences**, we enumerate all possible combinations (i.e., the power set). We will use these data in the next sections." 
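, "\n", "\n", "The enumeration can be sketched with the standard `itertools` recipe. This is an assumption about what a helper such as `utils.powerset` may look like, not its actual implementation; `panel` is an illustrative list:\n", "\n", "
```python
from itertools import chain, combinations


def powerset(items):
    # Yield every subset of items as a tuple, from the empty tuple to the full set.
    pool = list(items)
    return chain.from_iterable(
        combinations(pool, size) for size in range(len(pool) + 1)
    )


# Illustrative input; a list keeps the element order stable, unlike a set.
panel = ['BRAF', 'EGFR', 'ERBB2', 'KRAS']
subsets = list(powerset(panel))

# A panel of n genes yields 2**n subsets, including the empty one.
print(len(subsets))
```
"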
] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "[(),\n", " ('EGFR',),\n", " ('ERBB2',),\n", " ('BRAF',),\n", " ('KRAS',),\n", " ('EGFR', 'KRAS'),\n", " ('KRAS', 'ERBB2'),\n", " ('BRAF', 'KRAS'),\n", " ('EGFR', 'BRAF'),\n", " ('BRAF', 'ERBB2'),\n", " ('EGFR', 'ERBB2'),\n", " ('EGFR', 'BRAF', 'KRAS'),\n", " ('BRAF', 'KRAS', 'ERBB2'),\n", " ('EGFR', 'BRAF', 'ERBB2'),\n", " ('EGFR', 'KRAS', 'ERBB2'),\n", " ('EGFR', 'BRAF', 'KRAS', 'ERBB2')]" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# create variants co-occurrences list (i.e., the power set of {'BRAF','EGFR','ERBB2','KRAS'})\n", "variants_occurrences = list(utils.powerset(PANEL))\n", "variants_occurrences.sort(key=len)\n", "\n", "# just combinatorics\n", "variants_occurrences" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Matching annotations in the investigation set\n", "\n", "We analyze all annotated cases and we match drug information with gene variants data." ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "113\n" ] } ], "source": [ "# the investigation set\n", "tot = cases_per_gene['BRAF'] | cases_per_gene['EGFR'] | cases_per_gene['ERBB2'] | cases_per_gene['KRAS']\n", "\n", "print(len(tot))" ] }, { "cell_type": "code", "execution_count": 14, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
<table>\n",
" <thead>\n",
"  <tr><th></th><th>gene</th><th>cases</th></tr>\n",
" </thead>\n",
" <tbody>\n",
"  <tr><th>0</th><td>Annotated</td><td>113</td></tr>\n",
"  <tr><th>1</th><td>BRAF</td><td>13</td></tr>\n",
"  <tr><th>2</th><td>EGFR</td><td>29</td></tr>\n",
"  <tr><th>3</th><td>ERBB2</td><td>11</td></tr>\n",
"  <tr><th>4</th><td>KRAS</td><td>70</td></tr>\n",
" </tbody>\n",
"</table>\n",
"
" ], "text/plain": [ " gene cases\n", "0 Annotated 113 \n", "1 BRAF 13 \n", "2 EGFR 29 \n", "3 ERBB2 11 \n", "4 KRAS 70 " ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# create a new Semalytics analysis object\n", "a = utils.Analysis(tot, cases_per_gene, cases_per_response, variants_occurrences)\n", "\n", "# gene variants\n", "a.variants" ] }, { "cell_type": "code", "execution_count": 15, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 15, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot variants distribution\n", "a.plot_variants()" ] }, { "cell_type": "code", "execution_count": 16, "metadata": { "scrolled": false }, "outputs": [ { "data": { "text/html": [ "
<table>\n",
" <thead>\n",
"  <tr><th></th><th>response_type</th><th>cases</th></tr>\n",
" </thead>\n",
" <tbody>\n",
"  <tr><th>0</th><td>response</td><td>7</td></tr>\n",
"  <tr><th>1</th><td>neutral</td><td>26</td></tr>\n",
"  <tr><th>2</th><td>progression</td><td>80</td></tr>\n",
" </tbody>\n",
"</table>\n",
"
" ], "text/plain": [ " response_type cases\n", "0 response 7 \n", "1 neutral 26 \n", "2 progression 80 " ] }, "execution_count": 16, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# responses\n", "a.responses" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "(Note: the following is figure 4b/right in the paper)" ] }, { "cell_type": "code", "execution_count": 17, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "Text(0, 0.5, '')" ] }, "execution_count": 17, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# plot responses\n", "a.plot_responses()" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
<table>\n",
" <thead>\n",
"  <tr><th></th><th>genes</th><th>DRCl_PD</th><th>DRCl_SD</th><th>DRCl_OR</th><th>tot</th><th>progression</th><th>neutral</th><th>response</th></tr>\n",
" </thead>\n",
" <tbody>\n",
"  <tr><th>0</th><td>(EGFR,)</td><td>10</td><td>14</td><td>5</td><td>29</td><td>0.344828</td><td>0.482759</td><td>0.172414</td></tr>\n",
"  <tr><th>1</th><td>(ERBB2,)</td><td>8</td><td>1</td><td>2</td><td>11</td><td>0.727273</td><td>0.0909091</td><td>0.181818</td></tr>\n",
"  <tr><th>2</th><td>(BRAF,)</td><td>13</td><td>0</td><td>0</td><td>13</td><td>1</td><td>0</td><td>0</td></tr>\n",
"  <tr><th>3</th><td>(KRAS,)</td><td>56</td><td>14</td><td>0</td><td>70</td><td>0.8</td><td>0.2</td><td>0</td></tr>\n",
"  <tr><th>4</th><td>(EGFR, KRAS)</td><td>3</td><td>3</td><td>0</td><td>6</td><td>0.5</td><td>0.5</td><td>0</td></tr>\n",
"  <tr><th>5</th><td>(KRAS, ERBB2)</td><td>1</td><td>0</td><td>0</td><td>1</td><td>1</td><td>0</td><td>0</td></tr>\n",
"  <tr><th>6</th><td>(BRAF, KRAS)</td><td>2</td><td>0</td><td>0</td><td>2</td><td>1</td><td>0</td><td>0</td></tr>\n",
"  <tr><th>7</th><td>(EGFR, BRAF)</td><td>1</td><td>0</td><td>0</td><td>1</td><td>1</td><td>0</td><td>0</td></tr>\n",
"  <tr><th>8</th><td>(BRAF, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>9</th><td>(EGFR, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>10</th><td>(EGFR, BRAF, KRAS)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>11</th><td>(BRAF, KRAS, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>12</th><td>(EGFR, BRAF, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>13</th><td>(EGFR, KRAS, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
"  <tr><th>14</th><td>(EGFR, BRAF, KRAS, ERBB2)</td><td>0</td><td>0</td><td>0</td><td>0</td><td></td><td></td><td></td></tr>\n",
" </tbody>\n",
"</table>\n",
"
" ], "text/plain": [ " genes DRCl_PD DRCl_SD DRCl_OR tot progression \\\n", "0 (EGFR,) 10 14 5 29 0.344828 \n", "1 (ERBB2,) 8 1 2 11 0.727273 \n", "2 (BRAF,) 13 0 0 13 1 \n", "3 (KRAS,) 56 14 0 70 0.8 \n", "4 (EGFR, KRAS) 3 3 0 6 0.5 \n", "5 (KRAS, ERBB2) 1 0 0 1 1 \n", "6 (BRAF, KRAS) 2 0 0 2 1 \n", "7 (EGFR, BRAF) 1 0 0 1 1 \n", "8 (BRAF, ERBB2) 0 0 0 0 \n", "9 (EGFR, ERBB2) 0 0 0 0 \n", "10 (EGFR, BRAF, KRAS) 0 0 0 0 \n", "11 (BRAF, KRAS, ERBB2) 0 0 0 0 \n", "12 (EGFR, BRAF, ERBB2) 0 0 0 0 \n", "13 (EGFR, KRAS, ERBB2) 0 0 0 0 \n", "14 (EGFR, BRAF, KRAS, ERBB2) 0 0 0 0 \n", "\n", " neutral response \n", "0 0.482759 0.172414 \n", "1 0.0909091 0.181818 \n", "2 0 0 \n", "3 0.2 0 \n", "4 0.5 0 \n", "5 0 0 \n", "6 0 0 \n", "7 0 0 \n", "8 \n", "9 \n", "10 \n", "11 \n", "12 \n", "13 \n", "14 " ] }, "execution_count": 18, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# variants vs responses\n", "a.matching.fillna(\"\")" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 19, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot matching\n", "a.plot_matching()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Matching `feature_amplification` only\n", "\n", "We analyze cases with only 1+ `feature_amplification` (and with no `sequence_alteration`)" ] }, { "cell_type": "code", "execution_count": 20, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "25\n" ] } ], "source": [ "# create the subset\n", "tot = cases_per_variant['feature_amplification'] - cases_per_variant['sequence_alteration']\n", "print(len(tot))" ] }, { "cell_type": "code", "execution_count": 21, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
genecases
0Annotated25
1BRAF0
2EGFR19
3ERBB26
4KRAS0
\n", "
" ], "text/plain": [ " gene cases\n", "0 Annotated 25 \n", "1 BRAF 0 \n", "2 EGFR 19 \n", "3 ERBB2 6 \n", "4 KRAS 0 " ] }, "execution_count": 21, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# create a new Semalytics analysis object\n", "a = utils.Analysis(tot, cases_per_gene, cases_per_response, variants_occurrences)\n", "\n", "# gene variants\n", "a.variants" ] }, { "cell_type": "code", "execution_count": 22, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 22, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot variants distribution\n", "a.plot_variants()" ] }, { "cell_type": "code", "execution_count": 23, "metadata": { "scrolled": false }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
response_typecases
0response7
1neutral9
2progression9
\n", "
" ], "text/plain": [ " response_type cases\n", "0 response 7 \n", "1 neutral 9 \n", "2 progression 9 " ] }, "execution_count": 23, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# responses\n", "a.responses" ] }, { "cell_type": "code", "execution_count": 24, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Text(0, 0.5, '')" ] }, "execution_count": 24, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# plot responses\n", "a.plot_responses()" ] }, { "cell_type": "code", "execution_count": 25, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
genesDRCl_PDDRCl_SDDRCl_ORtotprogressionneutralresponse
0(EGFR,)595190.2631580.4736840.263158
1(ERBB2,)40260.66666700.333333
2(BRAF,)0000
3(KRAS,)0000
4(EGFR, KRAS)0000
5(KRAS, ERBB2)0000
6(BRAF, KRAS)0000
7(EGFR, BRAF)0000
8(BRAF, ERBB2)0000
9(EGFR, ERBB2)0000
10(EGFR, BRAF, KRAS)0000
11(BRAF, KRAS, ERBB2)0000
12(EGFR, BRAF, ERBB2)0000
13(EGFR, KRAS, ERBB2)0000
14(EGFR, BRAF, KRAS, ERBB2)0000
\n", "
" ], "text/plain": [ " genes DRCl_PD DRCl_SD DRCl_OR tot progression \\\n", "0 (EGFR,) 5 9 5 19 0.263158 \n", "1 (ERBB2,) 4 0 2 6 0.666667 \n", "2 (BRAF,) 0 0 0 0 \n", "3 (KRAS,) 0 0 0 0 \n", "4 (EGFR, KRAS) 0 0 0 0 \n", "5 (KRAS, ERBB2) 0 0 0 0 \n", "6 (BRAF, KRAS) 0 0 0 0 \n", "7 (EGFR, BRAF) 0 0 0 0 \n", "8 (BRAF, ERBB2) 0 0 0 0 \n", "9 (EGFR, ERBB2) 0 0 0 0 \n", "10 (EGFR, BRAF, KRAS) 0 0 0 0 \n", "11 (BRAF, KRAS, ERBB2) 0 0 0 0 \n", "12 (EGFR, BRAF, ERBB2) 0 0 0 0 \n", "13 (EGFR, KRAS, ERBB2) 0 0 0 0 \n", "14 (EGFR, BRAF, KRAS, ERBB2) 0 0 0 0 \n", "\n", " neutral response \n", "0 0.473684 0.263158 \n", "1 0 0.333333 \n", "2 \n", "3 \n", "4 \n", "5 \n", "6 \n", "7 \n", "8 \n", "9 \n", "10 \n", "11 \n", "12 \n", "13 \n", "14 " ] }, "execution_count": 25, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# variants vs responses\n", "a.matching.fillna(\"\")" ] }, { "cell_type": "code", "execution_count": 26, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 26, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot matching\n", "a.plot_matching()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Matching `sequence_alteration` only\n", "\n", "We analyze cases with only 1+ `sequence_alteration` (and with no `feature_amplification`)" ] }, { "cell_type": "code", "execution_count": 27, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "80\n" ] } ], "source": [ "# create the subset\n", "tot = cases_per_variant['sequence_alteration'] - cases_per_variant['feature_amplification']\n", "print (len(tot))" ] }, { "cell_type": "code", "execution_count": 28, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
genecases
0Annotated80
1BRAF12
2EGFR3
3ERBB24
4KRAS65
\n", "
" ], "text/plain": [ " gene cases\n", "0 Annotated 80 \n", "1 BRAF 12 \n", "2 EGFR 3 \n", "3 ERBB2 4 \n", "4 KRAS 65 " ] }, "execution_count": 28, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# create a new Semalytics analysis object\n", "a = utils.Analysis(tot, cases_per_gene, cases_per_response, variants_occurrences)\n", "\n", "# gene variants\n", "a.variants" ] }, { "cell_type": "code", "execution_count": 29, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 29, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot variants distribution\n", "a.plot_variants()" ] }, { "cell_type": "code", "execution_count": 30, "metadata": { "scrolled": false }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
response_typecases
0response0
1neutral14
2progression66
\n", "
" ], "text/plain": [ " response_type cases\n", "0 response 0 \n", "1 neutral 14 \n", "2 progression 66 " ] }, "execution_count": 30, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# responses\n", "a.responses" ] }, { "cell_type": "code", "execution_count": 31, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Text(0, 0.5, '')" ] }, "execution_count": 31, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# plot responses\n", "a.plot_responses()" ] }, { "cell_type": "code", "execution_count": 32, "metadata": { "scrolled": false }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
genesDRCl_PDDRCl_SDDRCl_ORtotprogressionneutralresponse
0(EGFR,)12030.3333330.6666670
1(ERBB2,)31040.750.250
2(BRAF,)120012100
3(KRAS,)54110650.8307690.1692310
4(EGFR, KRAS)1001100
5(KRAS, ERBB2)1001100
6(BRAF, KRAS)2002100
7(EGFR, BRAF)0000
8(BRAF, ERBB2)0000
9(EGFR, ERBB2)0000
10(EGFR, BRAF, KRAS)0000
11(BRAF, KRAS, ERBB2)0000
12(EGFR, BRAF, ERBB2)0000
13(EGFR, KRAS, ERBB2)0000
14(EGFR, BRAF, KRAS, ERBB2)0000
\n", "
" ], "text/plain": [ " genes DRCl_PD DRCl_SD DRCl_OR tot progression \\\n", "0 (EGFR,) 1 2 0 3 0.333333 \n", "1 (ERBB2,) 3 1 0 4 0.75 \n", "2 (BRAF,) 12 0 0 12 1 \n", "3 (KRAS,) 54 11 0 65 0.830769 \n", "4 (EGFR, KRAS) 1 0 0 1 1 \n", "5 (KRAS, ERBB2) 1 0 0 1 1 \n", "6 (BRAF, KRAS) 2 0 0 2 1 \n", "7 (EGFR, BRAF) 0 0 0 0 \n", "8 (BRAF, ERBB2) 0 0 0 0 \n", "9 (EGFR, ERBB2) 0 0 0 0 \n", "10 (EGFR, BRAF, KRAS) 0 0 0 0 \n", "11 (BRAF, KRAS, ERBB2) 0 0 0 0 \n", "12 (EGFR, BRAF, ERBB2) 0 0 0 0 \n", "13 (EGFR, KRAS, ERBB2) 0 0 0 0 \n", "14 (EGFR, BRAF, KRAS, ERBB2) 0 0 0 0 \n", "\n", " neutral response \n", "0 0.666667 0 \n", "1 0.25 0 \n", "2 0 0 \n", "3 0.169231 0 \n", "4 0 0 \n", "5 0 0 \n", "6 0 0 \n", "7 \n", "8 \n", "9 \n", "10 \n", "11 \n", "12 \n", "13 \n", "14 " ] }, "execution_count": 32, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# variants vs responses\n", "a.matching.fillna(\"\")" ] }, { "cell_type": "code", "execution_count": 33, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 33, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# plot matching\n", "a.plot_matching()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Chapter 3 - Querying data with knowlege and Wikidata\n", "\n", "In this section, we are going to query data with extended knowledge. The platform connects Wikidata by leveraging `owl:sameAs` predicates.\n", "\n", "The SPARQL endpoint of Semalytics is federated with the Wikidata one (https://query.wikidata.org/sparql).\n", "\n", "See also this [Web page](https://www.wikidata.org/wiki/User:ProteinBoxBot/SPARQL_Examples#Query_Wikidata_with_SPARQL) for other Wikidata examples related to life sciences. Those queries can be also used for querying local data in Semalytics." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Drugs targeting gene products\n", "\n", "We get chemical compounds (`Q11173`) which physically interacts (`P129`), with a specific role (`P2868`), with products encoded by genes in the investigation panel." ] }, { "cell_type": "code", "execution_count": 34, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
geneSymbol.valuedrugLabel.valueroleLabel.valuegene_productLabel.value
0BRAFdabrafenibenzyme inhibitorB-Raf proto-oncogene, serine/threonine kinase
1BRAFregorafenibenzyme inhibitorB-Raf proto-oncogene, serine/threonine kinase
2BRAFsorafenibenzyme inhibitorB-Raf proto-oncogene, serine/threonine kinase
3BRAFvemurafenibenzyme inhibitorB-Raf proto-oncogene, serine/threonine kinase
4EGFRicotinibenzyme inhibitorEpidermal growth factor receptor
5EGFRdacomitinibenzyme inhibitorEpidermal growth factor receptor
6EGFRosimertinibenzyme inhibitorEpidermal growth factor receptor
7EGFRgefitinibenzyme inhibitorEpidermal growth factor receptor
8EGFRerlotinibenzyme inhibitorEpidermal growth factor receptor
9EGFRlapatinibenzyme inhibitorEpidermal growth factor receptor
10EGFR5-chloro-N2-(4-(4-(dimethylamino)-1-piperidinyl)-2-methoxyphenyl)-N4-(2-(dimethylphosphinyl)phenyl)-2,4-pyrimidinediamineenzyme inhibitorEpidermal growth factor receptor
11EGFRafatinibenzyme inhibitorEpidermal growth factor receptor
12EGFRcanertinibenzyme inhibitorEpidermal growth factor receptor
13EGFRneratinibenzyme inhibitorEpidermal growth factor receptor
14EGFRvandetanibenzyme inhibitorEpidermal growth factor receptor
15ERBB2dacomitinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
16ERBB2lapatinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
17ERBB2afatinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
18ERBB2canertinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
19ERBB2mubritinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
20ERBB2neratinibenzyme inhibitorErb-b2 receptor tyrosine kinase 2
21KRASlonafarnibenzyme inhibitorKRAS proto-oncogene, GTPase
\n", "
" ], "text/plain": [ " geneSymbol.value \\\n", "0 BRAF \n", "1 BRAF \n", "2 BRAF \n", "3 BRAF \n", "4 EGFR \n", "5 EGFR \n", "6 EGFR \n", "7 EGFR \n", "8 EGFR \n", "9 EGFR \n", "10 EGFR \n", "11 EGFR \n", "12 EGFR \n", "13 EGFR \n", "14 EGFR \n", "15 ERBB2 \n", "16 ERBB2 \n", "17 ERBB2 \n", "18 ERBB2 \n", "19 ERBB2 \n", "20 ERBB2 \n", "21 KRAS \n", "\n", " drugLabel.value \\\n", "0 dabrafenib \n", "1 regorafenib \n", "2 sorafenib \n", "3 vemurafenib \n", "4 icotinib \n", "5 dacomitinib \n", "6 osimertinib \n", "7 gefitinib \n", "8 erlotinib \n", "9 lapatinib \n", "10 5-chloro-N2-(4-(4-(dimethylamino)-1-piperidinyl)-2-methoxyphenyl)-N4-(2-(dimethylphosphinyl)phenyl)-2,4-pyrimidinediamine \n", "11 afatinib \n", "12 canertinib \n", "13 neratinib \n", "14 vandetanib \n", "15 dacomitinib \n", "16 lapatinib \n", "17 afatinib \n", "18 canertinib \n", "19 mubritinib \n", "20 neratinib \n", "21 lonafarnib \n", "\n", " roleLabel.value gene_productLabel.value \n", "0 enzyme inhibitor B-Raf proto-oncogene, serine/threonine kinase \n", "1 enzyme inhibitor B-Raf proto-oncogene, serine/threonine kinase \n", "2 enzyme inhibitor B-Raf proto-oncogene, serine/threonine kinase \n", "3 enzyme inhibitor B-Raf proto-oncogene, serine/threonine kinase \n", "4 enzyme inhibitor Epidermal growth factor receptor \n", "5 enzyme inhibitor Epidermal growth factor receptor \n", "6 enzyme inhibitor Epidermal growth factor receptor \n", "7 enzyme inhibitor Epidermal growth factor receptor \n", "8 enzyme inhibitor Epidermal growth factor receptor \n", "9 enzyme inhibitor Epidermal growth factor receptor \n", "10 enzyme inhibitor Epidermal growth factor receptor \n", "11 enzyme inhibitor Epidermal growth factor receptor \n", "12 enzyme inhibitor Epidermal growth factor receptor \n", "13 enzyme inhibitor Epidermal growth factor receptor \n", "14 enzyme inhibitor Epidermal growth factor receptor \n", "15 enzyme inhibitor Erb-b2 receptor tyrosine kinase 2 \n", "16 enzyme inhibitor Erb-b2 receptor 
tyrosine kinase 2 \n", "17 enzyme inhibitor Erb-b2 receptor tyrosine kinase 2 \n", "18 enzyme inhibitor Erb-b2 receptor tyrosine kinase 2 \n", "19 enzyme inhibitor Erb-b2 receptor tyrosine kinase 2 \n", "20 enzyme inhibitor Erb-b2 receptor tyrosine kinase 2 \n", "21 enzyme inhibitor KRAS proto-oncogene, GTPase " ] }, "execution_count": 34, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wd: <http://www.wikidata.org/entity/>\n", "PREFIX wdt: <http://www.wikidata.org/prop/direct/>\n", "PREFIX pq: <http://www.wikidata.org/prop/qualifier/>\n", "PREFIX ps: <http://www.wikidata.org/prop/statement/>\n", "PREFIX p: <http://www.wikidata.org/prop/>\n", "PREFIX wikibase: <http://wikiba.se/ontology#>\n", "PREFIX bd: <http://www.bigdata.com/rdf#>\n", "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n", "\n", "select ?geneSymbol ?drugLabel ?roleLabel ?gene_productLabel\n", "where {\n", "\n", " # Wikidata endpoint \n", " SERVICE <https://query.wikidata.org/sparql> {\n", "\n", " ?chem p:P129 [\n", " ps:P129 ?gene_product ;\n", " pq:P2868 ?role ] .\n", " ?chem wdt:P31 wd:Q11173 .\n", " ?gene_product wdt:P702 ?gene .\n", "\n", " SERVICE wikibase:label { \n", " bd:serviceParam wikibase:language \"en\" . \n", " ?chem rdfs:label ?drugLabel .\n", " ?gene_product rdfs:label ?gene_productLabel .\n", " ?role rdfs:label ?roleLabel .\n", " }\n", " }\n", " \n", " #local data\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", "}\n", "order by ?geneSymbol\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go!\n", "result_table[['geneSymbol.value', 'drugLabel.value', 'roleLabel.value', 'gene_productLabel.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Drug information: dabrafenib\n", "\n", "Now we get from Wikidata the **chemical formula** (`P274`) of one of those drugs, **dabrafenib**..." ] }, { "cell_type": "code", "execution_count": 35, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
chem.typechem.value
0literalC₂₃H₂₀F₃N₅O₂S₂
\n", "
" ], "text/plain": [ " chem.type chem.value\n", "0 literal C₂₃H₂₀F₃N₅O₂S₂" ] }, "execution_count": 35, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX wd: \n", "PREFIX wdt: \n", "SELECT *\n", "WHERE \n", "{\n", " wd:Q3011604 wdt:P274 ?chem .\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(WIKIDATA_ENDPOINT, my_query)\n", "\n", "result_table" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "...as well as its **chemical structure** (`P117`)." ] }, { "cell_type": "code", "execution_count": 36, "metadata": {}, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\t\n", "\n", "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "live rendering from Wikidata of http://commons.wikimedia.org/wiki/Special:FilePath/Dabrafenib.svg\n" ] } ], "source": [ "my_query = \"\"\"\n", "PREFIX wd: \n", "PREFIX wdt: \n", "SELECT *\n", "WHERE \n", "{\n", " wd:Q3011604 wdt:P117 ?struct .\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(WIKIDATA_ENDPOINT, my_query)\n", "\n", "display(SVG(url=result_table['struct.value'][0]))\n", "\n", "print (f'live rendering from Wikidata of {result_table[\"struct.value\"][0]}')\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Finally, we get **medical conditions treated** (P2175), relative data source (1) and information retrieval date.\n", "\n", "_(1) \"dataset containing drug indications extracted from the FDA Adverse Event Reporting System\"_" ] }, { "cell_type": "code", "execution_count": 37, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
medical_conditionLabel.valuereferenceLabel.valuedate.value
0non-small-cell lung carcinomaDrug Indications Extracted from FAERS2018-10-02T00:00:00Z
1skin cancerDrug Indications Extracted from FAERS2018-10-02T00:00:00Z
2metastatic melanomaDrug Indications Extracted from FAERS2018-10-02T00:00:00Z
3melanomaDrug Indications Extracted from FAERS2018-10-02T00:00:00Z
\n", "
" ], "text/plain": [ " medical_conditionLabel.value referenceLabel.value \\\n", "0 non-small-cell lung carcinoma Drug Indications Extracted from FAERS \n", "1 skin cancer Drug Indications Extracted from FAERS \n", "2 metastatic melanoma Drug Indications Extracted from FAERS \n", "3 melanoma Drug Indications Extracted from FAERS \n", "\n", " date.value \n", "0 2018-10-02T00:00:00Z \n", "1 2018-10-02T00:00:00Z \n", "2 2018-10-02T00:00:00Z \n", "3 2018-10-02T00:00:00Z " ] }, "execution_count": 37, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "SELECT ?medical_conditionLabel ?referenceLabel ?date\n", "WHERE \n", "{\n", " wd:Q3011604 p:P2175 [\n", " ps:P2175 ?medical_condition ;\n", " prov:wasDerivedFrom ?source \n", " ].\n", " ?source pr:P248 ?reference ;\n", " pr:P813 ?date\n", " SERVICE wikibase:label { bd:serviceParam wikibase:language \"en\". }\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(WIKIDATA_ENDPOINT, my_query)\n", "\n", "# there you go\n", "result_table[['medical_conditionLabel.value', 'referenceLabel.value', 'date.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Querying variants\n", "\n", "We can also query only cases with variants mapped to Wikidata. Those are entry points for knowledge enrichment. The column `alt_p.value` represents the type of `point_mutation`." ] }, { "cell_type": "code", "execution_count": 38, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " 
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
case.valuevariant.valuegeneSymbol.valuealt_p.valueannotation_Type.value
0CRC0058Q28371388KRASG12Vsequence_alteration
1CRC0063Q28371388KRASG12Vsequence_alteration
2CRC0309Q28371388KRASG12Vsequence_alteration
3CRC0468Q28371388KRASG12Vsequence_alteration
4CRC0265Q28371388KRASG12Vsequence_alteration
5CRC0261Q28371388KRASG12Vsequence_alteration
6CRC0187Q28371388KRASG12Vsequence_alteration
7CRC0242Q28371388KRASG12Vsequence_alteration
8CRC0324Q28371388KRASG12Vsequence_alteration
9CRC0184Q28371388KRASG12Vsequence_alteration
10CRC0165Q28371388KRASG12Vsequence_alteration
11CRC0021Q28371388KRASG12Vsequence_alteration
12CRC0060Q28371388KRASG12Vsequence_alteration
13CRC0753Q28371388KRASG12Vsequence_alteration
14CRC0055Q28371388KRASG12Vsequence_alteration
15CRC0354Q29938363KRASQ61Hsequence_alteration
16CRC0139Q29938363KRASQ61Hsequence_alteration
17CRC0438Q29938368KRASQ61Ksequence_alteration
18CRC0024Q32948338KRASG13Csequence_alteration
19CRC0315Q28371015KRASG13Dsequence_alteration
20CRC0127Q28371015KRASG13Dsequence_alteration
21CRC0071Q28371015KRASG13Dsequence_alteration
22CRC0237Q28371015KRASG13Dsequence_alteration
23CRC0018Q28371015KRASG13Dsequence_alteration
24CRC0019Q28371015KRASG13Dsequence_alteration
25CRC0479Q28371015KRASG13Dsequence_alteration
26CRC0504Q28371015KRASG13Dsequence_alteration
27CRC0714Q28371015KRASG13Dsequence_alteration
28CRC0149Q28371015KRASG13Dsequence_alteration
29CRC0082Q28371015KRASG13Dsequence_alteration
..................
87CRC0362Q28444964EGFRfeature_amplification
88CRC0327Q28444964EGFRfeature_amplification
89CRC0328Q28444964EGFRfeature_amplification
90CRC0449Q28444964EGFRfeature_amplification
91CRC0481Q28444964EGFRfeature_amplification
92CRC0527Q28444964EGFRfeature_amplification
93CRC0537Q28444964EGFRfeature_amplification
94CRC0542Q28444964EGFRfeature_amplification
95CRC1278Q28444964EGFRfeature_amplification
96CRC0480Q21851559BRAFV600Esequence_alteration
97CRC0528Q21851559BRAFV600Esequence_alteration
98CRC0323Q21851559BRAFV600Esequence_alteration
99CRC0118Q21851559BRAFV600Esequence_alteration
100CRC0106Q21851559BRAFV600Esequence_alteration
101CRC0484Q21851559BRAFV600Esequence_alteration
102CRC0079Q21851559BRAFV600Esequence_alteration
103CRC0150Q50092868BRAFG466Vsequence_alteration
104CRC1138Q28371540BRAFK601Esequence_alteration
105CRC1063Q28371540BRAFK601Esequence_alteration
106CRC0504Q28370981ERBB2R678Qsequence_alteration
107CRC0126Q28370984ERBB2V777Lsequence_alteration
108CRC0131Q28370984ERBB2V777Lsequence_alteration
109CRC0124Q29938313ERBB2H878Ysequence_alteration
110CRC0124Q27908387ERBB2feature_amplification
111CRC0080Q27908387ERBB2feature_amplification
112CRC0185Q27908387ERBB2feature_amplification
113CRC0186Q27908387ERBB2feature_amplification
114CRC0112Q27908387ERBB2feature_amplification
115CRC0729Q27908387ERBB2feature_amplification
116CRC0743Q27908387ERBB2feature_amplification
\n", "

117 rows × 5 columns

\n", "
" ], "text/plain": [ " case.value variant.value geneSymbol.value alt_p.value \\\n", "0 CRC0058 Q28371388 KRAS G12V \n", "1 CRC0063 Q28371388 KRAS G12V \n", "2 CRC0309 Q28371388 KRAS G12V \n", "3 CRC0468 Q28371388 KRAS G12V \n", "4 CRC0265 Q28371388 KRAS G12V \n", "5 CRC0261 Q28371388 KRAS G12V \n", "6 CRC0187 Q28371388 KRAS G12V \n", "7 CRC0242 Q28371388 KRAS G12V \n", "8 CRC0324 Q28371388 KRAS G12V \n", "9 CRC0184 Q28371388 KRAS G12V \n", "10 CRC0165 Q28371388 KRAS G12V \n", "11 CRC0021 Q28371388 KRAS G12V \n", "12 CRC0060 Q28371388 KRAS G12V \n", "13 CRC0753 Q28371388 KRAS G12V \n", "14 CRC0055 Q28371388 KRAS G12V \n", "15 CRC0354 Q29938363 KRAS Q61H \n", "16 CRC0139 Q29938363 KRAS Q61H \n", "17 CRC0438 Q29938368 KRAS Q61K \n", "18 CRC0024 Q32948338 KRAS G13C \n", "19 CRC0315 Q28371015 KRAS G13D \n", "20 CRC0127 Q28371015 KRAS G13D \n", "21 CRC0071 Q28371015 KRAS G13D \n", "22 CRC0237 Q28371015 KRAS G13D \n", "23 CRC0018 Q28371015 KRAS G13D \n", "24 CRC0019 Q28371015 KRAS G13D \n", "25 CRC0479 Q28371015 KRAS G13D \n", "26 CRC0504 Q28371015 KRAS G13D \n", "27 CRC0714 Q28371015 KRAS G13D \n", "28 CRC0149 Q28371015 KRAS G13D \n", "29 CRC0082 Q28371015 KRAS G13D \n", ".. ... ... ... ... 
\n", "87 CRC0362 Q28444964 EGFR \n", "88 CRC0327 Q28444964 EGFR \n", "89 CRC0328 Q28444964 EGFR \n", "90 CRC0449 Q28444964 EGFR \n", "91 CRC0481 Q28444964 EGFR \n", "92 CRC0527 Q28444964 EGFR \n", "93 CRC0537 Q28444964 EGFR \n", "94 CRC0542 Q28444964 EGFR \n", "95 CRC1278 Q28444964 EGFR \n", "96 CRC0480 Q21851559 BRAF V600E \n", "97 CRC0528 Q21851559 BRAF V600E \n", "98 CRC0323 Q21851559 BRAF V600E \n", "99 CRC0118 Q21851559 BRAF V600E \n", "100 CRC0106 Q21851559 BRAF V600E \n", "101 CRC0484 Q21851559 BRAF V600E \n", "102 CRC0079 Q21851559 BRAF V600E \n", "103 CRC0150 Q50092868 BRAF G466V \n", "104 CRC1138 Q28371540 BRAF K601E \n", "105 CRC1063 Q28371540 BRAF K601E \n", "106 CRC0504 Q28370981 ERBB2 R678Q \n", "107 CRC0126 Q28370984 ERBB2 V777L \n", "108 CRC0131 Q28370984 ERBB2 V777L \n", "109 CRC0124 Q29938313 ERBB2 H878Y \n", "110 CRC0124 Q27908387 ERBB2 \n", "111 CRC0080 Q27908387 ERBB2 \n", "112 CRC0185 Q27908387 ERBB2 \n", "113 CRC0186 Q27908387 ERBB2 \n", "114 CRC0112 Q27908387 ERBB2 \n", "115 CRC0729 Q27908387 ERBB2 \n", "116 CRC0743 Q27908387 ERBB2 \n", "\n", " annotation_Type.value \n", "0 sequence_alteration \n", "1 sequence_alteration \n", "2 sequence_alteration \n", "3 sequence_alteration \n", "4 sequence_alteration \n", "5 sequence_alteration \n", "6 sequence_alteration \n", "7 sequence_alteration \n", "8 sequence_alteration \n", "9 sequence_alteration \n", "10 sequence_alteration \n", "11 sequence_alteration \n", "12 sequence_alteration \n", "13 sequence_alteration \n", "14 sequence_alteration \n", "15 sequence_alteration \n", "16 sequence_alteration \n", "17 sequence_alteration \n", "18 sequence_alteration \n", "19 sequence_alteration \n", "20 sequence_alteration \n", "21 sequence_alteration \n", "22 sequence_alteration \n", "23 sequence_alteration \n", "24 sequence_alteration \n", "25 sequence_alteration \n", "26 sequence_alteration \n", "27 sequence_alteration \n", "28 sequence_alteration \n", "29 sequence_alteration \n", ".. ... 
\n", "87 feature_amplification \n", "88 feature_amplification \n", "89 feature_amplification \n", "90 feature_amplification \n", "91 feature_amplification \n", "92 feature_amplification \n", "93 feature_amplification \n", "94 feature_amplification \n", "95 feature_amplification \n", "96 sequence_alteration \n", "97 sequence_alteration \n", "98 sequence_alteration \n", "99 sequence_alteration \n", "100 sequence_alteration \n", "101 sequence_alteration \n", "102 sequence_alteration \n", "103 sequence_alteration \n", "104 sequence_alteration \n", "105 sequence_alteration \n", "106 sequence_alteration \n", "107 sequence_alteration \n", "108 sequence_alteration \n", "109 sequence_alteration \n", "110 feature_amplification \n", "111 feature_amplification \n", "112 feature_amplification \n", "113 feature_amplification \n", "114 feature_amplification \n", "115 feature_amplification \n", "116 feature_amplification \n", "\n", "[117 rows x 5 columns]" ] }, "execution_count": 38, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wdt: \n", "PREFIX owl: \n", "select distinct ?case ?variant ?geneSymbol ?alt_p ?annotation_Type\n", "\n", "where {\n", "\n", " SERVICE {\n", " ?variant wdt:P3329 ?id .\n", " }\n", "\n", "\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :drug_response .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?variant .\n", " ?gene :has_variant ?variant.\n", " OPTIONAL {?variant :alt_p ?alt_p }\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?variant a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", "}\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# filter URIs prefixes\n", 
"utils.filter_prefixes(result_table)\n", "\n", "# show the selected columns, with missing values as empty strings\n", "result_table[['case.value', 'variant.value', 'geneSymbol.value', 'alt_p.value', 'annotation_Type.value']].fillna(\"\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Positive therapeutic predictors\n", "\n", "We can use the **variant occurrences** annotated in the local database to query Wikidata for the **associated positive predictions of drug response**. For each prediction, we also retrieve the scientific article that provides the evidence and the medical condition treated." ] }, { "cell_type": "code", "execution_count": 39, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| | geneSymbol.value | variantLabel.value | treatmentLabel.value | diseaseLabel.value | referenceLabel.value |
|---|---|---|---|---|---|
| 0 | BRAF | BRAF G466V | vemurafenib | cancer | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 1 | BRAF | BRAF K601E | vemurafenib | skin melanoma | BRAF(L597) mutations in melanoma are associated with sensitivity to MEK inhibitors |
| 2 | BRAF | BRAF K601E | trametinib | skin melanoma | BRAF(L597) mutations in melanoma are associated with sensitivity to MEK inhibitors |
| 3 | BRAF | BRAF V600E | cobimetinib fumarate | cancer | Mechanism of MEK inhibition determines efficacy in mutant KRAS- versus BRAF-driven cancers |
| 4 | BRAF | BRAF V600E | irinotecan / Panitumumab / vemurafenib combination therapy | cholangiocarcinoma | Complete Clinical Response of BRAF-Mutated Cholangiocarcinoma to Vemurafenib, Panitumumab, and Irinotecan |
| 5 | BRAF | BRAF V600E | vemurafenib | ovarian cancer | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 6 | BRAF | BRAF V600E | Dabrafenib / Trametinib combination therapy | melanoma | Combined BRAF and MEK inhibition in melanoma with BRAF V600 mutations |
| 7 | BRAF | BRAF V600E | vemurafenib | melanoma | Safety and efficacy of vemurafenib in BRAF(V600E) and BRAF(V600K) mutation-positive melanoma (BRIM-3): extended follow-up of a phase 3, randomised, open-label study |
| 8 | BRAF | BRAF V600E | Dabrafenib / Trametinib combination therapy | melanoma | Dabrafenib and trametinib, alone and in combination for BRAF-mutant metastatic melanoma. |
| 9 | BRAF | BRAF V600E | vemurafenib / cobimetinib fumarate combination therapy | melanoma | Combined vemurafenib and cobimetinib in BRAF-mutated melanoma. |
| 10 | BRAF | BRAF V600E | pictilisib | melanoma | First-in-human phase I study of pictilisib (GDC-0941), a potent pan-class I phosphatidylinositol-3-kinase (PI3K) inhibitor, in patients with advanced solid tumors. |
| 11 | BRAF | BRAF V600E | selumetinib / dactolisib combination therapy | melanoma | Primary cross-resistance to BRAFV600E-, MEK1/2- and PI3K/mTOR-specific inhibitors in BRAF-mutant melanoma cells counteracted by dual pathway blockade |
| 12 | BRAF | BRAF V600E | vemurafenib | melanoma | Inhibition of Mutated, Activated BRAF in Metastatic Melanoma |
| 13 | BRAF | BRAF V600E | Dabrafenib / Trametinib combination therapy | melanoma | Combined BRAF and MEK inhibition versus BRAF inhibition alone in melanoma. |
| 14 | BRAF | BRAF V600E | Dabrafenib / Trametinib combination therapy | melanoma | Adjuvant Dabrafenib plus Trametinib in Stage III BRAF-Mutated Melanoma. |
| 15 | BRAF | BRAF V600E | trametinib / vemurafenib / dabrafenib combination therapy | gastrointestinal neuroendocrine tumor | BRAFV600E Mutations in High-Grade Colorectal Neuroendocrine Tumors May Predict Responsiveness to BRAF-MEK Combination Therapy. |
| 16 | BRAF | BRAF V600E | Panitumumab / Trametinib combination therapy | colorectal adenocarcinoma | Combined BRAF, EGFR, and MEK Inhibition in Patients with BRAFV600E-Mutant Colorectal Cancer. |
| 17 | BRAF | BRAF V600E | vemurafenib | laryngeal squamous cell carcinoma | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 18 | BRAF | BRAF V600E | vemurafenib | skin melanoma | Survival in BRAF V600-mutant advanced melanoma treated with vemurafenib |
| 19 | BRAF | BRAF V600E | vemurafenib | skin melanoma | Improved survival with vemurafenib in melanoma with BRAF V600E mutation |
| 20 | BRAF | BRAF V600E | Dabrafenib / Trametinib combination therapy | skin melanoma | Improved overall survival in melanoma with combined dabrafenib and trametinib. |
| 21 | BRAF | BRAF V600E | Sorafenib / Panitumumab combination therapy | colorectal cancer | Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. |
| 22 | BRAF | BRAF V600E | Capecitabine / Vemurafenib / Bevacizumab combination therapy | colorectal cancer | Antitumor activity of BRAF inhibitor vemurafenib in preclinical models of BRAF-mutant colorectal cancer. |
| 23 | BRAF | BRAF V600E | vemurafenib | colorectal cancer | Antitumor activity of BRAF inhibitor vemurafenib in preclinical models of BRAF-mutant colorectal cancer. |
| 24 | BRAF | BRAF V600E | Vemurafenib / Gefitinib / Cetuximab combination therapy | colorectal cancer | Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR. |
| 25 | BRAF | BRAF V600E | dabrafenib | colorectal cancer | Dabrafenib in patients with melanoma, untreated brain metastases, and other solid tumours: a phase 1 dose-escalation trial. |
| 26 | BRAF | BRAF V600E | dactolisib / GDC-0879 combination therapy | colorectal cancer | Concomitant BRAF and PI3K/mTOR blockade is required for effective treatment of BRAF(V600E) colorectal cancer. |
| 27 | BRAF | BRAF V600E | PLX4720 / GDC0941combination therapy | colorectal cancer | A genetic progression model of Braf(V600E)-induced intestinal tumorigenesis reveals targets for therapeutic intervention. |
| 28 | BRAF | BRAF V600E | Vemurafenib / Panitumumab combination therapy | colorectal cancer | Pilot trial of combined BRAF and EGFR inhibition in BRAF-mutant metastatic colorectal cancer patients. |
| 29 | BRAF | BRAF V600E | vemurafenib | colorectal cancer | Phase II Pilot Study of Vemurafenib in Patients With Metastatic BRAF-Mutated Colorectal Cancer. |
| ... | ... | ... | ... | ... | ... |
| 84 | ERBB2 | ERBB2 AMPLIFICATION | ado-trastuzumab emtansine | Her2-receptor positive breast cancer | Phase II study of the antibody drug conjugate trastuzumab-DM1 for the treatment of human epidermal growth factor receptor 2 (HER2)-positive breast cancer after prior HER2-directed therapy. |
| 85 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | scrotum Paget's disease | Metastatic Extramammary Paget's Disease of Scrotum Responds Completely to Single Agent Trastuzumab in a Hemodialysis Patient: Case Report, Molecular Profiling and Brief Review of the Literature. |
| 86 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | gastric adenocarcinoma | Trastuzumab in combination with chemotherapy versus chemotherapy alone for treatment of HER2-positive advanced gastric or gastro-oesophageal junction cancer (ToGA): a phase 3, open-label, randomised controlled trial. |
| 87 | ERBB2 | ERBB2 AMPLIFICATION | lapatinib | gastric adenocarcinoma | Lapatinib plus paclitaxel versus paclitaxel alone in the second-line treatment of HER2-amplified advanced gastric cancer in Asian populations: TyTAN--a randomized, phase III study. |
| 88 | ERBB2 | ERBB2 AMPLIFICATION | Pertuzumab / Trastuzumab combination therapy | bladder carcinoma | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 89 | ERBB2 | ERBB2 AMPLIFICATION | afatinib | pancreatic adenocarcinoma | Afatinib, an Irreversible EGFR Family Inhibitor, Shows Activity Toward Pancreatic Cancer Cells, Alone and in Combination with Radiotherapy, Independent of KRAS Status. |
| 90 | ERBB2 | ERBB2 AMPLIFICATION | Pertuzumab / Trastuzumab combination therapy | biliary tract cancer | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 91 | ERBB2 | ERBB2 AMPLIFICATION | Trastuzumab / Lapatinib combination therapy | colorectal cancer | Dual-targeted therapy with trastuzumab and lapatinib in treatment-refractory, KRAS codon 12/13 wild-type, HER2-positive metastatic colorectal cancer (HERACLES): a proof-of-concept, multicentre, open-label, phase 2 trial. |
| 92 | ERBB2 | ERBB2 AMPLIFICATION | Pertuzumab / Trastuzumab combination therapy | colorectal cancer | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 93 | ERBB2 | ERBB2 AMPLIFICATION | Trastuzumab / irinotecan combination therapy | lung small cell carcinoma | Favorable response to trastuzumab plus irinotecan combination therapy in two patients with HER2-positive relapsed small-cell lung cancer. |
| 94 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | uterine corpus serous adenocarcinoma | Trastuzumab treatment in patients with advanced or recurrent endometrial carcinoma overexpressing HER2/neu. |
| 95 | ERBB2 | ERBB2 AMPLIFICATION | Pertuzumab / Trastuzumab combination therapy | pancreatic cancer | Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. |
| 96 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | non-small-cell lung carcinoma | Randomized phase II trial of gemcitabine-cisplatin with or without trastuzumab in HER2-positive non-small-cell lung cancer. |
| 97 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | non-small-cell lung carcinoma | Trastuzumab plus docetaxel in HER2/neu-positive non-small-cell lung cancer: a California Cancer Consortium screening and phase II trial. |
| 98 | ERBB2 | ERBB2 AMPLIFICATION | ado-trastuzumab emtansine | non-small-cell lung carcinoma | Trastuzumab emtansine is active on HER-2 overexpressing NSCLC cell lines and overcomes gefitinib resistance. |
| 99 | ERBB2 | ERBB2 AMPLIFICATION | dacomitinib | non-small-cell lung carcinoma | Targeting HER2 aberrations as actionable drivers in lung cancers: phase II trial of the pan-HER tyrosine kinase inhibitor dacomitinib in patients with HER2-mutant or amplified tumors |
| 100 | ERBB2 | ERBB2 AMPLIFICATION | trastuzumab | endometrial cancer | Phase II trial of trastuzumab in women with advanced or recurrent, HER2-positive endometrial carcinoma: a Gynecologic Oncology Group study. |
| 101 | KRAS | KRAS A146T | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
| 102 | KRAS | KRAS A146V | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
| 103 | KRAS | KRAS A146V | abemaciclib | non-small-cell lung carcinoma | Whole-exome sequencing and clinical interpretation of formalin-fixed, paraffin-embedded tumor samples to guide precision cancer medicine. |
| 104 | KRAS | KRAS G12C | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
| 105 | KRAS | KRAS G12C | selumetinib / docetaxel trihydrate combination therapy | non-small-cell lung carcinoma | Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer |
| 106 | KRAS | KRAS G12D | MK-2206 | pancreatic carcinoma | First-in-man clinical trial of the oral pan-AKT inhibitor MK-2206 in patients with advanced solid tumors. |
| 107 | KRAS | KRAS G12D | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
| 108 | KRAS | KRAS G12D | selumetinib / dactolisib combination therapy | non-small-cell lung carcinoma | Effective use of PI3K and MEK inhibitors to treat mutant Kras G12D and PIK3CA H1047R murine lung cancers. |
| 109 | KRAS | KRAS G12V | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
| 110 | KRAS | KRAS G12V | palbociclib | non-small-cell lung carcinoma | A synthetic lethal interaction between K-Ras oncogenes and Cdk4 unveils a therapeutic strategy for non-small cell lung carcinoma. |
| 111 | KRAS | KRAS G12V | selumetinib / docetaxel trihydrate combination therapy | non-small-cell lung carcinoma | Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer |
| 112 | KRAS | KRAS G13D | Cetuximab | colorectal cancer | Association of KRAS p.G13D mutation with outcome in patients with chemotherapy-refractory metastatic colorectal cancer treated with cetuximab. |
| 113 | KRAS | KRAS G13D | selumetinib / dactolisib combination therapy | colorectal cancer | Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. |
\n", "

114 rows × 5 columns

\n", "
" ], "text/plain": [ " geneSymbol.value variantLabel.value \\\n", "0 BRAF BRAF G466V \n", "1 BRAF BRAF K601E \n", "2 BRAF BRAF K601E \n", "3 BRAF BRAF V600E \n", "4 BRAF BRAF V600E \n", "5 BRAF BRAF V600E \n", "6 BRAF BRAF V600E \n", "7 BRAF BRAF V600E \n", "8 BRAF BRAF V600E \n", "9 BRAF BRAF V600E \n", "10 BRAF BRAF V600E \n", "11 BRAF BRAF V600E \n", "12 BRAF BRAF V600E \n", "13 BRAF BRAF V600E \n", "14 BRAF BRAF V600E \n", "15 BRAF BRAF V600E \n", "16 BRAF BRAF V600E \n", "17 BRAF BRAF V600E \n", "18 BRAF BRAF V600E \n", "19 BRAF BRAF V600E \n", "20 BRAF BRAF V600E \n", "21 BRAF BRAF V600E \n", "22 BRAF BRAF V600E \n", "23 BRAF BRAF V600E \n", "24 BRAF BRAF V600E \n", "25 BRAF BRAF V600E \n", "26 BRAF BRAF V600E \n", "27 BRAF BRAF V600E \n", "28 BRAF BRAF V600E \n", "29 BRAF BRAF V600E \n", ".. ... ... \n", "84 ERBB2 ERBB2 AMPLIFICATION \n", "85 ERBB2 ERBB2 AMPLIFICATION \n", "86 ERBB2 ERBB2 AMPLIFICATION \n", "87 ERBB2 ERBB2 AMPLIFICATION \n", "88 ERBB2 ERBB2 AMPLIFICATION \n", "89 ERBB2 ERBB2 AMPLIFICATION \n", "90 ERBB2 ERBB2 AMPLIFICATION \n", "91 ERBB2 ERBB2 AMPLIFICATION \n", "92 ERBB2 ERBB2 AMPLIFICATION \n", "93 ERBB2 ERBB2 AMPLIFICATION \n", "94 ERBB2 ERBB2 AMPLIFICATION \n", "95 ERBB2 ERBB2 AMPLIFICATION \n", "96 ERBB2 ERBB2 AMPLIFICATION \n", "97 ERBB2 ERBB2 AMPLIFICATION \n", "98 ERBB2 ERBB2 AMPLIFICATION \n", "99 ERBB2 ERBB2 AMPLIFICATION \n", "100 ERBB2 ERBB2 AMPLIFICATION \n", "101 KRAS KRAS A146T \n", "102 KRAS KRAS A146V \n", "103 KRAS KRAS A146V \n", "104 KRAS KRAS G12C \n", "105 KRAS KRAS G12C \n", "106 KRAS KRAS G12D \n", "107 KRAS KRAS G12D \n", "108 KRAS KRAS G12D \n", "109 KRAS KRAS G12V \n", "110 KRAS KRAS G12V \n", "111 KRAS KRAS G12V \n", "112 KRAS KRAS G13D \n", "113 KRAS KRAS G13D \n", "\n", " treatmentLabel.value \\\n", "0 vemurafenib \n", "1 vemurafenib \n", "2 trametinib \n", "3 cobimetinib fumarate \n", "4 irinotecan / Panitumumab / vemurafenib combination therapy \n", "5 vemurafenib \n", "6 Dabrafenib / Trametinib combination 
therapy \n", "7 vemurafenib \n", "8 Dabrafenib / Trametinib combination therapy \n", "9 vemurafenib / cobimetinib fumarate combination therapy \n", "10 pictilisib \n", "11 selumetinib / dactolisib combination therapy \n", "12 vemurafenib \n", "13 Dabrafenib / Trametinib combination therapy \n", "14 Dabrafenib / Trametinib combination therapy \n", "15 trametinib / vemurafenib / dabrafenib combination therapy \n", "16 Panitumumab / Trametinib combination therapy \n", "17 vemurafenib \n", "18 vemurafenib \n", "19 vemurafenib \n", "20 Dabrafenib / Trametinib combination therapy \n", "21 Sorafenib / Panitumumab combination therapy \n", "22 Capecitabine / Vemurafenib / Bevacizumab combination therapy \n", "23 vemurafenib \n", "24 Vemurafenib / Gefitinib / Cetuximab combination therapy \n", "25 dabrafenib \n", "26 dactolisib / GDC-0879 combination therapy \n", "27 PLX4720 / GDC0941combination therapy \n", "28 Vemurafenib / Panitumumab combination therapy \n", "29 vemurafenib \n", ".. ... \n", "84 ado-trastuzumab emtansine \n", "85 trastuzumab \n", "86 trastuzumab \n", "87 lapatinib \n", "88 Pertuzumab / Trastuzumab combination therapy \n", "89 afatinib \n", "90 Pertuzumab / Trastuzumab combination therapy \n", "91 Trastuzumab / Lapatinib combination therapy \n", "92 Pertuzumab / Trastuzumab combination therapy \n", "93 Trastuzumab / irinotecan combination therapy \n", "94 trastuzumab \n", "95 Pertuzumab / Trastuzumab combination therapy \n", "96 trastuzumab \n", "97 trastuzumab \n", "98 ado-trastuzumab emtansine \n", "99 dacomitinib \n", "100 trastuzumab \n", "101 selumetinib / dactolisib combination therapy \n", "102 selumetinib / dactolisib combination therapy \n", "103 abemaciclib \n", "104 selumetinib / dactolisib combination therapy \n", "105 selumetinib / docetaxel trihydrate combination therapy \n", "106 MK-2206 \n", "107 selumetinib / dactolisib combination therapy \n", "108 selumetinib / dactolisib combination therapy \n", "109 selumetinib / dactolisib 
combination therapy \n", "110 palbociclib \n", "111 selumetinib / docetaxel trihydrate combination therapy \n", "112 Cetuximab \n", "113 selumetinib / dactolisib combination therapy \n", "\n", " diseaseLabel.value \\\n", "0 cancer \n", "1 skin melanoma \n", "2 skin melanoma \n", "3 cancer \n", "4 cholangiocarcinoma \n", "5 ovarian cancer \n", "6 melanoma \n", "7 melanoma \n", "8 melanoma \n", "9 melanoma \n", "10 melanoma \n", "11 melanoma \n", "12 melanoma \n", "13 melanoma \n", "14 melanoma \n", "15 gastrointestinal neuroendocrine tumor \n", "16 colorectal adenocarcinoma \n", "17 laryngeal squamous cell carcinoma \n", "18 skin melanoma \n", "19 skin melanoma \n", "20 skin melanoma \n", "21 colorectal cancer \n", "22 colorectal cancer \n", "23 colorectal cancer \n", "24 colorectal cancer \n", "25 colorectal cancer \n", "26 colorectal cancer \n", "27 colorectal cancer \n", "28 colorectal cancer \n", "29 colorectal cancer \n", ".. ... \n", "84 Her2-receptor positive breast cancer \n", "85 scrotum Paget's disease \n", "86 gastric adenocarcinoma \n", "87 gastric adenocarcinoma \n", "88 bladder carcinoma \n", "89 pancreatic adenocarcinoma \n", "90 biliary tract cancer \n", "91 colorectal cancer \n", "92 colorectal cancer \n", "93 lung small cell carcinoma \n", "94 uterine corpus serous adenocarcinoma \n", "95 pancreatic cancer \n", "96 non-small-cell lung carcinoma \n", "97 non-small-cell lung carcinoma \n", "98 non-small-cell lung carcinoma \n", "99 non-small-cell lung carcinoma \n", "100 endometrial cancer \n", "101 colorectal cancer \n", "102 colorectal cancer \n", "103 non-small-cell lung carcinoma \n", "104 colorectal cancer \n", "105 non-small-cell lung carcinoma \n", "106 pancreatic carcinoma \n", "107 colorectal cancer \n", "108 non-small-cell lung carcinoma \n", "109 colorectal cancer \n", "110 non-small-cell lung carcinoma \n", "111 non-small-cell lung carcinoma \n", "112 colorectal cancer \n", "113 colorectal cancer \n", "\n", " referenceLabel.value \n", "0 
Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "1 BRAF(L597) mutations in melanoma are associated with sensitivity to MEK inhibitors \n", "2 BRAF(L597) mutations in melanoma are associated with sensitivity to MEK inhibitors \n", "3 Mechanism of MEK inhibition determines efficacy in mutant KRAS- versus BRAF-driven cancers \n", "4 Complete Clinical Response of BRAF-Mutated Cholangiocarcinoma to Vemurafenib, Panitumumab, and Irinotecan \n", "5 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "6 Combined BRAF and MEK inhibition in melanoma with BRAF V600 mutations \n", "7 Safety and efficacy of vemurafenib in BRAF(V600E) and BRAF(V600K) mutation-positive melanoma (BRIM-3): extended follow-up of a phase 3, randomised, open-label study \n", "8 Dabrafenib and trametinib, alone and in combination for BRAF-mutant metastatic melanoma. \n", "9 Combined vemurafenib and cobimetinib in BRAF-mutated melanoma. \n", "10 First-in-human phase I study of pictilisib (GDC-0941), a potent pan-class I phosphatidylinositol-3-kinase (PI3K) inhibitor, in patients with advanced solid tumors. \n", "11 Primary cross-resistance to BRAFV600E-, MEK1/2- and PI3K/mTOR-specific inhibitors in BRAF-mutant melanoma cells counteracted by dual pathway blockade \n", "12 Inhibition of Mutated, Activated BRAF in Metastatic Melanoma \n", "13 Combined BRAF and MEK inhibition versus BRAF inhibition alone in melanoma. \n", "14 Adjuvant Dabrafenib plus Trametinib in Stage III BRAF-Mutated Melanoma. \n", "15 BRAFV600E Mutations in High-Grade Colorectal Neuroendocrine Tumors May Predict Responsiveness to BRAF-MEK Combination Therapy. \n", "16 Combined BRAF, EGFR, and MEK Inhibition in Patients with BRAFV600E-Mutant Colorectal Cancer. 
\n", "17 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "18 Survival in BRAF V600-mutant advanced melanoma treated with vemurafenib \n", "19 Improved survival with vemurafenib in melanoma with BRAF V600E mutation \n", "20 Improved overall survival in melanoma with combined dabrafenib and trametinib. \n", "21 Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. \n", "22 Antitumor activity of BRAF inhibitor vemurafenib in preclinical models of BRAF-mutant colorectal cancer. \n", "23 Antitumor activity of BRAF inhibitor vemurafenib in preclinical models of BRAF-mutant colorectal cancer. \n", "24 Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR. \n", "25 Dabrafenib in patients with melanoma, untreated brain metastases, and other solid tumours: a phase 1 dose-escalation trial. \n", "26 Concomitant BRAF and PI3K/mTOR blockade is required for effective treatment of BRAF(V600E) colorectal cancer. \n", "27 A genetic progression model of Braf(V600E)-induced intestinal tumorigenesis reveals targets for therapeutic intervention. \n", "28 Pilot trial of combined BRAF and EGFR inhibition in BRAF-mutant metastatic colorectal cancer patients. \n", "29 Phase II Pilot Study of Vemurafenib in Patients With Metastatic BRAF-Mutated Colorectal Cancer. \n", ".. ... \n", "84 Phase II study of the antibody drug conjugate trastuzumab-DM1 for the treatment of human epidermal growth factor receptor 2 (HER2)-positive breast cancer after prior HER2-directed therapy. \n", "85 Metastatic Extramammary Paget's Disease of Scrotum Responds Completely to Single Agent Trastuzumab in a Hemodialysis Patient: Case Report, Molecular Profiling and Brief Review of the Literature. 
\n", "86 Trastuzumab in combination with chemotherapy versus chemotherapy alone for treatment of HER2-positive advanced gastric or gastro-oesophageal junction cancer (ToGA): a phase 3, open-label, randomised controlled trial. \n", "87 Lapatinib plus paclitaxel versus paclitaxel alone in the second-line treatment of HER2-amplified advanced gastric cancer in Asian populations: TyTAN--a randomized, phase III study. \n", "88 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "89 Afatinib, an Irreversible EGFR Family Inhibitor, Shows Activity Toward Pancreatic Cancer Cells, Alone and in Combination with Radiotherapy, Independent of KRAS Status. \n", "90 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "91 Dual-targeted therapy with trastuzumab and lapatinib in treatment-refractory, KRAS codon 12/13 wild-type, HER2-positive metastatic colorectal cancer (HERACLES): a proof-of-concept, multicentre, open-label, phase 2 trial. \n", "92 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "93 Favorable response to trastuzumab plus irinotecan combination therapy in two patients with HER2-positive relapsed small-cell lung cancer. \n", "94 Trastuzumab treatment in patients with advanced or recurrent endometrial carcinoma overexpressing HER2/neu. \n", "95 Targeted Therapy for Advanced Solid Tumors on the Basis of Molecular Profiles: Results From MyPathway, an Open-Label, Phase IIa Multiple Basket Study. \n", "96 Randomized phase II trial of gemcitabine-cisplatin with or without trastuzumab in HER2-positive non-small-cell lung cancer. 
\n", "97 Trastuzumab plus docetaxel in HER2/neu-positive non-small-cell lung cancer: a California Cancer Consortium screening and phase II trial. \n", "98 Trastuzumab emtansine is active on HER-2 overexpressing NSCLC cell lines and overcomes gefitinib resistance. \n", "99 Targeting HER2 aberrations as actionable drivers in lung cancers: phase II trial of the pan-HER tyrosine kinase inhibitor dacomitinib in patients with HER2-mutant or amplified tumors \n", "100 Phase II trial of trastuzumab in women with advanced or recurrent, HER2-positive endometrial carcinoma: a Gynecologic Oncology Group study. \n", "101 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. \n", "102 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. \n", "103 Whole-exome sequencing and clinical interpretation of formalin-fixed, paraffin-embedded tumor samples to guide precision cancer medicine. \n", "104 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. \n", "105 Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer \n", "106 First-in-man clinical trial of the oral pan-AKT inhibitor MK-2206 in patients with advanced solid tumors. \n", "107 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. \n", "108 Effective use of PI3K and MEK inhibitors to treat mutant Kras G12D and PIK3CA H1047R murine lung cancers. \n", "109 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. 
\n", "110 A synthetic lethal interaction between K-Ras oncogenes and Cdk4 unveils a therapeutic strategy for non-small cell lung carcinoma. \n", "111 Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer \n", "112 Association of KRAS p.G13D mutation with outcome in patients with chemotherapy-refractory metastatic colorectal cancer treated with cetuximab. \n", "113 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. \n", "\n", "[114 rows x 5 columns]" ] }, "execution_count": 39, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wdt: \n", "PREFIX owl: \n", "PREFIX pq: \n", "PREFIX ps: \n", "PREFIX pr: \n", "PREFIX p: \n", "PREFIX prov: \n", "PREFIX wikibase: \n", "PREFIX bd: \n", "PREFIX rdfs: \n", "select distinct ?geneSymbol ?variantLabel ?treatmentLabel ?diseaseLabel ?referenceLabel\n", "\n", "where {\n", "\n", " SERVICE {\n", " ?variant wdt:P3329 ?id .\n", " ?variant p:P3354 [ ps:P3354 ?treatment ; \n", " pq:P2175 ?disease ;\n", " prov:wasDerivedFrom ?source ].\n", " ?source pr:P248 ?reference \n", "\n", " SERVICE wikibase:label { \n", " bd:serviceParam wikibase:language \"en\" . 
\n", " ?variant rdfs:label ?variantLabel .\n", " ?treatment rdfs:label ?treatmentLabel .\n", " ?disease rdfs:label ?diseaseLabel .\n", " ?reference rdfs:label ?referenceLabel\n", " }\n", " }\n", "\n", "\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :drug_response .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?variant .\n", " ?gene :has_variant ?variant.\n", " OPTIONAL {?variant :alt_p ?alt_p }\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?variant a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", "}\n", "order by ?geneSymbol\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go\n", "result_table[['geneSymbol.value', 'variantLabel.value', 'treatmentLabel.value', 'diseaseLabel.value', 'referenceLabel.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Negative therapeutic predictors\n", "\n", "We can use the **variant occurrences** annotated in the local database to query Wikidata for the associated **negative response predictions** to drugs. We also retrieve the scientific article that reports each piece of evidence and the medical condition treated." ] }, { "cell_type": "code", "execution_count": 40, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " 
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
| | geneSymbol.value | variantLabel.value | treatmentLabel.value | diseaseLabel.value | referenceLabel.value |
|---|---|---|---|---|---|
| 0 | BRAF | BRAF V600E | vemurafenib | melanoma | Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. |
| 1 | BRAF | BRAF V600E | pd-0325901 | melanoma | Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. |
| 2 | BRAF | BRAF V600E | trametinib | melanoma | Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. |
| 3 | BRAF | BRAF V600E | Panitumumab | colorectal cancer | Meta-analysis of BRAF mutation as a predictive biomarker of benefit from anti-EGFR monoclonal antibody therapy for RAS wild-type metastatic colorectal cancer |
| 4 | BRAF | BRAF V600E | Cetuximab | colorectal cancer | Meta-analysis of BRAF mutation as a predictive biomarker of benefit from anti-EGFR monoclonal antibody therapy for RAS wild-type metastatic colorectal cancer |
| 5 | BRAF | BRAF V600E | Panitumumab | colorectal cancer | Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. |
| 6 | BRAF | BRAF V600E | Cetuximab | colorectal cancer | Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. |
| 7 | BRAF | BRAF V600E | Cetuximab | colorectal cancer | Effects of KRAS, BRAF, NRAS, and PIK3CA mutations on the efficacy of cetuximab plus chemotherapy in chemotherapy-refractory metastatic colorectal cancer: a retrospective consortium analysis. |
| 8 | BRAF | BRAF V600E | Oxaliplatin | colorectal cancer | Prognostic and predictive value of common mutations for treatment response and survival in patients with metastatic colorectal cancer. |
| 9 | BRAF | BRAF V600E | irinotecan | colorectal cancer | Prognostic and predictive value of common mutations for treatment response and survival in patients with metastatic colorectal cancer. |
| 10 | BRAF | BRAF V600E | dabrafenib | non-small-cell lung carcinoma | Molecular characterization of acquired resistance to the BRAF inhibitor dabrafenib in a patient with BRAF-mutant non-small-cell lung cancer. |
| 11 | EGFR | EGFR G465R | Panitumumab | colorectal cancer | The First-in-class Anti-EGFR Antibody Mixture Sym004 Overcomes Cetuximab Resistance Mediated by EGFR Extracellular Domain Mutations in Colorectal Cancer. |
| 12 | EGFR | EGFR G465R | Cetuximab | colorectal cancer | The First-in-class Anti-EGFR Antibody Mixture Sym004 Overcomes Cetuximab Resistance Mediated by EGFR Extracellular Domain Mutations in Colorectal Cancer. |
| 13 | EGFR | EGFR AMPLIFICATION | osimertinib | non-small-cell lung carcinoma | Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. |
| 14 | EGFR | EGFR AMPLIFICATION | rociletinib | non-small-cell lung carcinoma | Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. |
| 15 | ERBB2 | ERBB2 AMPLIFICATION | Cetuximab | colorectal cancer | A molecularly annotated platform of patient-derived xenografts (\"xenopatients\") identifies HER2 as an effective therapeutic target in cetuximab-resistant colorectal cancer. |
| 16 | ERBB2 | ERBB2 AMPLIFICATION | Panitumumab | colorectal cancer | HER2 gene copy number status may influence clinical efficacy to anti-EGFR monoclonal antibodies in metastatic colorectal cancer patients. |
| 17 | ERBB2 | ERBB2 AMPLIFICATION | Cetuximab | colorectal cancer | HER2 gene copy number status may influence clinical efficacy to anti-EGFR monoclonal antibodies in metastatic colorectal cancer patients. |
| 18 | ERBB2 | ERBB2 AMPLIFICATION | Cetuximab / capecitabine / Oxaliplatin combination therapy | colorectal cancer | HER2 in high-risk rectal cancer patients treated in EXPERT-C, a randomized phase II trial of neoadjuvant capecitabine and oxaliplatin (CAPOX) and chemoradiotherapy (CRT) with or without cetuximab. |
| 19 | ERBB2 | ERBB2 AMPLIFICATION | Cetuximab | colorectal cancer | HER2 Amplification and Cetuximab Efficacy in Patients With Metastatic Colorectal Cancer Harboring Wild-type RAS and BRAF. |
| 20 | ERBB2 | ERBB2 AMPLIFICATION | gefitinib | adenocarcinoma of the lung | Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. |
| 21 | ERBB2 | ERBB2 AMPLIFICATION | erlotinib | adenocarcinoma of the lung | Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. |
| 22 | KRAS | KRAS A146T | Cetuximab | colorectal cancer | Genomic and biological characterization of exon 4 KRAS mutations in human cancer |
| 23 | KRAS | KRAS A146T | FOLFOX-4 / Cetuximab combination therapy | colorectal cancer | FOLFOX4 Plus Cetuximab for Patients With Previously Untreated Metastatic Colorectal Cancer According to Tumor RAS and BRAF Mutation Status: Updated Analysis of the CECOG/CORE 1.2.002 Study. |
| 24 | KRAS | KRAS G12A | Panitumumab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 25 | KRAS | KRAS G12A | Cetuximab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 26 | KRAS | KRAS G12A | regorafenib | colorectal cancer | KRAS exon 2 mutations influence activity of regorafenib in an SW48-based disease model of colorectal cancer. |
| 27 | KRAS | KRAS G12A | melphalan | multiple myeloma | Reduction of serum IGF-I levels in patients affected with Monoclonal Gammopathies of undetermined significance or Multiple Myeloma. Comparison with bFGF, VEGF and K-ras gene mutation. |
| 28 | KRAS | KRAS G12A | melphalan | multiple myeloma | Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. |
| 29 | KRAS | KRAS G12A | melphalan | multiple myeloma | Activation of N-ras and K-ras induced by interleukin-6 in a myeloma cell line: implications for disease progression and therapeutic response. |
| ... | ... | ... | ... | ... | ... |
| 33 | KRAS | KRAS G12A | erlotinib | adenocarcinoma of the lung | Clinical implications of KRAS mutations in lung cancer patients treated with tyrosine kinase inhibitors: an important role for mutations in minor clones |
| 34 | KRAS | KRAS G12C | gefitinib | colorectal cancer | The dominant role of G12C over other KRAS mutation types in the negative prediction of efficacy of epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer. |
| 35 | KRAS | KRAS G12C | erlotinib | colorectal cancer | The dominant role of G12C over other KRAS mutation types in the negative prediction of efficacy of epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer. |
| 36 | KRAS | KRAS G12C | melphalan | multiple myeloma | Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. |
| 37 | KRAS | KRAS G12C | melphalan | multiple myeloma | Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. |
| 38 | KRAS | KRAS G12C | gefitinib | lung cancer | PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. |
| 39 | KRAS | KRAS G12D | vemurafenib | melanoma | Acquired resistance and clonal evolution in melanoma during BRAF inhibitor therapy |
| 40 | KRAS | KRAS G12D | Panitumumab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 41 | KRAS | KRAS G12D | Cetuximab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 42 | KRAS | KRAS G12D | vemurafenib | hairy cell leukemia | Targeting Mutant BRAF in Relapsed or Refractory Hairy-Cell Leukemia |
| 43 | KRAS | KRAS G12D | melphalan | multiple myeloma | Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. |
| 44 | KRAS | KRAS G12D | melphalan | multiple myeloma | Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. |
| 45 | KRAS | KRAS G12D | gefitinib | lung cancer | PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. |
| 46 | KRAS | KRAS G12R | Cetuximab | colorectal cancer | Emergence of KRAS mutations and acquired resistance to anti-EGFR therapy in colorectal cancer |
| 47 | KRAS | KRAS G12S | Panitumumab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 48 | KRAS | KRAS G12S | Cetuximab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 49 | KRAS | KRAS G12S | melphalan | multiple myeloma | Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. |
| 50 | KRAS | KRAS G12S | melphalan | multiple myeloma | Activation of N-ras and K-ras induced by interleukin-6 in a myeloma cell line: implications for disease progression and therapeutic response. |
| 51 | KRAS | KRAS G12S | melphalan | multiple myeloma | Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. |
| 52 | KRAS | KRAS G12S | gefitinib | lung cancer | PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. |
| 53 | KRAS | KRAS G12V | crizotinib | cancer | Durable Response to Crizotinib in a MET-Amplified, KRAS-Mutated Carcinoma of Unknown Primary. |
| 54 | KRAS | KRAS G12V | Panitumumab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 55 | KRAS | KRAS G12V | Cetuximab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 56 | KRAS | KRAS G12V | Cetuximab | colorectal cancer | Association of KRAS p.G13D mutation with outcome in patients with chemotherapy-refractory metastatic colorectal cancer treated with cetuximab. |
| 57 | KRAS | KRAS G12V | gefitinib | lung cancer | PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. |
| 58 | KRAS | KRAS G13D | Panitumumab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 59 | KRAS | KRAS G13D | Cetuximab | colorectal cancer | PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. |
| 60 | KRAS | KRAS G13D | Cetuximab | colorectal cancer | Phase II study of single-agent cetuximab in KRAS G13D mutant metastatic colorectal cancer. |
| 61 | KRAS | KRAS G13D | Cetuximab | colorectal cancer | Cetuximab treatment for metastatic colorectal cancer with KRAS p.G13D mutations improves progression-free survival. |
| 62 | KRAS | KRAS G13D | Cetuximab | colorectal cancer | Meta-analysis comparing the efficacy of anti-EGFR monoclonal antibody therapy between KRAS G13D and other KRAS mutant metastatic colorectal cancer tumours. |
\n", "

63 rows × 5 columns

\n", "
" ], "text/plain": [ " geneSymbol.value variantLabel.value \\\n", "0 BRAF BRAF V600E \n", "1 BRAF BRAF V600E \n", "2 BRAF BRAF V600E \n", "3 BRAF BRAF V600E \n", "4 BRAF BRAF V600E \n", "5 BRAF BRAF V600E \n", "6 BRAF BRAF V600E \n", "7 BRAF BRAF V600E \n", "8 BRAF BRAF V600E \n", "9 BRAF BRAF V600E \n", "10 BRAF BRAF V600E \n", "11 EGFR EGFR G465R \n", "12 EGFR EGFR G465R \n", "13 EGFR EGFR AMPLIFICATION \n", "14 EGFR EGFR AMPLIFICATION \n", "15 ERBB2 ERBB2 AMPLIFICATION \n", "16 ERBB2 ERBB2 AMPLIFICATION \n", "17 ERBB2 ERBB2 AMPLIFICATION \n", "18 ERBB2 ERBB2 AMPLIFICATION \n", "19 ERBB2 ERBB2 AMPLIFICATION \n", "20 ERBB2 ERBB2 AMPLIFICATION \n", "21 ERBB2 ERBB2 AMPLIFICATION \n", "22 KRAS KRAS A146T \n", "23 KRAS KRAS A146T \n", "24 KRAS KRAS G12A \n", "25 KRAS KRAS G12A \n", "26 KRAS KRAS G12A \n", "27 KRAS KRAS G12A \n", "28 KRAS KRAS G12A \n", "29 KRAS KRAS G12A \n", ".. ... ... \n", "33 KRAS KRAS G12A \n", "34 KRAS KRAS G12C \n", "35 KRAS KRAS G12C \n", "36 KRAS KRAS G12C \n", "37 KRAS KRAS G12C \n", "38 KRAS KRAS G12C \n", "39 KRAS KRAS G12D \n", "40 KRAS KRAS G12D \n", "41 KRAS KRAS G12D \n", "42 KRAS KRAS G12D \n", "43 KRAS KRAS G12D \n", "44 KRAS KRAS G12D \n", "45 KRAS KRAS G12D \n", "46 KRAS KRAS G12R \n", "47 KRAS KRAS G12S \n", "48 KRAS KRAS G12S \n", "49 KRAS KRAS G12S \n", "50 KRAS KRAS G12S \n", "51 KRAS KRAS G12S \n", "52 KRAS KRAS G12S \n", "53 KRAS KRAS G12V \n", "54 KRAS KRAS G12V \n", "55 KRAS KRAS G12V \n", "56 KRAS KRAS G12V \n", "57 KRAS KRAS G12V \n", "58 KRAS KRAS G13D \n", "59 KRAS KRAS G13D \n", "60 KRAS KRAS G13D \n", "61 KRAS KRAS G13D \n", "62 KRAS KRAS G13D \n", "\n", " treatmentLabel.value \\\n", "0 vemurafenib \n", "1 pd-0325901 \n", "2 trametinib \n", "3 Panitumumab \n", "4 Cetuximab \n", "5 Panitumumab \n", "6 Cetuximab \n", "7 Cetuximab \n", "8 Oxaliplatin \n", "9 irinotecan \n", "10 dabrafenib \n", "11 Panitumumab \n", "12 Cetuximab \n", "13 osimertinib \n", "14 rociletinib \n", "15 Cetuximab \n", "16 Panitumumab \n", "17 
Cetuximab \n", "18 Cetuximab / capecitabine / Oxaliplatin combination therapy \n", "19 Cetuximab \n", "20 gefitinib \n", "21 erlotinib \n", "22 Cetuximab \n", "23 FOLFOX-4 / Cetuximab combination therapy \n", "24 Panitumumab \n", "25 Cetuximab \n", "26 regorafenib \n", "27 melphalan \n", "28 melphalan \n", "29 melphalan \n", ".. ... \n", "33 erlotinib \n", "34 gefitinib \n", "35 erlotinib \n", "36 melphalan \n", "37 melphalan \n", "38 gefitinib \n", "39 vemurafenib \n", "40 Panitumumab \n", "41 Cetuximab \n", "42 vemurafenib \n", "43 melphalan \n", "44 melphalan \n", "45 gefitinib \n", "46 Cetuximab \n", "47 Panitumumab \n", "48 Cetuximab \n", "49 melphalan \n", "50 melphalan \n", "51 melphalan \n", "52 gefitinib \n", "53 crizotinib \n", "54 Panitumumab \n", "55 Cetuximab \n", "56 Cetuximab \n", "57 gefitinib \n", "58 Panitumumab \n", "59 Cetuximab \n", "60 Cetuximab \n", "61 Cetuximab \n", "62 Cetuximab \n", "\n", " diseaseLabel.value \\\n", "0 melanoma \n", "1 melanoma \n", "2 melanoma \n", "3 colorectal cancer \n", "4 colorectal cancer \n", "5 colorectal cancer \n", "6 colorectal cancer \n", "7 colorectal cancer \n", "8 colorectal cancer \n", "9 colorectal cancer \n", "10 non-small-cell lung carcinoma \n", "11 colorectal cancer \n", "12 colorectal cancer \n", "13 non-small-cell lung carcinoma \n", "14 non-small-cell lung carcinoma \n", "15 colorectal cancer \n", "16 colorectal cancer \n", "17 colorectal cancer \n", "18 colorectal cancer \n", "19 colorectal cancer \n", "20 adenocarcinoma of the lung \n", "21 adenocarcinoma of the lung \n", "22 colorectal cancer \n", "23 colorectal cancer \n", "24 colorectal cancer \n", "25 colorectal cancer \n", "26 colorectal cancer \n", "27 multiple myeloma \n", "28 multiple myeloma \n", "29 multiple myeloma \n", ".. ... 
\n", "33 adenocarcinoma of the lung \n", "34 colorectal cancer \n", "35 colorectal cancer \n", "36 multiple myeloma \n", "37 multiple myeloma \n", "38 lung cancer \n", "39 melanoma \n", "40 colorectal cancer \n", "41 colorectal cancer \n", "42 hairy cell leukemia \n", "43 multiple myeloma \n", "44 multiple myeloma \n", "45 lung cancer \n", "46 colorectal cancer \n", "47 colorectal cancer \n", "48 colorectal cancer \n", "49 multiple myeloma \n", "50 multiple myeloma \n", "51 multiple myeloma \n", "52 lung cancer \n", "53 cancer \n", "54 colorectal cancer \n", "55 colorectal cancer \n", "56 colorectal cancer \n", "57 lung cancer \n", "58 colorectal cancer \n", "59 colorectal cancer \n", "60 colorectal cancer \n", "61 colorectal cancer \n", "62 colorectal cancer \n", "\n", " referenceLabel.value \n", "0 Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. \n", "1 Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. \n", "2 Loss of NF1 in cutaneous melanoma is associated with RAS activation and MEK dependence. \n", "3 Meta-analysis of BRAF mutation as a predictive biomarker of benefit from anti-EGFR monoclonal antibody therapy for RAS wild-type metastatic colorectal cancer \n", "4 Meta-analysis of BRAF mutation as a predictive biomarker of benefit from anti-EGFR monoclonal antibody therapy for RAS wild-type metastatic colorectal cancer \n", "5 Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. \n", "6 Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. \n", "7 Effects of KRAS, BRAF, NRAS, and PIK3CA mutations on the efficacy of cetuximab plus chemotherapy in chemotherapy-refractory metastatic colorectal cancer: a retrospective consortium analysis. \n", "8 Prognostic and predictive value of common mutations for treatment response and survival in patients with metastatic colorectal cancer. 
\n", "9 Prognostic and predictive value of common mutations for treatment response and survival in patients with metastatic colorectal cancer. \n", "10 Molecular characterization of acquired resistance to the BRAF inhibitor dabrafenib in a patient with BRAF-mutant non-small-cell lung cancer. \n", "11 The First-in-class Anti-EGFR Antibody Mixture Sym004 Overcomes Cetuximab Resistance Mediated by EGFR Extracellular Domain Mutations in Colorectal Cancer. \n", "12 The First-in-class Anti-EGFR Antibody Mixture Sym004 Overcomes Cetuximab Resistance Mediated by EGFR Extracellular Domain Mutations in Colorectal Cancer. \n", "13 Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. \n", "14 Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. \n", "15 A molecularly annotated platform of patient-derived xenografts (\"xenopatients\") identifies HER2 as an effective therapeutic target in cetuximab-resistant colorectal cancer. \n", "16 HER2 gene copy number status may influence clinical efficacy to anti-EGFR monoclonal antibodies in metastatic colorectal cancer patients. \n", "17 HER2 gene copy number status may influence clinical efficacy to anti-EGFR monoclonal antibodies in metastatic colorectal cancer patients. \n", "18 HER2 in high-risk rectal cancer patients treated in EXPERT-C, a randomized phase II trial of neoadjuvant capecitabine and oxaliplatin (CAPOX) and chemoradiotherapy (CRT) with or without cetuximab. \n", "19 HER2 Amplification and Cetuximab Efficacy in Patients With Metastatic Colorectal Cancer Harboring Wild-type RAS and BRAF. \n", "20 Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. 
\n", "21 Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. \n", "22 Genomic and biological characterization of exon 4 KRAS mutations in human cancer \n", "23 FOLFOX4 Plus Cetuximab for Patients With Previously Untreated Metastatic Colorectal Cancer According to Tumor RAS and BRAF Mutation Status: Updated Analysis of the CECOG/CORE 1.2.002 Study. \n", "24 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "25 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "26 KRAS exon 2 mutations influence activity of regorafenib in an SW48-based disease model of colorectal cancer. \n", "27 Reduction of serum IGF-I levels in patients affected with Monoclonal Gammopathies of undetermined significance or Multiple Myeloma. Comparison with bFGF, VEGF and K-ras gene mutation. \n", "28 Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. \n", "29 Activation of N-ras and K-ras induced by interleukin-6 in a myeloma cell line: implications for disease progression and therapeutic response. \n", ".. ... \n", "33 Clinical implications of KRAS mutations in lung cancer patients treated with tyrosine kinase inhibitors: an important role for mutations in minor clones \n", "34 The dominant role of G12C over other KRAS mutation types in the negative prediction of efficacy of epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer. \n", "35 The dominant role of G12C over other KRAS mutation types in the negative prediction of efficacy of epidermal growth factor receptor tyrosine kinase inhibitors in non-small cell lung cancer. 
\n", "36 Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. \n", "37 Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. \n", "38 PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. \n", "39 Acquired resistance and clonal evolution in melanoma during BRAF inhibitor therapy \n", "40 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "41 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "42 Targeting Mutant BRAF in Relapsed or Refractory Hairy-Cell Leukemia \n", "43 Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. \n", "44 Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. \n", "45 PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. \n", "46 Emergence of KRAS mutations and acquired resistance to anti-EGFR therapy in colorectal cancer \n", "47 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "48 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "49 Oncogenic RAS mutations in myeloma cells selectively induce cox-2 expression, which participates in enhanced adhesion to fibronectin and chemoresistance. \n", "50 Activation of N-ras and K-ras induced by interleukin-6 in a myeloma cell line: implications for disease progression and therapeutic response. 
\n", "51 Heterogeneity in therapeutic response of genetically altered myeloma cell lines to interleukin 6, dexamethasone, doxorubicin, and melphalan. \n", "52 PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. \n", "53 Durable Response to Crizotinib in a MET-Amplified, KRAS-Mutated Carcinoma of Unknown Primary. \n", "54 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "55 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "56 Association of KRAS p.G13D mutation with outcome in patients with chemotherapy-refractory metastatic colorectal cancer treated with cetuximab. \n", "57 PTEN and PIK3CA expression is associated with prolonged survival after gefitinib treatment in EGFR-mutated lung cancer patients. \n", "58 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "59 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "60 Phase II study of single-agent cetuximab in KRAS G13D mutant metastatic colorectal cancer. \n", "61 Cetuximab treatment for metastatic colorectal cancer with KRAS p.G13D mutations improves progression-free survival. \n", "62 Meta-analysis comparing the efficacy of anti-EGFR monoclonal antibody therapy between KRAS G13D and other KRAS mutant metastatic colorectal cancer tumours. 
\n", "\n", "[63 rows x 5 columns]" ] }, "execution_count": 40, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wdt: \n", "PREFIX owl: \n", "PREFIX pq: \n", "PREFIX ps: \n", "PREFIX pr: \n", "PREFIX p: \n", "PREFIX prov: \n", "PREFIX wikibase: \n", "PREFIX bd: \n", "PREFIX rdfs: \n", "select distinct ?geneSymbol ?variantLabel ?treatmentLabel ?diseaseLabel ?referenceLabel\n", "\n", "where {\n", "\n", " SERVICE {\n", " ?variant wdt:P3329 ?id .\n", " ?variant p:P3355 [ ps:P3355 ?treatment ; \n", " pq:P2175 ?disease ;\n", " prov:wasDerivedFrom ?source ].\n", " ?source pr:P248 ?reference \n", "\n", " SERVICE wikibase:label { \n", " bd:serviceParam wikibase:language \"en\" . \n", " ?variant rdfs:label ?variantLabel .\n", " ?treatment rdfs:label ?treatmentLabel .\n", " ?disease rdfs:label ?diseaseLabel .\n", " ?reference rdfs:label ?referenceLabel\n", " }\n", " }\n", "\n", "\n", " ?case a :Case ;\n", " :hasDescendant ?mouse ;\n", " :hasDescendant ?node .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref a :drug_response .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?variant .\n", " ?gene :has_variant ?variant.\n", " OPTIONAL {?variant :alt_p ?alt_p }\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?variant a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", "}\n", "order by ?geneSymbol\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go\n", "result_table[['geneSymbol.value', 'variantLabel.value', 'treatmentLabel.value', 'diseaseLabel.value', 'referenceLabel.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "### Drugs predictions for a specific case\n", "\n", "We can use the connection to Wikidata to query evidence of drug response predictions 
(i.e., positive or negative) associated with the variants harbored by a specific case (here, _id=CRC0481_).\n", "\n", "**Positive response predictions**" ] }, { "cell_type": "code", "execution_count": 41, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " geneSymbol.value variantLabel.value \\\n", "0 EGFR EGFR AMPLIFICATION \n", "1 EGFR EGFR AMPLIFICATION \n", "2 EGFR EGFR AMPLIFICATION \n", "3 EGFR EGFR AMPLIFICATION \n", "4 EGFR EGFR AMPLIFICATION \n", "5 EGFR EGFR AMPLIFICATION \n", "6 EGFR EGFR AMPLIFICATION \n", "7 KRAS KRAS G13D \n", "8 KRAS KRAS G13D \n", "\n", " treatmentLabel.value \\\n", "0 Cetuximab / platinum / fluorouracil combination therapy \n", "1 Cetuximab \n", "2 Panitumumab \n", "3 Cetuximab \n", "4 EGFR inhibitor \n", "5 gefitinib \n", "6 erlotinib \n", "7 Cetuximab \n", "8 selumetinib / dactolisib combination therapy \n", "\n", " diseaseLabel.value \\\n", "0 head and neck squamous cell carcinoma \n", "1 colorectal cancer \n", "2 colorectal cancer \n", "3 colorectal cancer \n", "4 non-small-cell lung carcinoma \n", "5 non-small-cell lung carcinoma \n", "6 non-small-cell lung carcinoma \n", "7 colorectal cancer \n", "8 colorectal cancer \n", "\n", " referenceLabel.value \n", "0 Evaluation of EGFR gene copy number as a predictive biomarker for the efficacy of cetuximab in combination with chemotherapy in the first-line treatment of recurrent and/or metastatic squamous cell carcinoma of the head and neck: EXTREME study. \n", "1 Clinical usefulness of EGFR gene copy number as a predictive marker in colorectal cancer patients treated with cetuximab: a fluorescent in situ hybridization study. \n", "2 EGFR gene copy number as a predictive biomarker for resistance to anti-EGFR monoclonal antibodies in metastatic colorectal cancer treatment: a meta-analysis. \n", "3 EGFR gene copy number as a predictive biomarker for resistance to anti-EGFR monoclonal antibodies in metastatic colorectal cancer treatment: a meta-analysis. \n", "4 EGFR gene copy number as a predictive biomarker for patients receiving tyrosine kinase inhibitor treatment: a systematic review and meta-analysis in non-small-cell lung cancer. 
\n", "5 Epidermal Growth Factor Receptor Gene Amplification in Patients with Advanced-stage NSCLC. \n", "6 Epidermal Growth Factor Receptor Gene Amplification in Patients with Advanced-stage NSCLC. \n", "7 Association of KRAS p.G13D mutation with outcome in patients with chemotherapy-refractory metastatic colorectal cancer treated with cetuximab. \n", "8 Inhibition of MEK and PI3K/mTOR suppresses tumor growth but does not cause tumor regression in patient-derived xenografts of RAS-mutant colorectal carcinomas. " ] }, "execution_count": 41, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wdt: \n", "PREFIX owl: \n", "PREFIX pq: \n", "PREFIX ps: \n", "PREFIX pr: \n", "PREFIX p: \n", "PREFIX prov: \n", "PREFIX wikibase: \n", "PREFIX bd: \n", "PREFIX rdfs: \n", "select distinct ?geneSymbol ?variantLabel ?treatmentLabel ?diseaseLabel ?referenceLabel\n", "where {\n", " SERVICE {\n", " ?variant wdt:P3329 ?id .\n", " ?variant p:P3354 [ ps:P3354 ?treatment ; \n", " pq:P2175 ?disease ;\n", " prov:wasDerivedFrom ?source ].\n", " ?source pr:P248 ?reference\n", " \n", " SERVICE wikibase:label { \n", " bd:serviceParam wikibase:language \"en\" . 
\n", " ?variant rdfs:label ?variantLabel .\n", " ?treatment rdfs:label ?treatmentLabel .\n", " ?disease rdfs:label ?diseaseLabel .\n", " ?reference rdfs:label ?referenceLabel .\n", " }\n", " }\n", "\n", "\n", " :CRC0481 :hasDescendant ?node .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?variant .\n", " ?gene :has_variant ?variant.\n", " OPTIONAL {?variant :alt_p ?alt_p }\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?variant a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", "}\n", "\n", "order by ?geneSymbol\n", "\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go\n", "result_table[['geneSymbol.value', 'variantLabel.value', 'treatmentLabel.value', 'diseaseLabel.value', 'referenceLabel.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "**Negative response predictions**" ] }, { "cell_type": "code", "execution_count": 42, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " geneSymbol.value variantLabel.value treatmentLabel.value \\\n", "0 EGFR EGFR AMPLIFICATION osimertinib \n", "1 EGFR EGFR AMPLIFICATION rociletinib \n", "2 KRAS KRAS G13D Panitumumab \n", "3 KRAS KRAS G13D Cetuximab \n", "4 KRAS KRAS G13D Cetuximab \n", "5 KRAS KRAS G13D Cetuximab \n", "6 KRAS KRAS G13D Cetuximab \n", "\n", " diseaseLabel.value \\\n", "0 non-small-cell lung carcinoma \n", "1 non-small-cell lung carcinoma \n", "2 colorectal cancer \n", "3 colorectal cancer \n", "4 colorectal cancer \n", "5 colorectal cancer \n", "6 colorectal cancer \n", "\n", " referenceLabel.value \n", "0 Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. \n", "1 Amplification of EGFR Wild-Type Alleles in Non-Small Cell Lung Cancer Cells Confers Acquired Resistance to Mutation-Selective EGFR Tyrosine Kinase Inhibitors. \n", "2 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "3 PIK3CA mutations in colorectal cancer are associated with clinical resistance to EGFR-targeted monoclonal antibodies. \n", "4 Phase II study of single-agent cetuximab in KRAS G13D mutant metastatic colorectal cancer. \n", "5 Cetuximab treatment for metastatic colorectal cancer with KRAS p.G13D mutations improves progression-free survival. \n", "6 Meta-analysis comparing the efficacy of anti-EGFR monoclonal antibody therapy between KRAS G13D and other KRAS mutant metastatic colorectal cancer tumours. 
" ] }, "execution_count": 42, "metadata": {}, "output_type": "execute_result" } ], "source": [ "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX wdt: \n", "PREFIX owl: \n", "PREFIX pq: \n", "PREFIX ps: \n", "PREFIX pr: \n", "PREFIX p: \n", "PREFIX prov: \n", "PREFIX wikibase: \n", "PREFIX bd: \n", "PREFIX rdfs: \n", "select distinct ?geneSymbol ?variantLabel ?treatmentLabel ?diseaseLabel ?referenceLabel\n", "where {\n", " SERVICE {\n", " ?variant wdt:P3329 ?id .\n", " ?variant p:P3355 [ ps:P3355 ?treatment ; \n", " pq:P2175 ?disease ;\n", " prov:wasDerivedFrom ?source ].\n", " ?source pr:P248 ?reference\n", " \n", " SERVICE wikibase:label { \n", " bd:serviceParam wikibase:language \"en\" . \n", " ?variant rdfs:label ?variantLabel .\n", " ?treatment rdfs:label ?treatmentLabel .\n", " ?disease rdfs:label ?diseaseLabel .\n", " ?reference rdfs:label ?referenceLabel .\n", " }\n", " }\n", "\n", "\n", " :CRC0481 :hasDescendant ?node .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?variant .\n", " ?gene :has_variant ?variant.\n", " OPTIONAL {?variant :alt_p ?alt_p }\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?variant a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", "}\n", "\n", "order by ?geneSymbol\n", "\n", "\"\"\"\n", "\n", "# get data\n", "result_table = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "# there you go\n", "result_table[['geneSymbol.value', 'variantLabel.value', 'treatmentLabel.value', 'diseaseLabel.value', 'referenceLabel.value']]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "## Appendix - PoC figures\n", "\n", "In this final section we list links between figures in the paper and queries or computations in this notebook." 
] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Figure 4b (left)\n", "\n", "The **pie chart on the left** (i.e., response fractions in trees with no variants) can be obtained with the following query:" ] }, { "cell_type": "code", "execution_count": 43, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " type.value cases.value\n", "0 http://las.ircc.it/ontology/annotationplatform#DRCl_PD 38 \n", "1 http://las.ircc.it/ontology/annotationplatform#DRCl_SD 58 \n", "2 http://las.ircc.it/ontology/annotationplatform#DRCl_OR 29 " ] }, "execution_count": 43, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# response fractions in trees with no variants\n", "my_query = \"\"\"\n", "PREFIX : \n", "PREFIX onto: \n", "PREFIX sesame: \n", "select (count(distinct ?case) as ?cases) ?type\n", "from onto:disable-sameAs\n", "where {\n", " ?case a :Case ;\n", " :hasDescendant ?mouse .\n", " ?mouse a :Biomouse ;\n", " :has_annotation ?ann .\n", " ?ann :has_reference ?ref .\n", " ?ref sesame:directType ?type .\n", " filter not exists { \n", " ?case :hasDescendant ?node .\n", " ?node :has_annotation ?ann2 .\n", " ?ann2 :has_reference ?ref2 .\n", " ?gene :has_variant ?ref2.\n", " ?gene :symbol ?geneSymbol\n", " VALUES ?geneSymbol {'KRAS' 'EGFR' 'BRAF' 'ERBB2'}\n", " ?ref2 a ?annotation_Type.\n", " VALUES ?annotation_Type { :sequence_alteration :feature_amplification }\n", " }\n", "}\n", "group by ?type\n", "\"\"\"\n", "\n", "# get data\n", "result_table_no_var = utils.query(SEMALYTICS_ENDPOINT, my_query)\n", "\n", "result_table_no_var[['type.value','cases.value']]" ] }, { "cell_type": "code", "execution_count": 44, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "results = [\n", " int(result_table_no_var[result_table_no_var['type.value'].str.contains('_OR')]['cases.value']),\n", " int(result_table_no_var[result_table_no_var['type.value'].str.contains('_SD')]['cases.value']),\n", " int(result_table_no_var[result_table_no_var['type.value'].str.contains('_PD')]['cases.value'])\n", "]\n", "\n", "d = {'response_type': ['response', 'neutral', 'progression'], 'cases': results}\n", "df = pd.DataFrame(data=d)\n", "\n", "chart = df.plot.pie(y = 'cases',\n", " rot=0,\n", " #labels = df['response_type'], # labels\n", " labels = None,\n", " legend = False,\n", " figsize=(5, 5),\n", " colors=utils.response_colors(df['response_type']),\n", " title ='No variants' # title\n", " ).set_ylabel('')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Figure 4b (right)\n", "We computed the **pie chart on the right** (responses of cases with 1+ variant(s)) in this [cell](#fig4br)." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Figure 5a - Data matrix\n", "\n", "The data matrix in Figure 5a cotains several charts.\n", "\n", "* Row *Mutations only*: the three charts of the first row are the ones computed in cells of section [Matching `sequence_alteration` only](#mseqsubset)\n", "\n", "* Row _Amplifications only_: charts of this row are computed in cells of section [Matching `feature_amplification` only](#mamplsubset)\n", "\n", "* Row _All cases_: these charts are generated in cells of section [Matching annotations in the investigation set](#mset)\n", "\n", "### Figure 5b - Variants occurences\n", "\n", "This figure shows the distribution of variants detected in cases that did not respond to Cetuximab. 
In particular, the data returned by the query in section [Variants of non-responders](#variantsnonresp) are summarized in a pivot table showing the distributions of:\n", "\n", "* altered genes\n", "* alteration types per gene (mutations or amplifications)\n", "* mutations detected per gene\n", "\n" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.0" } }, "nbformat": 4, "nbformat_minor": 2 }