{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# nCovMemory\n", "\n", "The nCovMemory project is a GitHub repository where people are curating stories about COVID-19 in the media and social media. You can see it mentioned in a short NYTimes video documentary about censorship in China: [China Is Censoring Coronavirus Stories: These Citizens Are Fighting Back](https://www.nytimes.com/video/world/asia/100000006970549/coronavirus-chinese-citizens.html) by Christoph Koettl, Muyi Xiao, Nilo Tabrizy and Dmitriy Khavin.\n", "\n", "They make their data available at this [static website](https://2019ncovmemory.github.io/nCovMemory/) but also as CSV data in their GitHub repository. We can check their data to see if any of them need to be added to the IIPC collection." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## GitHub Data\n", "\n", "We can download their latest CSV data directly from the web.\n" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | category | \n", "update | \n", "media | \n", "date | \n", "title | \n", "title_en | \n", "url | \n", "translation_en | \n", "is_deleted | \n", "alternative | \n", "archive | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|
id | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
4241 | \n", "non_fiction | \n", "2020-03-23 | \n", "人间theLivings | \n", "2020-03-23 | \n", "海外疫区里的中国留学生:要学位,还是保命? | \n", "NaN | \n", "https://mp.weixin.qq.com/s/HkJQ01ZBkerky7BC-xAhiA | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.is/XD3Nz | \n", "
4240 | \n", "narrative | \n", "2020-03-23 | \n", "在人间living | \n", "2020-03-23 | \n", "今天,武汉封城两个月了 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/mrWF9nFUxtXnNnyNEf-Ibw | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.is/KPoAT | \n", "
4239 | \n", "non_fiction | \n", "2020-03-23 | \n", "中国经营报 | \n", "2020-03-23 | \n", "“108好汉”为何注射新冠疫苗,这位00后的回答刷屏… | \n", "NaN | \n", "https://mp.weixin.qq.com/s/GinzGhKnNHZrtlDuVIrkKw | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.is/tHBxO | \n", "
4238 | \n", "non_fiction | \n", "2020-03-23 | \n", "中国经营报 | \n", "2020-03-23 | \n", "新加坡、澳大利亚“封国”!意大利全国“停产”,美国确诊人数突破3万... | \n", "NaN | \n", "https://mp.weixin.qq.com/s/hE8J7D-GrkB92GoPnmpcsg | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.is/QcWU5 | \n", "
4237 | \n", "narrative | \n", "2020-03-22 | \n", "WUXU | \n", "2020-03-23 | \n", "[四十日谈] 条条大路“不”通罗马,老猫的曲折回意之路 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/fLlGjOcZcotS-QybkqZjcw | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.ph/ehsAk | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
5 | \n", "non_fiction | \n", "2020-02-06 | \n", "GQ报道 | \n", "2020-01-29 | \n", "孝感前线医生:武汉更难,我们下面不好意思提要求 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/uGaFeqrqmLBQe5qdRSTeSQ | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://archive.ph/MnZrn | \n", "
4 | \n", "non_fiction | \n", "2020-02-06 | \n", "GQ报道 | \n", "2020-01-29 | \n", "疫情危机中不被看见的人们:武汉周边城市百姓的自救行动 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/D8Ob8pNmecHKXg7yR7EWFg | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://archive.ph/vDSj5 | \n", "
3 | \n", "non_fiction | \n", "2020-02-06 | \n", "GQ报道 | \n", "2020-01-28 | \n", "我家离华南海鲜市场很近:返乡、封城、过年,一位武汉大学生的过去一周 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/n7dXGHh-79d6VEzDhhOUbQ | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://archive.ph/RSmFx | \n", "
2 | \n", "non_fiction | \n", "2020-02-06 | \n", "GQ报道 | \n", "2020-01-28 | \n", "武汉隔离:疫区、信息孤岛与一辆鄂A车的漂流 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/M-hVivF7NQmZHlu8YMnL_w | \n", "NaN | \n", "NaN | \n", "NaN | \n", "http://archive.is/3XKZD | \n", "
1 | \n", "non_fiction | \n", "2020-02-06 | \n", "GQ报道 | \n", "2020-01-27 | \n", "10000个临时发往武汉的口罩 | \n", "NaN | \n", "https://mp.weixin.qq.com/s/p-uPky_zB6XKcAetthqkKg | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://archive.ph/9s1ug | \n", "
4227 rows × 11 columns
\n", "\n", " | id | \n", "url | \n", "creator | \n", "created | \n", "updated | \n", "crawl_definition | \n", "title | \n", "description | \n", "language | \n", "tld | \n", "
---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "2147692 | \n", "http://coronavirus.fr/ | \n", "alext | \n", "2020-02-21T03:43:18.662353Z | \n", "2020-03-16T19:53:45.860949Z | \n", "31104294373 | \n", "Epicorem. Ecoépidémiologie | \n", "Medical/Scientific aspects | \n", "French | \n", ".fr | \n", "
1 | \n", "2147693 | \n", "http://english.whiov.cas.cn/ | \n", "alext | \n", "2020-02-21T03:43:18.706571Z | \n", "2020-03-16T19:52:28.575749Z | \n", "31104294373 | \n", "Wuhan Institute of Virulogy, official page in ... | \n", "Health Organisation | \n", "English | \n", ".cn | \n", "
2 | \n", "2147694 | \n", "http://www.china-embassy.or.jp/chn/ | \n", "alext | \n", "2020-02-21T03:43:18.739126Z | \n", "2020-03-16T19:53:03.086729Z | \n", "31104294373 | \n", "中华人民共和国驻日本大使馆 | \n", "Embassy | \n", "Chinese | \n", ".jp | \n", "
3 | \n", "2147695 | \n", "http://www.china-embassy.or.jp/jpn/ | \n", "alext | \n", "2020-02-21T03:43:18.766308Z | \n", "2020-03-16T19:54:02.280945Z | \n", "31104294373 | \n", "中華人民共和国駐日本国大使館 | \n", "Embassy | \n", "Japanese | \n", ".jp | \n", "
4 | \n", "2147696 | \n", "https://cadenaser.com/tag/ncov/a/ | \n", "alext | \n", "2020-02-21T03:43:18.791716Z | \n", "2020-03-16T19:54:19.694418Z | \n", "31104294373 | \n", "Coronavirus de Wuhan | \n", "Cadena Ser | \n", "Spanish | \n", ".com | \n", "