{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Principal Component Plots" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "For this chapter, you will need the PCA results that we ran in the last chapter. I have actually included the output files of my runs into this repository, so you can just use them if something didn't work in the previous chapter." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "For making plots in python, the most popular libary around is [matplotlib](http://matplotlib.org). We will also make use of [pandas](https://pandas.pydata.org). You can load them via:" ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [], "source": [ "%matplotlib inline\n", "import pandas as pd\n", "import matplotlib.pyplot as plt" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "/home/stephan/popgen_course\n" ] } ], "source": [ "cd ~/popgen_course" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'/home/stephan/popgen_course'" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "pwd" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "01_bashnb_getting_started.ipynb pca.AllEurasia.params.txt\r\n", "02_pynb_getting_started.ipynb pca.WestEurasia.eval\r\n", "03_bashnb_smartpca.ipynb pca.WestEurasia.evec\r\n", "04_pynb_plotting_pca.ipynb pca.WestEurasia.params.txt\r\n", "pca.AllEurasia.eval population_frequencies.txt\r\n", "pca.AllEurasia.evec README.md\r\n" ] } ], "source": [ "ls" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's have a look at the main results file from ``smartpca``:" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " #eigvals: 6.289 3.095 2.693 2.010 \r\n", " Yuk_009 0.0123 0.1252 0.1147 0.0567 Yukagir\r\n", " Yuk_025 0.0120 0.1258 0.1168 0.0576 Yukagir\r\n", " Yuk_022 0.0136 0.1303 0.1186 0.0564 Yukagir\r\n", " Yuk_020 0.0170 0.1278 0.1176 0.0584 Yukagir\r\n", " MC_40 0.0183 0.1226 0.1123 0.0537 Chukchi\r\n", " Yuk_024 0.0144 0.1271 0.1124 0.0584 Yukagir\r\n", " Yuk_023 0.0124 0.1348 0.1238 0.0642 Yukagir\r\n", " MC_16 0.0144 0.1266 0.1169 0.0541 Chukchi\r\n", " MC_15 0.0146 0.1250 0.1119 0.0559 Chukchi\r\n" ] } ], "source": [ "!head pca.WestEurasia.evec" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The first row contains the eigenvalues for the first 4 principal components (PCs), and all further rows contain the PC coordinates for each individual. The first column contains the name of each individual, the last row the population. To load this dataset with python, we use the pandas_ package, which facilitates working with data in python. To load data using pandas, we will use the `read_csv()` function. This function lets you define column headers, which we have to define first:" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [], "source": [ "column_names = [\"Name\", \"PC1\", \"PC2\", \"PC3\", \"PC4\", \"Group\"]" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['Name', 'PC1', 'PC2', 'PC3', 'PC4', 'Group']" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "column_names" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can then load the eigenVec file from the pca run:" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [], "source": [ "pcaDat = pd.read_csv(\"pca.WestEurasia.evec\",\n", " delim_whitespace=True, skiprows=1, names=column_names)" ] }, { "cell_type": "code", "execution_count": 25, "metadata": {}, "outputs": [], "source": [ "pcaDat2 = pd.read_csv(\"pca.AllEurasia.evec\",\n", " delim_whitespace=True, skiprows=1, names=column_names)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Looking at the data, we find that it is a matrix, with each individual on one row, and the columns denoting the first 4 principal components. The last column contains the population for each individual:" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | Name | \n", "PC1 | \n", "PC2 | \n", "PC3 | \n", "PC4 | \n", "Group | \n", "
---|---|---|---|---|---|---|
0 | \n", "Yuk_009 | \n", "0.0123 | \n", "0.1252 | \n", "0.1147 | \n", "0.0567 | \n", "Yukagir | \n", "
1 | \n", "Yuk_025 | \n", "0.0120 | \n", "0.1258 | \n", "0.1168 | \n", "0.0576 | \n", "Yukagir | \n", "
2 | \n", "Yuk_022 | \n", "0.0136 | \n", "0.1303 | \n", "0.1186 | \n", "0.0564 | \n", "Yukagir | \n", "
3 | \n", "Yuk_020 | \n", "0.0170 | \n", "0.1278 | \n", "0.1176 | \n", "0.0584 | \n", "Yukagir | \n", "
4 | \n", "MC_40 | \n", "0.0183 | \n", "0.1226 | \n", "0.1123 | \n", "0.0537 | \n", "Chukchi | \n", "
5 | \n", "Yuk_024 | \n", "0.0144 | \n", "0.1271 | \n", "0.1124 | \n", "0.0584 | \n", "Yukagir | \n", "
6 | \n", "Yuk_023 | \n", "0.0124 | \n", "0.1348 | \n", "0.1238 | \n", "0.0642 | \n", "Yukagir | \n", "
7 | \n", "MC_16 | \n", "0.0144 | \n", "0.1266 | \n", "0.1169 | \n", "0.0541 | \n", "Chukchi | \n", "
8 | \n", "MC_15 | \n", "0.0146 | \n", "0.1250 | \n", "0.1119 | \n", "0.0559 | \n", "Chukchi | \n", "
9 | \n", "MC_18 | \n", "0.0175 | \n", "0.1238 | \n", "0.1167 | \n", "0.0523 | \n", "Chukchi | \n", "
10 | \n", "Yuk_004 | \n", "0.0110 | \n", "0.1273 | \n", "0.1117 | \n", "0.0573 | \n", "Yukagir | \n", "
11 | \n", "MC_08 | \n", "0.0187 | \n", "0.1253 | \n", "0.1185 | \n", "0.0564 | \n", "Chukchi | \n", "
12 | \n", "Nov_005 | \n", "0.0152 | \n", "0.1349 | \n", "0.1285 | \n", "0.0618 | \n", "Nganasan | \n", "
13 | \n", "MC_25 | \n", "0.0182 | \n", "0.1258 | \n", "0.1196 | \n", "0.0532 | \n", "Chukchi | \n", "
14 | \n", "Yuk_019 | \n", "0.0161 | \n", "0.1327 | \n", "0.1229 | \n", "0.0617 | \n", "Yukagir | \n", "
15 | \n", "Yuk_011 | \n", "0.0152 | \n", "0.1217 | \n", "0.1148 | \n", "0.0569 | \n", "Yukagir | \n", "
16 | \n", "Sesk_47 | \n", "0.0167 | \n", "0.1241 | \n", "0.1177 | \n", "0.0549 | \n", "Chukchi1 | \n", "
17 | \n", "MC_17 | \n", "0.0180 | \n", "0.1268 | \n", "0.1147 | \n", "0.0544 | \n", "Chukchi | \n", "
18 | \n", "Yuk_021 | \n", "0.0141 | \n", "0.1329 | \n", "0.1210 | \n", "0.0653 | \n", "Yukagir | \n", "
19 | \n", "MC_06 | \n", "0.0159 | \n", "0.1264 | \n", "0.1135 | \n", "0.0557 | \n", "Chukchi | \n", "
20 | \n", "MC_38 | \n", "0.0178 | \n", "0.1240 | \n", "0.1143 | \n", "0.0534 | \n", "Chukchi | \n", "
21 | \n", "MC_14 | \n", "0.0165 | \n", "0.1238 | \n", "0.1114 | \n", "0.0524 | \n", "Chukchi | \n", "
22 | \n", "Ul5 | \n", "0.0070 | \n", "0.1306 | \n", "0.1144 | \n", "0.0540 | \n", "Ulchi | \n", "
23 | \n", "Ul31 | \n", "0.0056 | \n", "0.1289 | \n", "0.1182 | \n", "0.0550 | \n", "Ulchi | \n", "
24 | \n", "Ul65 | \n", "0.0051 | \n", "0.1331 | \n", "0.1117 | \n", "0.0599 | \n", "Ulchi | \n", "
25 | \n", "Tuba12 | \n", "0.0172 | \n", "0.0906 | \n", "0.0790 | \n", "0.0362 | \n", "Tubalar | \n", "
26 | \n", "Tuba20 | \n", "0.0129 | \n", "0.0894 | \n", "0.0767 | \n", "0.0308 | \n", "Tubalar | \n", "
27 | \n", "Nel19 | \n", "0.0273 | \n", "0.0605 | \n", "0.0608 | \n", "0.0333 | \n", "Yukagir | \n", "
28 | \n", "Nlk16 | \n", "0.0217 | \n", "0.0744 | \n", "0.0753 | \n", "0.0360 | \n", "Even | \n", "
29 | \n", "Kor66 | \n", "0.0148 | \n", "0.1259 | \n", "0.1157 | \n", "0.0531 | \n", "Koryak | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
1259 | \n", "I0429 | \n", "0.0413 | \n", "0.0447 | \n", "0.0440 | \n", "0.0098 | \n", "Yamnaya_Samara | \n", "
1260 | \n", "I0438 | \n", "0.0384 | \n", "0.0497 | \n", "0.0399 | \n", "0.0020 | \n", "Yamnaya_Samara | \n", "
1261 | \n", "I0585 | \n", "0.0770 | \n", "-0.0424 | \n", "0.0372 | \n", "0.0355 | \n", "WHG | \n", "
1262 | \n", "I0797 | \n", "-0.0101 | \n", "-0.0452 | \n", "-0.0342 | \n", "-0.0124 | \n", "LBK_EN | \n", "
1263 | \n", "I0795 | \n", "-0.0057 | \n", "-0.0495 | \n", "-0.0429 | \n", "0.0098 | \n", "LBK_EN | \n", "
1264 | \n", "I0022 | \n", "-0.0133 | \n", "-0.0433 | \n", "-0.0356 | \n", "-0.0089 | \n", "LBK_EN | \n", "
1265 | \n", "I0026 | \n", "-0.0142 | \n", "-0.0438 | \n", "-0.0430 | \n", "-0.0027 | \n", "LBK_EN | \n", "
1266 | \n", "I1507 | \n", "0.0866 | \n", "-0.0455 | \n", "0.0393 | \n", "0.0311 | \n", "WHG | \n", "
1267 | \n", "I0025 | \n", "-0.0103 | \n", "-0.0449 | \n", "-0.0404 | \n", "-0.0023 | \n", "LBK_EN | \n", "
1268 | \n", "I0443 | \n", "0.0350 | \n", "0.0401 | \n", "0.0412 | \n", "0.0028 | \n", "Yamnaya_Samara | \n", "
1269 | \n", "I0054 | \n", "-0.0054 | \n", "-0.0413 | \n", "-0.0410 | \n", "-0.0124 | \n", "LBK_EN | \n", "
1270 | \n", "I0046 | \n", "-0.0066 | \n", "-0.0446 | \n", "-0.0386 | \n", "-0.0092 | \n", "LBK_EN | \n", "
1271 | \n", "I0048 | \n", "-0.0128 | \n", "-0.0367 | \n", "-0.0388 | \n", "-0.0129 | \n", "LBK_EN | \n", "
1272 | \n", "I0056 | \n", "-0.0067 | \n", "-0.0472 | \n", "-0.0388 | \n", "-0.0054 | \n", "LBK_EN | \n", "
1273 | \n", "I0057 | \n", "-0.0113 | \n", "-0.0442 | \n", "-0.0357 | \n", "-0.0008 | \n", "LBK_EN | \n", "
1274 | \n", "I0100 | \n", "-0.0063 | \n", "-0.0455 | \n", "-0.0410 | \n", "-0.0051 | \n", "LBK_EN | \n", "
1275 | \n", "I0659 | \n", "-0.0084 | \n", "-0.0437 | \n", "-0.0431 | \n", "-0.0099 | \n", "LBK_EN | \n", "
1276 | \n", "I0821 | \n", "-0.0071 | \n", "-0.0428 | \n", "-0.0380 | \n", "-0.0103 | \n", "LBK_EN | \n", "
1277 | \n", "I1550 | \n", "-0.0107 | \n", "-0.0386 | \n", "-0.0402 | \n", "-0.0039 | \n", "LBK_EN | \n", "
1278 | \n", "BOO001 | \n", "0.0399 | \n", "0.0760 | \n", "0.0915 | \n", "0.0453 | \n", "BolshoyOleniOstrov | \n", "
1279 | \n", "BOO002 | \n", "0.0445 | \n", "0.0735 | \n", "0.0925 | \n", "0.0379 | \n", "BolshoyOleniOstrov | \n", "
1280 | \n", "BOO003 | \n", "0.0466 | \n", "0.0765 | \n", "0.0862 | \n", "0.0415 | \n", "BolshoyOleniOstrov | \n", "
1281 | \n", "BOO004 | \n", "0.0411 | \n", "0.0723 | \n", "0.0938 | \n", "0.0419 | \n", "BolshoyOleniOstrov | \n", "
1282 | \n", "BOO005 | \n", "0.0461 | \n", "0.0731 | \n", "0.0909 | \n", "0.0401 | \n", "BolshoyOleniOstrov | \n", "
1283 | \n", "BOO006 | \n", "0.0394 | \n", "0.0917 | \n", "0.1002 | \n", "0.0438 | \n", "BolshoyOleniOstrov | \n", "
1284 | \n", "CHV001 | \n", "0.0441 | \n", "0.0331 | \n", "0.0587 | \n", "0.0325 | \n", "ChalmnyVarre | \n", "
1285 | \n", "CHV002 | \n", "0.0442 | \n", "0.0351 | \n", "0.0610 | \n", "0.0373 | \n", "ChalmnyVarre | \n", "
1286 | \n", "JK1968 | \n", "0.0398 | \n", "0.0385 | \n", "0.0661 | \n", "0.0299 | \n", "Levanluhta | \n", "
1287 | \n", "JK1970 | \n", "0.0408 | \n", "0.0466 | \n", "0.0600 | \n", "0.0363 | \n", "Levanluhta | \n", "
1288 | \n", "JK2065 | \n", "0.0392 | \n", "-0.0065 | \n", "0.0195 | \n", "0.0043 | \n", "JK2065 | \n", "
1289 rows × 6 columns
\n", "\n", " | Population | \n", "
---|---|
1 | \n", "Abkhasian | \n", "
2 | \n", "Adygei | \n", "
3 | \n", "Albanian | \n", "
4 | \n", "Armenian | \n", "
5 | \n", "Assyrian | \n", "
6 | \n", "Balkar | \n", "
7 | \n", "Basque | \n", "
8 | \n", "BedouinA | \n", "
9 | \n", "BedouinB | \n", "
10 | \n", "Belarusian | \n", "
11 | \n", "Bulgarian | \n", "
12 | \n", "Canary_Islander | \n", "
13 | \n", "Chechen | \n", "
0 | \n", "Chuvash | \n", "
14 | \n", "Croatian | \n", "
15 | \n", "Cypriot | \n", "
16 | \n", "Czech | \n", "
17 | \n", "Druze | \n", "
18 | \n", "English | \n", "
19 | \n", "Estonian | \n", "
20 | \n", "Finnish | \n", "
21 | \n", "French | \n", "
22 | \n", "Georgian | \n", "
23 | \n", "German | \n", "
24 | \n", "Greek | \n", "
25 | \n", "Hungarian | \n", "
26 | \n", "Icelandic | \n", "
27 | \n", "Iranian | \n", "
28 | \n", "Irish | \n", "
29 | \n", "Irish_Ulster | \n", "
... | \n", "... | \n", "
38 | \n", "Jew_Tunisian | \n", "
39 | \n", "Jew_Turkish | \n", "
40 | \n", "Jew_Yemenite | \n", "
41 | \n", "Jordanian | \n", "
42 | \n", "Kumyk | \n", "
44 | \n", "Lebanese | \n", "
43 | \n", "Lebanese_Christian | \n", "
45 | \n", "Lebanese_Muslim | \n", "
46 | \n", "Lezgin | \n", "
47 | \n", "Lithuanian | \n", "
48 | \n", "Maltese | \n", "
49 | \n", "Mordovian | \n", "
50 | \n", "North_Ossetian | \n", "
51 | \n", "Norwegian | \n", "
52 | \n", "Orcadian | \n", "
53 | \n", "Palestinian | \n", "
54 | \n", "Polish | \n", "
55 | \n", "Romanian | \n", "
56 | \n", "Russian | \n", "
57 | \n", "Sardinian | \n", "
58 | \n", "Saudi | \n", "
59 | \n", "Scottish | \n", "
60 | \n", "Shetlandic | \n", "
61 | \n", "Sicilian | \n", "
62 | \n", "Sorb | \n", "
64 | \n", "Spanish | \n", "
63 | \n", "Spanish_North | \n", "
65 | \n", "Syrian | \n", "
66 | \n", "Turkish | \n", "
67 | \n", "Ukrainian | \n", "
68 rows × 1 columns
\n", "\n", " | Population | \n", "colorIndex | \n", "symbolIndex | \n", "
---|---|---|---|
1 | \n", "Abkhasian | \n", "0 | \n", "0 | \n", "
2 | \n", "Adygei | \n", "0 | \n", "1 | \n", "
3 | \n", "Albanian | \n", "0 | \n", "2 | \n", "
4 | \n", "Armenian | \n", "0 | \n", "3 | \n", "
5 | \n", "Assyrian | \n", "0 | \n", "4 | \n", "
6 | \n", "Balkar | \n", "0 | \n", "5 | \n", "
7 | \n", "Basque | \n", "0 | \n", "6 | \n", "
8 | \n", "BedouinA | \n", "0 | \n", "7 | \n", "
9 | \n", "BedouinB | \n", "1 | \n", "0 | \n", "
10 | \n", "Belarusian | \n", "1 | \n", "1 | \n", "
11 | \n", "Bulgarian | \n", "1 | \n", "2 | \n", "
12 | \n", "Canary_Islander | \n", "1 | \n", "3 | \n", "
13 | \n", "Chechen | \n", "1 | \n", "4 | \n", "
0 | \n", "Chuvash | \n", "1 | \n", "5 | \n", "
14 | \n", "Croatian | \n", "1 | \n", "6 | \n", "
15 | \n", "Cypriot | \n", "1 | \n", "7 | \n", "
16 | \n", "Czech | \n", "2 | \n", "0 | \n", "
17 | \n", "Druze | \n", "2 | \n", "1 | \n", "
18 | \n", "English | \n", "2 | \n", "2 | \n", "
19 | \n", "Estonian | \n", "2 | \n", "3 | \n", "
20 | \n", "Finnish | \n", "2 | \n", "4 | \n", "
21 | \n", "French | \n", "2 | \n", "5 | \n", "
22 | \n", "Georgian | \n", "2 | \n", "6 | \n", "
23 | \n", "German | \n", "2 | \n", "7 | \n", "
24 | \n", "Greek | \n", "3 | \n", "0 | \n", "
25 | \n", "Hungarian | \n", "3 | \n", "1 | \n", "
26 | \n", "Icelandic | \n", "3 | \n", "2 | \n", "
27 | \n", "Iranian | \n", "3 | \n", "3 | \n", "
28 | \n", "Irish | \n", "3 | \n", "4 | \n", "
29 | \n", "Irish_Ulster | \n", "3 | \n", "5 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "
38 | \n", "Jew_Tunisian | \n", "4 | \n", "6 | \n", "
39 | \n", "Jew_Turkish | \n", "4 | \n", "7 | \n", "
40 | \n", "Jew_Yemenite | \n", "5 | \n", "0 | \n", "
41 | \n", "Jordanian | \n", "5 | \n", "1 | \n", "
42 | \n", "Kumyk | \n", "5 | \n", "2 | \n", "
44 | \n", "Lebanese | \n", "5 | \n", "3 | \n", "
43 | \n", "Lebanese_Christian | \n", "5 | \n", "4 | \n", "
45 | \n", "Lebanese_Muslim | \n", "5 | \n", "5 | \n", "
46 | \n", "Lezgin | \n", "5 | \n", "6 | \n", "
47 | \n", "Lithuanian | \n", "5 | \n", "7 | \n", "
48 | \n", "Maltese | \n", "6 | \n", "0 | \n", "
49 | \n", "Mordovian | \n", "6 | \n", "1 | \n", "
50 | \n", "North_Ossetian | \n", "6 | \n", "2 | \n", "
51 | \n", "Norwegian | \n", "6 | \n", "3 | \n", "
52 | \n", "Orcadian | \n", "6 | \n", "4 | \n", "
53 | \n", "Palestinian | \n", "6 | \n", "5 | \n", "
54 | \n", "Polish | \n", "6 | \n", "6 | \n", "
55 | \n", "Romanian | \n", "6 | \n", "7 | \n", "
56 | \n", "Russian | \n", "7 | \n", "0 | \n", "
57 | \n", "Sardinian | \n", "7 | \n", "1 | \n", "
58 | \n", "Saudi | \n", "7 | \n", "2 | \n", "
59 | \n", "Scottish | \n", "7 | \n", "3 | \n", "
60 | \n", "Shetlandic | \n", "7 | \n", "4 | \n", "
61 | \n", "Sicilian | \n", "7 | \n", "5 | \n", "
62 | \n", "Sorb | \n", "7 | \n", "6 | \n", "
64 | \n", "Spanish | \n", "7 | \n", "7 | \n", "
63 | \n", "Spanish_North | \n", "8 | \n", "0 | \n", "
65 | \n", "Syrian | \n", "8 | \n", "1 | \n", "
66 | \n", "Turkish | \n", "8 | \n", "2 | \n", "
67 | \n", "Ukrainian | \n", "8 | \n", "3 | \n", "
68 rows × 3 columns
\n", "\n", " | Population | \n", "colorIndex | \n", "symbolIndex | \n", "
---|---|---|---|
0 | \n", "Abkhasian | \n", "0 | \n", "0 | \n", "
1 | \n", "Adygei | \n", "0 | \n", "1 | \n", "
2 | \n", "Albanian | \n", "0 | \n", "2 | \n", "
3 | \n", "Aleut | \n", "0 | \n", "3 | \n", "
4 | \n", "Aleut_Tlingit | \n", "0 | \n", "4 | \n", "
5 | \n", "Altaian | \n", "0 | \n", "5 | \n", "
6 | \n", "Ami | \n", "0 | \n", "6 | \n", "
7 | \n", "Armenian | \n", "0 | \n", "7 | \n", "
8 | \n", "Assyrian | \n", "0 | \n", "8 | \n", "
9 | \n", "Atayal | \n", "0 | \n", "9 | \n", "
10 | \n", "Avar | \n", "0 | \n", "10 | \n", "
11 | \n", "Azeri | \n", "0 | \n", "11 | \n", "
12 | \n", "Balkar | \n", "0 | \n", "12 | \n", "
13 | \n", "Basque | \n", "1 | \n", "0 | \n", "
14 | \n", "BedouinA | \n", "1 | \n", "1 | \n", "
15 | \n", "BedouinB | \n", "1 | \n", "2 | \n", "
16 | \n", "Belarusian | \n", "1 | \n", "3 | \n", "
17 | \n", "Borneo | \n", "1 | \n", "4 | \n", "
18 | \n", "Bulgarian | \n", "1 | \n", "5 | \n", "
19 | \n", "Buryat | \n", "1 | \n", "6 | \n", "
20 | \n", "Cambodian | \n", "1 | \n", "7 | \n", "
21 | \n", "Chechen | \n", "1 | \n", "8 | \n", "
22 | \n", "Chukchi | \n", "1 | \n", "9 | \n", "
23 | \n", "Chukchi1 | \n", "1 | \n", "10 | \n", "
24 | \n", "Chuvash | \n", "1 | \n", "11 | \n", "
25 | \n", "Croatian | \n", "1 | \n", "12 | \n", "
26 | \n", "Cypriot | \n", "2 | \n", "0 | \n", "
27 | \n", "Czech | \n", "2 | \n", "1 | \n", "
28 | \n", "Dai | \n", "2 | \n", "2 | \n", "
29 | \n", "Daur | \n", "2 | \n", "3 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "
89 | \n", "Saami.DG | \n", "6 | \n", "11 | \n", "
90 | \n", "Saami_WGA | \n", "6 | \n", "12 | \n", "
91 | \n", "Sardinian | \n", "7 | \n", "0 | \n", "
92 | \n", "Saudi | \n", "7 | \n", "1 | \n", "
93 | \n", "Scottish | \n", "7 | \n", "2 | \n", "
94 | \n", "Selkup | \n", "7 | \n", "3 | \n", "
95 | \n", "Semende | \n", "7 | \n", "4 | \n", "
96 | \n", "She | \n", "7 | \n", "5 | \n", "
97 | \n", "Sherpa.DG | \n", "7 | \n", "6 | \n", "
98 | \n", "Sicilian | \n", "7 | \n", "7 | \n", "
99 | \n", "Spanish | \n", "7 | \n", "8 | \n", "
100 | \n", "Spanish_North | \n", "7 | \n", "9 | \n", "
101 | \n", "Syrian | \n", "7 | \n", "10 | \n", "
102 | \n", "Tajik | \n", "7 | \n", "11 | \n", "
103 | \n", "Thai | \n", "7 | \n", "12 | \n", "
104 | \n", "Tibetan.DG | \n", "8 | \n", "0 | \n", "
105 | \n", "Tu | \n", "8 | \n", "1 | \n", "
106 | \n", "Tubalar | \n", "8 | \n", "2 | \n", "
107 | \n", "Tujia | \n", "8 | \n", "3 | \n", "
108 | \n", "Turkish | \n", "8 | \n", "4 | \n", "
109 | \n", "Turkmen | \n", "8 | \n", "5 | \n", "
110 | \n", "Tuvinian | \n", "8 | \n", "6 | \n", "
111 | \n", "Ukrainian | \n", "8 | \n", "7 | \n", "
112 | \n", "Ulchi | \n", "8 | \n", "8 | \n", "
113 | \n", "Uygur | \n", "8 | \n", "9 | \n", "
114 | \n", "Uzbek | \n", "8 | \n", "10 | \n", "
115 | \n", "Xibo | \n", "8 | \n", "11 | \n", "
116 | \n", "Yakut | \n", "8 | \n", "12 | \n", "
117 | \n", "Yi | \n", "9 | \n", "0 | \n", "
118 | \n", "Yukagir | \n", "9 | \n", "1 | \n", "
119 rows × 3 columns
\n", "