\n", "
\n", "$$\\mbox{minimize} \\quad \\sum_{i=1}^{n}\\sum_{j=1}^{m} D(X_i,Y_j)^p\\,\\pi_{ij}$$\n", "
\n", "$$\\mbox{s.t.} \\quad \\sum_{j=1}^{m} \\pi_{ij} = \\mu_i, \\quad i = 1,2,\\dots,n$$\n", "
\n", "$$\\quad \\sum_{i=1}^{n} \\pi_{ij} = \\upsilon_j, \\quad j = 1,2,\\dots,m$$\n", "
\n", "$$\\pi_{ij} \\geq 0, \\quad \\forall\\, i,j$$\n", "
\n", "where $D(X_i,Y_j)$ is the distance function.\n", "
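\n", "As a small worked instance (an illustrative assumption, not taken from the data used in this notebook), let $n=m=2$, $p=1$, $X=Y=(0,1)$ on the real line with $D(X_i,Y_j)=|X_i-Y_j|$, $\\mu=(\\tfrac12,\\tfrac12)$ and $\\upsilon=(\\tfrac14,\\tfrac34)$. The marginal constraints leave one free parameter $t=\\pi_{11}\\in[0,\\tfrac14]$:\n", "
\n", "$$\\pi_{11}=t, \\quad \\pi_{12}=\\tfrac12-t, \\quad \\pi_{21}=\\tfrac14-t, \\quad \\pi_{22}=\\tfrac14+t, \\qquad \\sum_{i,j} D(X_i,Y_j)\\,\\pi_{ij} = \\pi_{12}+\\pi_{21} = \\tfrac34-2t$$\n", "
\n", "so the minimum is attained at $t=\\tfrac14$ with cost $\\tfrac14$: a quarter of the mass moves a unit distance.\n", "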
" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Wasserstein Barycenter with regularization" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The entropy regularized barycenter problem with $p=2$ is:\n", "
\n", "$$\\mbox{minimize} \\quad \\frac1N \\sum_{k=1}^{N}\\sum_{i,j} D(X_i,Y_j)^2\\,\\pi_{ij}^k + \\frac1\\lambda\\sum_{k=1}^{N}\\sum_{i,j} \\pi_{ij}^k\\log(\\pi_{ij}^k)$$\n", "
\n", "$$\\mbox{s.t.} \\quad \\sum_{j} \\pi_{ij}^{k} = \\mu_i, \\quad \\forall\\, k,i \\quad (1)$$\n", "
\n", "$$\\quad \\sum_{i} \\pi_{ij}^{k} = \\upsilon_j^{k}, \\quad \\forall\\, k,j \\quad (2)$$\n", "
\n", "$$\\pi_{ij}^{k} \\geq 0, \\quad \\forall\\, k,i,j$$\n", "
\n", "where $D(X_i,Y_j)$ is the Euclidean distance between pixels, $\\lambda = \\mathrm{median}(D(X_i,Y_j))$ and $N$ is the number of samples.\n", "\n", "Without the entropy term, the problem reduces to the linear program of computing a distribution $\\mu$ that minimizes the sum of distances to the samples $\\upsilon^k$, as studied in our other notebook. Entropy regularization was suggested to us by Stefano Gualandi and appears, for example, in the paper by Cuturi and Doucet http://proceedings.mlr.press/v32/cuturi14.pdf. This paper also contains more details about the choice of $\\lambda$. More detailed information about the LP approach to the Wasserstein metric can be found in [Stefano Gualandi's blogpost](http://stegua.github.io/blog/2018/12/31/wasserstein-distances-an-operations-research-perspective/).\n", "\n", "In this problem, the Wasserstein barycenter of threes is visualized on $28 \\times 28$ images, using $2$ handwritten '3' digits from the MNIST database. 