03-broadcasting.html

<!DOCTYPE html>
<html>
  <head>
    <meta charset="utf-8">
    <meta name="generator" content="pandoc">
    <title>Software Carpentry: Advanced NumPy</title>
    <link rel="shortcut icon" type="image/x-icon" href="/favicon.ico" />
    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
    <link rel="stylesheet" type="text/css" href="css/bootstrap/bootstrap.css" />
    <link rel="stylesheet" type="text/css" href="css/bootstrap/bootstrap-theme.css" />
    <link rel="stylesheet" type="text/css" href="css/swc.css" />
    <link rel="alternate" type="application/rss+xml" title="Software Carpentry Blog" href="http://software-carpentry.org/feed.xml"/>
    <meta charset="UTF-8" />
    <!-- HTML5 shim, for IE6-8 support of HTML5 elements -->
    <!--[if lt IE 9]>
      <script src="http://html5shim.googlecode.com/svn/trunk/html5.js"></script>
    <![endif]-->
  </head>
  <body class="lesson">
    <div class="container card">
      <div class="banner">
        <a href="http://software-carpentry.org" title="Software Carpentry">
          <img alt="Software Carpentry banner" src="img/software-carpentry-banner.png" />
        </a>
      </div>
      <article>
      <div class="row">
        <div class="col-md-10 col-md-offset-1">
                    <a href="index.html"><h1 class="title">Advanced NumPy</h1></a>
          <h2 class="subtitle">Broadcasting</h2>
          <section class="objectives panel panel-warning">
<div class="panel-heading">
<h2 id="learning-objectives"><span class="glyphicon glyphicon-certificate"></span>Learning Objectives</h2>
</div>
<div class="panel-body">
<p>After the lesson learner:</p>
<ul>
<li>Knows how to add a scalar to all elements of an array.</li>
<li>Can predict the result of additions of matrices and row or column vectors.</li>
<li>Can explain why broadcasting is better than using for loops.</li>
<li>Understands the rules of broadcasting and can predict the shape of broadcasted arrays.</li>
<li>Knows how to control broadcasting using <code>np.newaxis</code> object.</li>
</ul>
</div>
</section>
<p>It’s possible to do operations on arrays of different sizes. In some case NumPy can transform these arrays automatically so that they all have the same size: this conversion is called <strong>broadcasting</strong>.</p>
<div class="figure">
<img src="fig/numpy_broadcasting.png" title="numpy broadcasting in 2D" alt="numpy broadcasting in 2D" />
<p class="caption">numpy broadcasting in 2D</p>
</div>
<p>Let’s try to reproduce the above diagram. First, we create two one-dimensional arrays:</p>
<pre><code>&gt;&gt;&gt; a = np.arange(4) * 10
&gt;&gt;&gt; b = np.arange(3)
&gt;&gt;&gt; a
array([ 0, 10, 20, 30])
&gt;&gt;&gt; b
array([0, 1, 2])</code></pre>
<p>We can tile them in 2D using <code>np.tile</code> function:</p>
<pre><code>&gt;&gt;&gt; b2 = np.tile(b, (4, 1))
&gt;&gt;&gt; b2
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])</code></pre>
<p>We do the same with the second array, but we need also to transpose (exchange columns with rows) the resulting array:</p>
<pre><code>&gt;&gt;&gt; a2 = np.tile(a, (3, 1))
&gt;&gt;&gt; a2 = a2.T
&gt;&gt;&gt; a2
array([[ 0,  0,  0],
       [10, 10, 10],
       [20, 20, 20],
       [30, 30, 30]])</code></pre>
<p>Note that the <code>np.tile</code> function creates new arrays and copies the data. Then you can add the arrays element-wise:</p>
<pre><code>&gt;&gt;&gt; a2 + b2
array([[ 0,  1,  2],
       [10, 11, 12],
       [20, 21, 22],
       [30, 31, 32]])</code></pre>
<p>In the second example we add a one-dimensional array to a two-dimensional array. NumPy will automatically “tile” the 1D array along the missing direction:</p>
<pre><code>&gt;&gt;&gt; a2 + b
array([[ 0,  1,  2],
       [10, 11, 12],
       [20, 21, 22],
       [30, 31, 32]])</code></pre>
<p>However, in this case no copy of <code>b</code> array is involved. NumPy will instead use the same data in <code>b</code> for each row of <code>a</code> – we will cover the mechanism behind it at the end of the lesson.</p>
<p>In the third example we add a single column with a single vector. To obtain a column array from a 1D array we need to convert it to 2D array of four rows and one column. In NumPy we can add singular dimensions (dimensions of size 1) by a special object <code>np.newaxis</code>:</p>
<pre><code>&gt;&gt;&gt; a.shape
(4,)
&gt;&gt;&gt; a_column = a[:, np.newaxis]
&gt;&gt;&gt; a_column.shape
(4, 1)
&gt;&gt;&gt; a_column
array([[ 0],
       [10],
       [20],
       [30]])</code></pre>
<p>We can add a column vector and a 1D array:</p>
<pre><code>&gt;&gt;&gt; a_column + b
array([[ 0,  1,  2],
       [10, 11, 12],
       [20, 21, 22],
       [30, 31, 32]])</code></pre>
<p>This is the same as adding a column and row vector:</p>
<pre><code>&gt;&gt;&gt; b_row = b[np.newaxis, :]
&gt;&gt;&gt; b_row
array([[0, 1, 2]])
&gt;&gt;&gt; b_row.shape
(1, 3)
&gt;&gt;&gt; a_column + b_row
array([[ 0,  1,  2],
       [10, 11, 12],
       [20, 21, 22],
       [30, 31, 32]])</code></pre>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="normalising-data"><span class="glyphicon glyphicon-pencil"></span>Normalising data</h2>
</div>
<div class="panel-body">
<p>Given the following array:</p>
<pre><code>a = np.random.rand(10, 100) </code></pre>
<p>For each column of <code>a</code> subtract its mean. Next, do the same with rows.</p>
</div>
</section>
<p>Broadcasting seems a bit magical, but it is actually quite natural to use it when we want to solve a problem whose output data is an array with more dimensions than input data. There a simple rule that allow to determine the validity of broadcasting and the shape of broadcasted arrays:</p>
<blockquote>
<p>In order to broadcast, the size of the trailing axes for both arrays in an operation must either be the same or one of them must be one.</p>
</blockquote>
<p>This does indeed work for the three addition from the figure</p>
<pre><code>a:      4 x 3 
b:      4 x 3
result: 4 x 3

a:      4 x 3
b:          3
result: 4 x 3

a:      4 x 1
b:          3
result: 4 x 3</code></pre>
<p>Lets look at two more examples:</p>
<pre><code>Image  (3d array): 256 x 256 x 3
Scale  (1d array):             3
Result (3d array): 256 x 256 x 3

A      (4d array):  8 x 1 x 6 x 1
B      (3d array):      7 x 1 x 5
Result (4d array):  8 x 7 x 6 x 5</code></pre>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="broadcasting-rules"><span class="glyphicon glyphicon-pencil"></span>Broadcasting rules</h2>
</div>
<div class="panel-body">
<p>Given the arrays:</p>
<pre><code>X = np.random.rand(10,3)
Y = np.random.rand(3)</code></pre>
<p>which of the following will <em>not</em> produce an error:</p>
<ol style="list-style-type: lower-alpha">
<li><p><code>X + Y</code></p></li>
<li><p><code>X[np.newaxis, :] + Y</code></p></li>
<li><p><code>X + Y[:, np.newaxis]</code></p></li>
<li><p><code>X[:, np.newaxis] + Y</code></p></li>
<li><p><code>X + Y[np.newaxis, :]</code></p></li>
<li><p><code>X[:, np.newaxis, :] + Y</code></p></li>
</ol>
<p>What will be the shapes of the final broadcasted arrays? Try to guess and then check.</p>
</div>
</section>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="three-dimensional-broadcasting"><span class="glyphicon glyphicon-pencil"></span>Three-dimensional broadcasting</h2>
</div>
<div class="panel-body">
<p>Below, produce the array containing the sum of every element in <code>x</code> with every element in <code>y</code></p>
<div class="sourceCode"><pre class="sourceCode python"><code class="sourceCode python">x <span class="op">=</span> np.random.rand(<span class="dv">3</span>, <span class="dv">5</span>)
y <span class="op">=</span> np.random.randint(<span class="dv">10</span>, size<span class="op">=</span><span class="dv">8</span>)
z <span class="op">=</span> x <span class="co"># FIX THIS</span></code></pre></div>
</div>
</section>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="broadcasting-indices"><span class="glyphicon glyphicon-pencil"></span>Broadcasting indices</h2>
</div>
<div class="panel-body">
<p>Predict and verify the shape of <code>y</code>:</p>
<div class="sourceCode"><pre class="sourceCode python"><code class="sourceCode python">x <span class="op">=</span> np.empty((<span class="dv">10</span>, <span class="dv">8</span>, <span class="dv">6</span>))

idx0 <span class="op">=</span> np.zeros((<span class="dv">3</span>, <span class="dv">8</span>)).astype(<span class="bu">int</span>)
idx1 <span class="op">=</span> np.zeros((<span class="dv">3</span>, <span class="dv">1</span>)).astype(<span class="bu">int</span>)
idx2 <span class="op">=</span> np.zeros((<span class="dv">1</span>, <span class="dv">1</span>)).astype(<span class="bu">int</span>)

y <span class="op">=</span> x[idx0, idx1, idx2]</code></pre></div>
</div>
</section>
<p>A lot of grid-based or network-based problems can also use broadcasting. For instance, if we want to compute the distance from the origin of points on a 10x10 grid, we can do</p>
<pre><code>&gt;&gt;&gt; x = np.arange(5)
&gt;&gt;&gt; y = np.arange(5)[:, np.newaxis]
&gt;&gt;&gt; distance = np.sqrt(x ** 2 + y ** 2)
&gt;&gt;&gt; distance
array([[ 0.        ,  1.        ,  2.        ,  3.        ,  4.        ],
       [ 1.        ,  1.41421356,  2.23606798,  3.16227766,  4.12310563],
       [ 2.        ,  2.23606798,  2.82842712,  3.60555128,  4.47213595],
       [ 3.        ,  3.16227766,  3.60555128,  4.24264069,  5.        ],
       [ 4.        ,  4.12310563,  4.47213595,  5.        ,  5.65685425]])</code></pre>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="creating-a-two-dimensional-grid"><span class="glyphicon glyphicon-pencil"></span>Creating a two-dimensional grid</h2>
</div>
<div class="panel-body">
<p>What are the dimensionalities of <code>x</code>, <code>y</code> and <code>z</code> in the two cases:</p>
<pre><code>x, y = np.mgrid[:10, :5]
z = x + y</code></pre>
<p>and</p>
<pre><code>x, y = np.ogrid[:10, :5]
z = x + y</code></pre>
<p>What might be the advantage of using <code>np.ogrid</code> over <code>np.mgrid</code>?</p>
</div>
</section>
<aside class="callout panel panel-info">
<div class="panel-heading">
<h2 id="worked-example-route-66"><span class="glyphicon glyphicon-pushpin"></span>Worked example: Route 66</h2>
</div>
<div class="panel-body">
<p>Let’s construct an array of distances (in miles) between cities of Route 66: Chicago, Springfield, Saint-Louis, Tulsa, Oklahoma City, Amarillo, Santa Fe, Albuquerque, Flagstaff and Los Angeles.</p>
<pre><code>&gt;&gt;&gt; mileposts = np.array([0, 198, 303, 736, 871, 1175, 1475, 1544,
...        1913, 2448])
&gt;&gt;&gt; distance_array = np.abs(mileposts - mileposts[:, np.newaxis])
&gt;&gt;&gt; distance_array
array([[   0,  198,  303,  736,  871, 1175, 1475, 1544, 1913, 2448],
       [ 198,    0,  105,  538,  673,  977, 1277, 1346, 1715, 2250],
       [ 303,  105,    0,  433,  568,  872, 1172, 1241, 1610, 2145],
       [ 736,  538,  433,    0,  135,  439,  739,  808, 1177, 1712],
       [ 871,  673,  568,  135,    0,  304,  604,  673, 1042, 1577],
       [1175,  977,  872,  439,  304,    0,  300,  369,  738, 1273],
       [1475, 1277, 1172,  739,  604,  300,    0,   69,  438,  973],
       [1544, 1346, 1241,  808,  673,  369,   69,    0,  369,  904],
       [1913, 1715, 1610, 1177, 1042,  738,  438,  369,    0,  535],
       [2448, 2250, 2145, 1712, 1577, 1273,  973,  904,  535,    0]])</code></pre>
<div class="figure">
<img src="fig/route66.png" alt="Distances on Route 66" />
<p class="caption">Distances on Route 66</p>
</div>
</div>
</aside>
<section class="challenge panel panel-success">
<div class="panel-heading">
<h2 id="distances"><span class="glyphicon glyphicon-pencil"></span>Distances</h2>
</div>
<div class="panel-body">
<p>Given an array of latitudes and longitudes of major European capitals calculate pairwise distances between them. Use the approximate formula:</p>
<p><span class="math display">\[D=6371.009\sqrt{(\Delta\phi)^2 + (\Delta\lambda)^2}\qquad \text{(in kilometers)},\]</span></p>
<p>where <span class="math inline">\(\Delta\phi=\phi_1-\phi_2\)</span> and <span class="math inline">\(\Delta\lambda=\lambda_1-\lambda_2\)</span> are the differences between the latitudes and longitude of two cities in radians. (<em>Hint</em>: To convert degrees to radians multiply them by <span class="math inline">\(\pi/180\)</span>).</p>
<pre><code>coords = np.array([
                  [ 23.71666667,  37.96666667], # Athens
                  [ 13.38333333,  52.51666667], # Berlin
                  [ -0.1275    ,  51.50722222], # London
                  [ -3.71666667,  40.38333333], # Madrid
                  [  2.3508    ,  48.8567    ], # Paris
                  [ 12.5       ,  41.9       ]  # Rome
]) </code></pre>
<p>When you are done you can compare the results with a more <a href="https://en.wikipedia.org/wiki/Geographical_distance#Spherical_Earth_projected_to_a_plane">precise formula</a>:</p>
<p><span class="math display">\[D=6371.009\sqrt{(\Delta\phi)^2 + (\cos(\phi_m)\Delta\lambda)^2}\]</span></p>
<p>where <span class="math inline">\(\phi_m = (\phi_1+\phi_2) / 2\)</span> is the mean latitude.</p>
</div>
</section>
        </div>
      </div>
      </article>
      <div class="footer">
        <a class="label swc-blue-bg" href="http://software-carpentry.org">Software Carpentry</a>
        <a class="label swc-blue-bg" href="https://github.com/paris-swc/advanced-numpy-lesson">Source</a>
        <a class="label swc-blue-bg" href="mailto:admin@software-carpentry.org">Contact</a>
        <a class="label swc-blue-bg" href="LICENSE.html">License</a>
      </div>
    </div>
    <!-- Javascript placed at the end of the document so the pages load faster -->
    <script src="http://software-carpentry.org/v5/js/jquery-1.9.1.min.js"></script>
    <script src="css/bootstrap/bootstrap-js/bootstrap.js"></script>
    <script src='https://cdn.mathjax.org/mathjax/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML'></script>
  </body>
</html>