Moments, Mass, and Motion #2

If you didn’t read the first post then go read it and come back here!

Nearly two centuries before Chebyshev’s systematic approach to moments there lived a mathematical prodigy. A Cambridge graduate, educated on Copernicus and Galileo, he knew about mass and motion but felt bothered by them. You see, they weren’t the mass and motion we know today; they were fickle philosophical concepts — no math, just a general idea of what to look for. That fuzz bothered him so much that he decided to investigate and clarify; by 1687, now a professor of Mathematics, Issac Newton, published Principia. Unknowingly, he had created the core of physics and calculus.

So we are talking about physics today (my favorite application of moments)! But what’s the excitement for? You may be wondering how physics actually has anything to do with moments. We spent the last post covering moments of a PDE to learn the fundamentals; now that we understand the essence of the moment, we begin to play with what we can use. It turns out that we can generalize past a probability distribution to any type of distribution, such as a density function. But what is a density function? Why do moments generalize to it? What does that generalization reveal? Let’s cover that today!

Before we go any further let me introduce a few terms as they will be used in this post:

Density Function:

  • An object’s density function, $\rho(x)$ is a function that describes the object’s distribution of mass density at any point in space

Rigid Body:

  • An object whose density function does not change (as long as we look from the object’s point of view)
  • For example a liquid would not be a rigid body because depending on its position or motion, its distribution of density can change (ie the liquid moves)

Tensor:

  • A tensor is a set of numbers. The tensor has discrete numerical indices for each number. The amount of unique indices dictates the order of a tensor
  • For example, Bob has a shelf with two rows and four columns. He could represent his shelf as a 2X4 second order tensor (aka a matrix) and make a numerical id for the objects he puts into the shelf.
  • Unlike a shelf though, the tensor can’t have any weird ‘shape’. It can only be a parallelogram (like a rectangle or square)
 

Before we can generalize moments to a density function, we need to expand the concept of moments to multiple variables (after all we live in a 3D world, not a line). 

I think we should start by redefining the $X$ in $E(X)$. We will no longer consider $X$ as one variable, $x$, but a first order tensor or a vector, $[x1, x2, … , xn]$ (basically coordinates in euclidean space).
$$X=\begin{bmatrix}x_{1} \\ x_{2} \\ \vdots \\ x_{n} \end{bmatrix}$$
Next let’s update the function $E(X)$ itself.
$$E(X^{n}) = \int_{-\infty}^{\infty}f(x_{1},x_{2},\cdots,x_{n})(X)^{n}dX$$
It’s not immediately obvious how to update $X^{n}$. Even though $X^{n}$ can be simplified to $X\cdot X\cdot … \cdot X$, tensors have many different multiplication operations. These operations include tensor contractions and tensor products.

For raw moments we will generally use the tensor product but tensor contractions will be used when calculating skewness and kurtosis (oh yeah those two will show up again…). The tensor product will basically combine our n vectors into an n order tensor.

$$X^{n}=X\otimes X\otimes \cdots\otimes X = M_{n}$$ $$\begin{bmatrix}x_{1} \\ x_{2} \\ \vdots \\ x_{n} \end{bmatrix}^{2}=\begin{bmatrix}x_{1} \\ x_{2} \\ \vdots \\ x_{n} \end{bmatrix}\otimes \begin{bmatrix}x_{1} \\ x_{2} \\ \vdots \\ x_{n} \end{bmatrix}=\begin{bmatrix}x_{1}^{2} & x_{1}x_{2} & \cdots & x_{1}x_{n}\\ x_{1}x_{2} & x_{2}^{2} & \cdots & x_{2}x_{n} \\ \vdots & \vdots & \ddots & \vdots \\ x_{1}x_{n} & x_{2}x_{n} & \cdots & x_{n}^{2} \end{bmatrix}$$
The PDE becomes the density function, $\rho(X)$.
$$E(X^{n}) = \int_{-\infty}^{\infty}\rho(x_{1},x_{2},\cdots,x_{n})(X)^{n}dX$$

And finally we will integrate from negative infinity to infinity along each variable in $X$.

$$E(X^{n}) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\rho(x_{1},x_{2},\cdots,x_{n})(X)^{n}dx_{1}dx_{2}\cdots dx_{n}$$

As we covered last time, $X$ can be modified by centralization and standardization. We can do the exact same thing here! Centralization is just as straightforward. The center will subtract from $X$ before calculating the moment. Standardization is a bit more complicated. The variance will be a matrix, $\Sigma$. To ‘divide’ by $\sqrt{\Sigma}$ we will multiply $X$ by $\Sigma^{-\frac{1}{2}}$ (This is a thing from linear algebra).

Centralized

$$E(Y); E(Y); Y = X-\mu$$

Standardized

$$E(Y); Y = \Sigma^{-\frac{1}{2}}X$$

Both

$$E(Y); Y = \Sigma^{-\frac{1}{2}}(X-\mu)$$

Now that we are prepared, let’s traverse a density function’s moments!

While the PDE’s zeroth moment would mean nothing, $\rho(X)$’s zeroth moment actually means something. Because n would be zero, the moment would be the full sum of density. That makes the zeroth moment the mass!

The first moment in a PDE was the mean value or the center. Well, the density function’s first moment is essentially the same thing.  When finding the first moment we essentially weigh all the points in the object by the density making the first moment the product of mass and center of mass (com)!

You might be thinking that we should logically proceed by dividing out the mass from the first moment. The center of mass is a very important piece of information after all! Sure but doesn’t it seem weird that the first moment essentially gives the product of position and mass. Isn’t that awfully reminiscent of momentum, the products of velocity and mass, or force, acceleration and mass? I personally think this actually makes sense! Think of it like this… each bit of matter has the same ‘influence’. More matter has more ‘influence’. So when we take the zeroth moment, we measure not only the mass but the ‘influence’ the object has. The first moment then becomes the object’s position in a space weighed by ‘influence’. But here is the cool thing. Maybe our world truly exists in this ‘influence’ weighted position. And maybe that is why concepts like momentum, the product of mass(‘influence weight’) and velocity, stay constant within a system. It’s as if, without outside force, the ‘influence’ of an object’s motion stays the same. I think this thought process nicely complements any intuition surrounding conservation of momentum and potentially more complicated topics like relativity (mmm we’ll talk about that one soon…).

But I’m forgetting something. Can’t objects also rotate. You have probably heard of the moment of inertia. $$I=\int_{}^{}\int_{}^{}\int_{}^{}\rho(x,y,z)r^{2}dV$$ $$\text{where r is the distance from axis of rotation}$$ This formula looks awfully reminiscent of elements in the second moment. $$I=\int_{-\infty }^{\infty }\rho(x_{1},x_{2},x_{3})\begin{bmatrix}x_{1}^{2} & x_{1}x_{2} & x_{1}x_{3} \\ x_{1}x_{2} & x_{2}^{2} & x_{2}x_{3} \\ x_{1}x_{3} & x_{2}x_{3} & x_{3}^{2}\end{bmatrix}dX$$ Well, $\rho(X)$’s second centered moment is actually called an inertia tensor (the center is the pivot of rotation)! In 3 dimensions (4d and 2d is a mess of its own), this second order tensor can transform angular velocity into angular momentum. There is so much to talk about rigid body rotation (here’s a really cool video if you’re interested) but here is the key takeaway from what I understand. Unless the point of view is from the inertia plane(just like our weighted plane) where the angular momentum is entirely conserved, in normal space, only the angular momentum’s magnitude is conserved (this occurs because the inertia tensor would be rotating with the angular velocity). The actual direction results from the conservation of both rotational kinetic energy and angular momentum. While linear motion would get a singular direction from conserving both energy and momentum, rotational motion could be a singular direction or an entire set of directions that repeat during rotation (the video has a very pretty way of showing this). This (very oversimplified) understanding explains why certain axes of rotation are unstable.

(try flipping your phone in a similar way if you want to see the tennis racket phenomenon in action)

Past the second moment, we have reached speculation territory. I have never seen anyone cover the uses for the third moment in physics. What I have figured out though is that the skewness (as seen here) would (like the center of mass for linear motion) describe the object’s rotational position, not like an angle but like a kind of vector that would move with rotation. A nice intuition behind this would be that a rotationally symmetrical object, a sphere, would have no skewness meaning that all of its rotational positions would be indistinguishable!

And even further down the baseless but compelling thoughts, there might be more motion that could be described with higher moments! I personally don’t think so, but that would be the topic for another day. Today I invite y’all to speculate with me. I want to see what paths y’all will come up with; what connections you will make.
I hope y’all enjoyed this one!! This was probably the toughest but most rewarding post to explain semi coherently (I have been sitting here for hours and now my back hurts). Moments have been on my mind for about a year now and to finally be able to think them through has given me great joy.
I am 99.9% certain I made some mistake, so as always, feel free to converse, correct, and comment! Go have a wonderful day and thank you!