a generalization of the fundamental theorem of symmetric functions (idea) by Swap

A symmetric function is a polynomial or rational function (quotient of polynomials) in n variables which remains invariant no matter how you permute variables (e.g. swap x₁ with x₂). They feature prominently in Galois theory. The elementary symmetric functions appear as the coefficients of a polynomial in n indeterminates (i.e. the coefficients of f(t) = (t - x₁)···(t - x_n) ), and the fundamental theorem of symmetric functions says that any symmetric function can be expressed as a polynomial or rational function of elementary symmetric functions. When the original function isn't symmetric, we can still say something interesting.

Theorem: Let g(x) be any polynomial where x = (x₁, ..., x_n) are n variables, and let s₁, ..., s_n be the elementary symmetric functions in n variables. Then g(x) can be written as a linear combination of monomials

x₁^ν₁ x₂^ν₂ ··· x_n^ν_n

such that ν_i ≤ i - 1 and the coefficients of the monomials are polynomials in the s_i.

This theorem, seemingly due to Emil Artin, is a slight generalisation of the fundamental theorem of symmetric functions. It gives the closest possible expression of any polynomial in terms of symmetric functions no matter if the original polynomial is symmetric or not. Or if you prefer, the fundamental theorm of symmetric functions comes as an easy corollary to this theorem.

The corollary is obvious. Observe that the nature of monomials is such that they can't be symmetrised, because the powers in the monomial have to be nondecreasing by indices. Thus, if the original polynomial g(x) was symmetric, then the only way it can still be symmetric after being written in this form is if the only monomial with nonzero coefficient is the one for which all the ν_i are zero, i.e. the constant term. But then the constant term is a polynomial of elementary symmetric functions, proving the corollary.

The proof is an algorithm for putting g(x) in the desired form.

Proof: Let f_n(t) := (t - x₁)(t - x₂ )···(t - x_n) = tⁿ - s₁t^n-1 + ··· + (-1)ⁿs_n and define recursively

f_{i - 1}(t) := f_i(t)/(t - x_i).

Three things are immediately clear:

The polynomial f_i(t) has x_i as a root, the other roots being the other x_j with j < i, because it's just f_n(t) with the last n - i linear factors divided away.

By synthetic division and by the recursive definition, the coefficients of f_i(t) are polynomials in terms of the elementary symmetric functions and the x_j with j > i.

The degree of f_i(t) is i.

Now for the algorithm to put g(x) in the desired form. Since x₁ is a root of f₁(t), it is possible to express x₁ in terms of the symmetric functions s_i and the rest of the x_i with i > 1. Substitute this expression of x₁ into g(x), and expand out the result, which does not contain any term with x₁ now.

We proceed recursively as follows. Since x₂ is a root of f₂(t), it is possible to express x₂² or any higher power in terms of the symmetric functions s_i and the rest of the x_i with i > 2, with perhaps a few terms of x₂ of degree less than 2. Substitute this expression of x₂² (or higher) into g(x), and expand out the result, which no longer contains any term with x₂² or higher degree.

Continuing in this process of eliminating all third powers of x₃ or higher with f₃(t), all fourth powers of x₄ or higher with f₄(t), we obtain the desired form for g(x).

QED.

Let's work out an example. Unfortunately, the only way to make an interesting enough example involves heavy computations. I will work out some steps of the example, but I will leave most of the boring manipulations to Maxima or to a diligent reader.

Let us consider the symmetric polynomial in 3 variables

g(x) = x₁²x₂ + x₁²x₃ + x₂²x₁ + x₂²x₃ + x₃²x₁ + x₃²x₂

Now, in 3 variables, the f_i(t) from the proof above are

f₃(t) = t³ - s₁t² + s₂t - s₃,
f₂(t) = t² + (x₃ - s₁)t + (s₂ - s₁x₃ + x₃²),
f₁(t) = t - s₁ + x₂ + x₃.

Recall that f₂ and f₁ are obtained by symbolic synthetic division of the polynomial above them and that the remainders are zero. Also, recall at this point that the elementary symmetric functions in three variables are

s₁ = x₁ + x₂ + x₃,
s₂ = x₁x₂ + x₁x₃ + x₂x₃,
s₃ = x₁x₂x₃.

Since f₁(x₁) = 0, f₂(x₂) = 0 and f₃(x₃) = 0, we obtain that

x₁ = s₁ - x₂ - x₃,
x₂² = s₁x₂ + s₁x₃ - s₂ - x₂x₃ - x₃²,
x₃³ = s₁x₃² - s₂x₃ + s₃.

So, the algorithm now says to replace this expression for x₁ into g(x), which after expanding everything out becomes

3x₂x₃² - s₁x₃² + 3x₂²x₃ - 4s₁x₂ x₃ + s₁²x₃ - s₁x₂² + s₁²x₂.

Note that we have succeeded in eliminating x₁ from this expression. Now we do the same with x₂², to obtain

-3x₃³ + 3s₁x₃² - 3s₂x₃ + s₁s₂.

Finally we replace x₃³ by its own expression to conclude that

g(x) = s₁s₂ - 3s₃,

which is the expression of g(x) in terms of elementary symmetric functions that we sought.

I'm not smart enough to use this trash can	proof of the fundamental theorem on symmetric polynomials	Emil Artin	symmetric function
Galois Theory	cbriver	favicon.ico	Guido Ubaldus' proof of the existence of God
From the Ashes of the Republic	Maxima	synthetic division