Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
gmcninch-tufts
GitHub Repository: gmcninch-tufts/2024-Sp-Math190
Path: blob/main/course-contents/2024-02-07--notes-RT--diaconis-paper.md
918 views
---
title: Representations and the symmetric group - Diaconis data date: 2024-02-07
---

character table of S5S_5

Let's use GAP to find the character table of S5S_5.

gap> G:=SymmetricGroup(5); Sym( [ 1 .. 5 ] ) gap> tab:=CharacterTable(G); CharacterTable( Sym( [ 1 .. 5 ] ) ) gap> Display(tab); CT1 2 3 2 3 1 1 2 . 3 1 1 . 1 1 . . 5 1 . . . . . 1 1a 2a 2b 3a 6a 4a 5a 2P 1a 1a 1a 3a 3a 2b 5a 3P 1a 2a 2b 1a 2a 4a 5a 5P 1a 2a 2b 3a 6a 4a 1a X.1 1 -1 1 1 -1 -1 1 X.2 4 -2 . 1 1 . -1 X.3 5 -1 1 -1 -1 1 . X.4 6 . -2 . . . 1 X.5 5 1 1 -1 1 -1 . X.6 4 2 . 1 -1 . -1 X.7 1 1 1 1 1 1 1 gap>

Diaconis example -- survey data

This data is taken from the paper [@diaconisGeneralizationSpectralAnalysis1989]

It describes 5,738 completed ballots rank-ordering 5 candidates.

View a rank-ordered ballot as an element of the symmetric group S5S_5; we want to study the frequency function ff.

first ranking table

the regular representation

This diagram shows the decomposition of the regular representation into isotypic components.

Be careful: the notation Diaconis is using here does not match that used by GAP above. For example, the representation Diaconis writes as V3V_3 is the isotypic component determined by the irreducible representation labeled X.5 by GAP.

The second row reflects the decomposition of the frequency function ff. Namely, write $$f = \sum_{i=1}^7 f_i \quad \text{with $f_i \in V_iParseError: KaTeX parse error: Expected 'EOF', got '}' at position 1: }̲.$

The second row entries are the "sums of squares" fi,fi\langle f_i,f_i \rangle.

Remember that we can compute the fif_i using the idempotents in C[G]\mathbb{C}[G].

For example,

f1=15!σS5σ.ff_1 = \dfrac{1}{5!} \sum_{\sigma \in S_5} \sigma.f

More generally, if χi\chi_i denotes the character of the irreducible representation LiL_i with Vi=C[G](Li)V_i = \mathbb{C}[G]_{(L_i)} then fi=15!σS5χi(σ1)σ.ff_i = \dfrac{1}{5!} \sum_{\sigma \in S_5} \chi_i(\sigma^{-1}) \sigma.f

Note that f3,f3=459\langle f_3,f_3 \rangle = 459 is relatively large (ignoring f1,f1\langle f_1,f_1 \rangle since f1f_1 is trivial).

normalizing the first-order data

THe i,ji,j entry in this table is the number of votes ranking candidate ii in the jj-th position, minus the sample size over 5.

In particular, rows and columns sum to 0.

This normalization can also be achieved as follows:

Let f2f_2 be the projection on V2V_2, and consider the functions σδi,σ(j).\sigma \mapsto \delta_{i,\sigma(j)}.

The i,ji,j entry of the preceding table is f2,δi,σ(j)\langle f_2 , \delta_{i,\sigma(j)} \rangle

Interpretation in this last table:

Compute the projection f3f_3 of ff into the component V3V_3 of M=C[S5]M = \mathbb{C}[S_5].

Now, consider the easily understood functions σδ{i,i},{σ(j),σ(j)}\sigma \mapsto \delta_{\{i,i'\},\{\sigma(j),\sigma(j')\}} in C[S5]\mathbb{C}[S_5] for distinct i,ii,i' and distinct j,jj,j'.

The space of these functions is a 100 dimensional subspace of WW C[G]\mathbb{C}[G].

The entries in the table are the inner products f3,δ{i,i},{σ(j),σ(j)}\langle f_3 , \delta_{\{i,i'\},\{\sigma(j),\sigma(j')\}} \rangle

Summary observations

The data were to elect a president for the American Psychological Association. Candidates 1 and 3 were clinicians while candidates 4 and 5 were academicians, two groups within the association with somewhat divergent perspectives.

In the second-order table, we see a preference for candidates 1 & 3 witnessed by the entry 376 corresponding to the entry for candidates {1,3}\{1,3\} and ranks {1,2}\{1,2\}.

And we see a (slightly smaller) preference for candidates 4 and 5 witnessed by the entry 296 corresponding to the entry for candidates {4,5}\{4,5\} and ranks {1,2}\{1,2\}.

Bibliography

::: {.refs} :::