# Why is a group?

Often when people talk about groups, they say something like: groups are objects that encode the notion of symmetry.
After working a bit with groups and group actions, it’s easy to convince yourself this is the case,
but this sort of *a posteriori* explanation might seem a little circular—at least, it does to me.

For those who haven’t heard this, these next few sentences are for you.
A *group* is an object encoding a possible kind of symmetry.
The symmetries understood by a group are seen (or more evocatively, realized or effected) via *group actions* of on an object ,
and sufficiently good actions are branded with adjectives like *transitive* or *faithful* or *regular*.
We record that a set enjoys some particular -symmetry by equipping it with a representative -action, and call the result a *-set*.

What follows is (my attempt at) a more intrinsic explanation. Readers familiar with category theory might accuse me of cheating here, because it looks like I’m just reading off the categorical model of a group, but I claim that that’s just a consequence of category theory being such a natural abstraction. Plus I’m gonna talk about semigroups later and categories can’t handle that so eat it, nerd.

Consider an object . In order not to get too stuck on explaining why objects should be modelled as sets, and to not have to appeal to something like topos theory, I’ll treat this notion as a black box. But objects are almost always modelled as sets, so think of as a set.

We will say a **symmetry** of is a transformation of —for instance, a function from to itself, if it were a set—such that the image is “the same” as .
That is, after we apply the transformation, we recognize that has remained unchanged in some way.
Note that we want “is the same as” to be an equivalence relation, because it would be really weird if it wasn’t.
For instance, the identity map should be a symmetry, because clearly no change at all has occurred to .

We will ask that the collection of symmetries of obey two natural rules. First, if and are two symmetries of , then their composition should also be a symmetry. If looks the same after applying , we should be able to apply afterwards, and the result should again not change in any meaningful way. Note that function composition is an associative operation.

Second, we will ask that if the transformation is a symmetry, then it can be undone, i.e. there exists a symmetry such that is the identity transformation. This make sense too: if looks the same as , then we can apply to find that should look the same as . Since “looks the same as” is an equivalence relation, it’s symmetric, so looks the same as . Observe now that is a symmetry, so it has an inverse, and then we can prove that

so it doesn’t matter if we undo beforehand or afterwards.

Let be the set of symmetries of . is closed under the associative binary operation ; and has an identity element with respect to that operation; and every element has a two-sided compositional inverse. Furthermore, consists of maps , so there is a natural “action” , which is just an inclusion, that associates each symmetry to the corresponding (endo)transformation of . Denoting with a slight abuse of notation the action of on by , we verify and .

Because we can interface with through the action, it really doesn’t matter what the elements of are,
so long as we know how to associate them to transformations that behave correctly.
A **group**, then, is an abstract version of a collection of symmetries.
Namely, it comprises a set equipped with an associative binary operation , such that there exists an identity , and every element has an inverse .
is a group under the operation of composition.

Likewise, an abstract **group action** is an operation assigning elements of to transformations of .
acts naturally on by embedding into .

The astute will notice that the action of on arising in this way is always **effective**,
i.e. that if two transformations that act the exact same way are equal.
Relaxing this in general for our definitions causes no harm, because after some basic group theory,
we find that any action can be viewed as a homomorphism of groups, and thus has a *kernel* .
The action is effective—more commonly referred to as *faithful*—iff is trivial,
but if is nontrivial, we can excise that redundancy by taking the (faithful) action of the *quotient group* on .

We can confirm that groups are precisely the objects that arise in this way,
because every group acts *regularly*—and hence faithfully—on itself by left multiplication.
Explicitly, we can model as for as a set, and taking its symmetries to be all and only the translations by elements of .
This is precisely the content of Cayley’s theorem.

By relaxing our notion of symmetry, we can obtain more general objects.
For instance, if we relax the inverse condition, and replace the ‘same as’ relation by a more nebulous ‘part of’ relation, we end up simply looking at all endotransformations, and obtain *monoids* and monoid actions.

If we relax even further to merely -closed subsets of transformations, then we obtain *semigroups* and semigroup actions.
The only surviving guarantee is the associativity provided by .

As in the group case, there exists a Cayley theorem for semigroups, realizing every semigroup as an effective transformation semigroup of some object. For the curious, the usual choice of object is the image of under the free functor , which adjoins a two-sided identity if none is present. (This confirms the sneaking suspicion some of you might have that it doesn’t cause any trouble in general to make sure to include in the subcollection of transformations.)