Bijections

Chapter 3: Bijections and Combinatorial Proofs #

If one proves the equality of two numbers aa and bb by showing first that aba \leq b and then that ab,a \geq b, it is unfair; one should instead show that they are really equal by disclosing the inner ground for their equality.

— Emmy Noether

Hence, counting is sometimes just a pretext and the important thing is to understand, or discover, a structure in some discrete objects.

— Mireille Bousquet-Mélou

Our first general tool for the study of combinatorial classes is the notion of a bijection. Bijections allow us to use information about simple classes and classes we already understand to study complicated classes and classes we haven’t seen before.

Let AA and BB be two sets. We say that a function f:ABf:A\rightarrow B is

  • injective if aaa \neq a^\prime implies f(a)f(a)f(a) \neq f(a^\prime) for any a,aA,a,a^\prime \in A,
  • surjective if for every bBb \in B there exists some aAa \in A with f(a)=b,f(a)=b,
  • bijective if it is injective and surjective.

If there is a bijection from AA to BB then we say that AA and BB are in bijection.

Remark: An equivalent formulation for injectiveness is that ff is injective if for all a,aAa,a^\prime\in A the equality f(a)=f(a)f(a)=f(a^\prime) implies a=a.a=a^\prime. This is the contrapositive to the statement above.

Because a bijection takes distinct elements of AA to distinct elements of B,B, and every element of BB is mapped to by some element of A,A, a bijection “pairs up” the elements of AA and BB (when AA and BB are infinite we usually define them to have the same size if and only if they are in bijection).

Exercise: Prove that if AA and BB are finite sets in bijection then A=B.|A|=|B|.
Figure: A bijection from AA to BB pairs up the elements of AA and B.B.

If f:ABf:A \rightarrow B then we call a function g:BAg:B\rightarrow A the inverse of ff if g(f(a))=a for all aA  and  f(g(b))=b for all bB. g(f(a)) = a \text{ for all aAa \in A \textbf{ and } } f(g(b)) = b \text{ for all bB.b \in B.} In other words, gg is the inverse of ff if applying the functions consecutively (in either order) to any element always gives back the starting element. We usually use the notation f1f^{-1} for the inverse of ff when it exists.

Lemma 1: The function f:ABf:A\rightarrow B is a bijection if and only if ff has an inverse.
Proof of Lemma 1

First, suppose that ff has an inverse f1.f^{-1}. Then ff is injective since applying the inverse to both sides of the equation f(a)=f(a)f(a)=f(a^\prime) implies a=a,a=a^\prime, and ff is surjective since for any bBb \in B the element f1(b)Af^{-1}(b) \in A satisfies f(f1(b))=b.f(f^{-1}(b))=b. Thus, when ff has an inverse then ff is a bijection.

Conversely, suppose that ff is a bijection. Then, since ff is surjective, for any bBb \in B there exists some element g(b)Ag(b) \in A such that f(g(b))=b.f(g(b)) = b. Since ff is injective the element g(b)g(b) is unique, so the map bg(b)b \mapsto g(b) is a well-defined function. By construction f(g(b))=bf(g(b)) = b for any bB.b \in B. Furthermore, for any aAa \in A we have f(g(f(a)))=f(a)f(g(f(a))) = f(a) by applying the definition of g(b)g(b) with b=f(a),b=f(a), so ff being injective implies g(f(a))=a.g(f(a))=a. We thus see that when ff is a bijection then it has an inverse.

The two most common ways of proving sets AA and BB are in bijection are thus

  1. Define a map f:ABf:A\rightarrow B and then prove

    • that ff is well-defined (it actually takes elements of AA to elements of BB)
    • that ff is injective, and
    • that ff is surjective.
  2. Define two maps f:ABf:A\rightarrow B and g:BA,g:B\rightarrow A, and prove

    • that both maps are well-defined,
    • that g(f(a))=ag(f(a)) = a for all aA,a \in A, and
    • that f(g(b))=bf(g(b)) = b for all bB.b \in B.

Example: Prove that the set of integers Z\mathbb{Z} and the set of even integers 2Z={,4,2,0,2,4,}2\mathbb{Z} = \{\dots,-4,-2,0,2,4,\dots\} are in bijection.

Click for Answer
Define the functions f:Z2Zf:\mathbb{Z}\rightarrow2\mathbb{Z} and g:2ZZg:2\mathbb{Z}\rightarrow\mathbb{Z} by f(z)=2zf(z)=2z and g(z)=z/2.g(z) = z/2. Note that ff and gg are well-defined since ff always outputs an even integer and gg always outputs an integer (because it only takes even integers as input). Then g(f(a))=(2a)/2=ag(f(a))=(2a)/2=a and f(g(b))=2(b/2)=bf(g(b))=2(b/2)=b for all aZa\in\mathbb{Z} and b2Zb\in2\mathbb{Z} so gg is the inverse function of ff and ff is a bijection.

A bijection of combinatorial classes (A,A)(A,|\cdot|_A) and (B,B)(B,|\cdot|_B) is a bijection f:ABf:A\rightarrow B between the set of objects in each class that preserves size, meaning f(a)B=aA|f(a)|_B = |a|_A for every aA.a \in A. Two combinatorial classes that are in bijection have the same counting sequences.

Example: A permutation matrix of size nn is an n×nn\times n matrix with entries in {0,1}\{0,1\} where every row and every column has exactly one 1. Prove that the class SS of permutations and the class MM of permutation matrices are in bijection.

Click for Answer

Let f:SMf:S \rightarrow M be the function that takes a permutation σ=σ1σn\sigma = \sigma_1 \cdots \sigma_n of size nn and returns the n×nn \times n matrix with ones in positions (1,σ1),,(n,σn).(1,\sigma_1),\dots,(n,\sigma_n).

We prove that ff is a bijection by showing that

  1. ff is well-defined,
  2. ff is injective, and
  3. ff is surjective.

We prove these properties in order.

  1. If σ\sigma is a permutation then {σ1,,σn}={1,,n},\{\sigma_1,\dots,\sigma_n\} = \{1,\dots,n\}, so f(σ)f(\sigma) is an n×nn\times n matrix with one 1 in each row and each column.
  2. If f(σ)=f(τ)f(\sigma)=f(\tau) then σ\sigma and τ\tau must be permutations of the same size nn and σj=τj\sigma_j = \tau_j for each 1jn,1 \leq j \leq n, so σ=τ.\sigma = \tau.
  3. Let LL be a matrix in M.M. If LL has ones in positions (1,j1),,(n,jn)(1,j_1),\dots,(n,j_n) then σ=j1  j2    jn\sigma = j_1 \; j_2 \; \cdots \; j_n is a permutation of size nn and f(σ)=L.f(\sigma)=L.

Thus, ff is a bijection between the combinatorial classes SS and M.M.

How do we find bijections? Bijective combinatorics can be more of an art than a science, however there are some useful tips to keep in mind.

  • Look at small examples. If you want to prove that two classes with objects AA and BB are in bijection then try to pair up the objects of size 1 in AA with the objects of size 1 in B.B. Try to do the same thing for objects of sizes 2 and 3. Can you see a general pattern to pair up objects of any size nn?

  • Think back to other bijections you’ve already seen. For instance, if you’re trying to find a bijection between a class of restricted permutations and a family of matrices, then can you reuse any ideas from our argument in the last example?

  • Try to decompose a complicated class into simpler parts. For instance, in a class whose objects are pairs then can you think about out where to send the first object in a pair and the second object in a pair independently?

A multiset of size nn with tt types is a sequence (m1,,mt)Nt(m_1,\dots,m_t) \in \mathbb{N}^t with m1++mt=n.m_1 + \cdots + m_t = n.
Exercise: Prove that the set of multisets of size nn with tt types is in bijection with the set of sets containing nn elements from {1,,t}\{1,\dots,t\} where elements are allowed to repeat.
Lemma 2: If n,tNn,t \in \mathbb{N} with t1t \geq 1 then there are (n+t1t1) \binom{n+t-1}{t-1} multisets of size nn with tt types.
Proof Sketch of Lemma 2 (‘Balls and Lines Bijection’)

Let MM be the set of solutions to m1++mt=nm_1 + \cdots + m_t = n with (m1,,mt)Nt.(m_1,\dots,m_t) \in \mathbb{N}^t. The idea is to picture dividing nn balls into tt bins by drawing t1t-1 lines between nn balls to denote the boundaries between the mj.m_j.

Figure: Representing the multiset 1+3+21 + 3 + 2 of size 6 with 3 types by a sequence of 6 balls and 2 lines.

Formally, let II be the tuples of length n+t1n+t-1 defined by nn identical balls and t1t-1 identical lines. Then I=(n+t1)!n!(t1)!=(n+t1t1)|I| = \frac{(n+t-1)!}{n!(t-1)!} = \binom{n+t-1}{t-1} since there are (n+t1)!(n+t-1)! ways to create such a tuple when all elements are distinct, dividing out by n!n! accounts for the fact that all nn balls are identical, and dividing out by (t1)!(t-1)! accounts for the fact that the t1t-1 lines are identical.

Let f:MIf:M \rightarrow I be the map that takes (m1,,mt)M(m_1,\dots,m_t) \in M and returns the tuple which, from left to right, contains m1m_1 balls followed by a line, then m2m_2 balls followed by a line, and so on until ending with mtm_t balls. Lemma 2 follows from the following exercise.

Exercise: Prove that ff is a bijection.

Combinatorial Proofs #

Bijections and counting methods allow us to give combinatorial proofs of identities. To prove an identity of the form LHS=RHS \text{LHS} = \text{RHS} there are two common approaches.

  1. Find two sets AA and BB where it is easy to prove A=LHS|A| = \text{LHS} and B=RHS|B| = \text{RHS} and then prove that AA and BB are in bijection.

  2. Count a single set AA in two different ways, one of which shows that A=LHS|A| = \text{LHS} and the other of which shows that A=RHS.|A| = \text{RHS}.

Combinatorial proofs stand in contrast to algebraic proofs, because they show equality by giving meaning to both sides of an equation instead of performing symbolic manipulations. We now give two simple examples to illustrate the method. Both of our examples are fairly easy to prove algebraically, but the method of combinatorial proof can be very powerful (especially once one has build up a large “dictionary” of combinatorial objects and their counting sequences).

Example: Give a combinatorial proof that (nk)=(nnk) \binom{n}{k} = \binom{n}{n-k} for any n,kNn,k\in\mathbb{N} with 0kn.0 \leq k \leq n.

Click for Answer

Let AA be the set of subsets of [n]={1,,n}[n]=\{1,\dots,n\} of size kk and BB be the set of subsets of [n][n] of size nk,n-k, so that A=(nk)|A| = \binom{n}{k} and B=(nnk).|B| = \binom{n}{n-k}.

Consider the function f:ABf:A\rightarrow B which takes an element SAS \in A to its complement f(S)=[n]S.f(S) = [n] \setminus S. Then ff takes a subset of [n][n] with kk elements to a subset with nkn-k elements, so it is well-defined. If g:BAg:B \rightarrow A is also the complement function g(T)=[n]Tg(T) = [n] \setminus T then g(f(S))=[n]([n]S)=Sg(f(S)) = [n] \setminus ([n] \setminus S) = S for all SAS \in A and, for the same reason, f(g(T))=Tf(g(T))=T for all TB.T \in B. Thus gg is the inverse of ff so ff is a bijection and (nk)=A=B=(nnk). \binom{n}{k} = |A| = |B| = \binom{n}{n-k} .

Example: Give a combinatorial proof that 2n=k=0n(nk). 2^n = \sum_{k=0}^n \binom{n}{k}.

Click for Answer
Let AA be the set of subsets of [n]={1,,n},[n]=\{1,\dots,n\}, so A=2n.|A|=2^n. We can express AA as the disjoint union A=A0AnA = A_0 \cup \cdots \cup A_n where AkA_k denotes the subsets of [n][n] with kk elements. Since Ak=(nk)|A_k| = \binom{n}{k} we see that 2n=A=k=0nAk=k=0n(nk). 2^n = |A| = \sum_{k=0}^n |A_k| = \sum_{k=0}^n \binom{n}{k}.

The key to giving combinatorial proofs of identities is to identify combinatorial objects that are counted by the terms on each side. Here are some sequences that appear frequently in identities, and some of the combinatorial objects that are counted by them.

Sequence Combinatorial Interpretation
n!n! permutations of size nn
2n2^n binary strings of length n,n, subsets of [n],[n], compositions of size n+1n+1
AnA^n strings of length nn on alphabet of size AA
(nk)\binom{n}{k} subsets of [n][n] with kk elements
(a+ba)\binom{a+b}{a} lattice paths from (0,0)(0,0) to (a,b)(a,b) using (1,0)(1,0) and (0,1)(0,1)
(n+t1t1)\binom{n+t-1}{t-1} multisets of size nn with tt types
a×ba \times b Cartesian product of sets with sizes aa and bb
a+ba + b union of disjoint sets with sizes aa and bb

The online encyclopedia of integer sequence is a key tool that allows users to look up the combinatorial objects counted by the initial terms of a sequence. If you have an unknown sequence, plug some of the terms into the oeis, find another objected counted by the sequence, and then try to find a bijection. (If you find one, then update the entry on the oeis with your new information!)

Additional Problems #

Problem 1. For any positive n1n\geq 1 give a bijection between the set of subsets of {1,2,,n}\{1,2,…,n\} with an even number of elements and the set of subsets of {1,2,,n}\{1,2,…,n\} with an odd number of elements. Conclude that there are 2n12^{n-1} subsets of {1,2,,n}\{1,2,…,n\} with an even number of elements.

Problem 2. Let nNn\in\mathbb{N} be a natural number.

(a) Give a bijection between the sets S={(A,B):A,B[n] are disjoint}T={(P,Q):PQ[n]} \begin{aligned} \mathcal{S} &= \{ (A,B) : \text{A,B[n]A,B \subset [n] are disjoint} \} \\[+2mm] \mathcal{T} &= \{ (P,Q) : \text{PQ[n]P\subset Q \subset [n]} \} \end{aligned}

(b) Give a bijection between the sets S={(A,B,C):A,B,C[n] are pairwise disjoint}T={(P,Q,R):PQR[n]} \begin{aligned} \mathcal{S} &= \{ (A,B,C) : \text{A,B,C[n]A,B,C \subset [n] are pairwise disjoint} \} \\[+2mm] \mathcal{T} &= \{ (P,Q,R) : \text{PQR[n]P\subset Q \subset R \subset [n]} \} \end{aligned}

(c) Generalize parts (a) and (b).

Problem 3. Give combinatorial proofs of the following identities (all quantities are integers). (nk)=(n1k1)+(n1k)(1kn)k(nk)=n(n1k1)(1kn)(nk)=(n2k)+2(n2k1)+(n2k2)(2kn)(nm)(mk)=(nk)(nkmk)(0kmn)(nk)(nk)=(nk+)(k+)(0k+n)n2n1=i=0n(ni)i(1n)n(n1)2n2=i=0n(ni)i(i1)(2n)(2nn)=i=0n(ni)2(0n)(n+1k+1)=i=kn(ik)(0n,k)(n+k+1k)=i=0k(n+ii)(0n,k)(n+t1t1)=k=1t(tk)(n1k1)(0n,t)(2nn)=2i=0n1(n1+ii)(1n)3n=k=0n(nk)2k(0n)4n=k=0n(nk)3nk(0n)4n=k=0n(2kk)(2n2knk)(0n)(nk)2=i=0k(ni)(niki)(nkki)(0kn)(p+qn)=k=0n(pk)(bnk)(0n)(2a+b+1b)=i=0b(a+ii)(a+bibi)(0a,b)(3n+1n)=i=0n(n+ii)(2nini)(0n)(4n+22n+1)=2i=0n(n+in)(3n+1in)(1n) {\small \begin{aligned} \binom{n}{k} &= \binom{n-1}{k-1} + \binom{n-1}{k} && (1 \leq k \leq n) \\[+5mm] k\binom{n}{k} &= n\binom{n-1}{k-1} && (1 \leq k \leq n) \\[+5mm] \binom{n}{k} &= \binom{n-2}{k} + 2\binom{n-2}{k-1} + \binom{n-2}{k-2} && (2 \leq k \leq n) \\[+5mm] \binom{n}{m}\binom{m}{k} &= \binom{n}{k}\binom{n-k}{m-k} && (0 \leq k \leq m \leq n) \\[+5mm] \binom{n}{k}\binom{n-k}{\ell} &= \binom{n}{k+\ell}\binom{k+\ell}{\ell} && (0 \leq k+\ell \leq n) \\[+5mm] n2^{n-1} &= \sum_{i=0}^n \binom{n}{i}i && (1 \leq n) \\[+5mm] n(n-1)2^{n-2} &= \sum_{i=0}^n \binom{n}{i}i(i-1) && (2 \leq n) \\[+5mm] \binom{2n}{n} &= \sum_{i=0}^n \binom{n}{i}^2 && (0 \leq n) \\[+5mm] \binom{n+1}{k+1} &= \sum_{i=k}^n \binom{i}{k} && (0 \leq n,k) \\[+5mm] \binom{n+k+1}{k} &= \sum_{i=0}^k \binom{n+i}{i} && (0 \leq n,k) \\[+5mm] \binom{n+t-1}{t-1} &= \sum_{k=1}^t \binom{t}{k}\binom{n-1}{k-1} && (0 \leq n,t) \\[+5mm] \binom{2n}{n} &= 2\sum_{i=0}^{n-1} \binom{n-1+i}{i} && (1 \leq n) \\[+5mm] 3^{n} &= \sum_{k=0}^n \binom{n}{k}2^k && (0 \leq n) \\[+5mm] 4^{n} &= \sum_{k=0}^n \binom{n}{k}3^{n-k} && (0 \leq n) \\[+5mm] 4^{n} &= \sum_{k=0}^n \binom{2k}{k}\binom{2n-2k}{n-k} && (0 \leq n) \\[+5mm] \binom{n}{k}^2 &= \sum_{i=0}^k \binom{n}{i}\binom{n-i}{k-i}\binom{n-k}{k-i} && (0 \leq k \leq n)\\[+5mm] \binom{p+q}{n} &= \sum_{k=0}^n \binom{p}{k}\binom{b}{n-k} && (0 \leq n)\\[+5mm] \binom{2a+b+1}{b} &= \sum_{i=0}^b \binom{a+i}{i}\binom{a+b-i}{b-i} && (0 \leq a,b) \\[+5mm] \binom{3n+1}{n} &= \sum_{i=0}^n \binom{n+i}{i}\binom{2n-i}{n-i} && (0 \leq n) \\[+5mm] \binom{4n+2}{2n+1} &= 2\sum_{i=0}^n \binom{n+i}{n}\binom{3n+1-i}{n} && (1 \leq n) \\[+5mm] \end{aligned} }

Problem 4. A Dyck path of size nn is a sequence of nn upsteps (1,1)(1,1) and nn downsteps (1,1)(1,-1) that start at the origin and never go below the xx-axis. A return in a Dyck path is a maximal sequence of consecutive downsteps that ends on the xx-axis.

Figure: A Dyck path of size 6 that has returns of length two and length three (highlighted).

Prove that the set of Dyck paths of size n1n-1 is in bijection with the set of Dyck paths of size nn that have no returns with an even number of steps.

Problem 5. Fix a positive integer kk and define S={(a1,,ak)Nk:aj is divisible by j for all j}T={(b1,,bk)Nk:b1b2bk}. \begin{aligned} \mathcal{S} &= \{(a_1, \dots, a_k) \in \mathbb{N}^k : a_j \text{ is divisible by } j \text{ for all j} \} \\[+2mm] \mathcal{T} &=\{(b_1, \dots, b_k) \in \mathbb{N}^k : b_1 \geq b_2 \geq \cdots \geq b_k \}. \end{aligned} Give a bijection f:STf:\mathcal{S}\rightarrow\mathcal{T} such that f((a1,,ak))=(b1,,bk)f((a_1, \dots, a_k)) = (b_1, \dots, b_k) implies a1++ak=b1++bka_1 + \dots + a_k = b_1 + \dots + b_k.

Click here for the next chapter