Week 8 at a glance

Textbook reading: Chapter 4, Section 5.3

Before Monday, “An undecidable language", Sipser pages 207-209.

Before Wednesday, Definition 5.20 and figure 5.21 (page 236) of mapping reduction.

Before Friday, Example 5.24 (page 236).

For Week 9 Monday: Example 5.26 (page 237).

We will be learning and practicing to:

Clearly and unambiguously communicate computational ideas using appropriate formalism. Translate across levels of abstraction.
- Give examples of sets that are regular, context-free, decidable, or recognizable (and prove that they are).
  - Define and explain the definitions of the computational problem $A_{TM}$
  - Define and explain the definitions of the computational problem $HALT_{TM}$
Know, select and apply appropriate computing knowledge and problem-solving techniques. Reason about computation and systems.
- Use diagonalization to prove that there are ’hard’ languages relative to certain models of computation.
  - Trace the argument that proves $A_{TM}$ is undecidable and explain why it works.
- Use mapping reduction to deduce the complexity of a language by comparing to the complexity of another.
  - Define computable functions, and use them to give mapping reductions between computational problems
  - Build and analyze mapping reductions between computational problems
  - Deduce the decidability or undecidability of a computational problem given mapping reductions between it and other computational problems, or explain when this is not possible.
- Classify the computational complexity of a set of strings by determining whether it is regular, context-free, decidable, or recognizable.
  - State, prove, and use theorems relating decidability, recognizability, and co-recognizability.
  - Prove that a language is decidable or recognizable by defining and analyzing a Turing machines with appropriate properties.

TODO:

Review Quiz 7 on PrairieLearn (http://us.prairielearn.com), due 2/26/2025

Homework 5 submitted via Gradescope (https://www.gradescope.com/), due 2/27/2025

Review Quiz 8 on PrairieLearn (http://us.prairielearn.com), due 3/5/2025

Monday: $A_{TM}$ is recognizable but undecidable

Acceptance problem
for Turing machines	$A_{TM}$	$\{ \langle M,w \rangle \mid \text{$M$ is a Turing machine that accepts input string $w$}\}$
Language emptiness testing
for Turing machines	$E_{TM}$	$\{ \langle M \rangle \mid \text{$M$ is a Turing machine and $L(M) = \emptyset$\}}$
Language equality testing
for Turing machines	$EQ_{TM}$	$\{ \langle M_1, M_2 \rangle \mid \text{$M_1$ and $M_2$ are Turing machines and $L(M_1) =L(M_2)$\}}$

$M_1$	$M_2$

Example strings in $A_{TM}$

Example strings in $E_{TM}$

Example strings in $EQ_{TM}$

Theorem: $A_{TM}$ is Turing-recognizable.

Strategy: To prove this theorem, we need to define a Turing machine $R_{ATM}$ such that $L(R_{ATM}) = A_{TM}$.

Define $R_{ATM} =$ “

Proof of correctness:

We will show that $A_{TM}$ is undecidable. First, let’s explore what that means.

To prove that a computational problem is decidable, we find/ build a Turing machine that recognizes the language encoding the computational problem, and that is a decider.

How do we prove a specific problem is not decidable?

How would we even find such a computational problem?

Counting arguments for the existence of an undecidable language:

The set of all Turing machines is countably infinite.
Each recognizable language has at least one Turing machine that recognizes it (by definition), so there can be no more Turing-recognizable languages than there are Turing machines.
Since there are infinitely many Turing-recognizable languages (think of the singleton sets), there are countably infinitely many Turing-recognizable languages.
Such the set of Turing-decidable languages is an infinite subset of the set of Turing-recognizable languages, the set of Turing-decidable languages is also countably infinite.

Since there are uncountably many languages (because $\mathcal{P}(\Sigma^*)$ is uncountable), there are uncountably many unrecognizable languages and there are uncountably many undecidable languages.

Thus, there’s at least one undecidable language!

What’s a specific example of a language that is unrecognizable or undecidable?

To prove that a language is undecidable, we need to prove that there is no Turing machine that decides it.

Key idea: proof by contradiction relying on self-referential disagreement.

Theorem: $A_{TM}$ is not Turing-decidable.

Proof: Suppose towards a contradiction that there is a Turing machine that decides $A_{TM}$. We call this presumed machine $M_{ATM}$.

By assumption, for every Turing machine $M$ and every string $w$

If $w \in L(M)$, then the computation of $M_{ATM}$ on $\langle M,w \rangle ~~ \underline{\phantom{\hspace{2.5in}}}$
If $w \notin L(M)$, then the computation of $M_{ATM}$ on $\langle M,w \rangle ~~ \underline{\phantom{\hspace{2.5in}}}$

Define a new Turing machine using the high-level description:

$D =$“ On input $\langle M \rangle$, where $M$ is a Turing machine:

Run $M_{ATM}$ on $\langle M, \langle M \rangle \rangle$.

If $M_{ATM}$ accepts, reject; if $M_{ATM}$ rejects, accept."

Is $D$ a Turing machine?

Is $D$ a decider?

What is the result of the computation of $D$ on $\langle D \rangle$?

Summarizing:

$A_{TM}$ is recognizable.
$A_{TM}$ is not decidable.

Recall definition: A language $L$ over an alphabet $\Sigma$ is called co-recognizable if its complement, defined as $\Sigma^* \setminus L = \{ x \in \Sigma^* \mid x \notin L \}$, is Turing-recognizable.

and Recall Theorem (Sipser Theorem 4.22): A language is Turing-decidable if and only if both it and its complement are Turing-recognizable.

$A_{TM}$ is recognizable.
$A_{TM}$ is not decidable.
$\overline{A_{TM}}$ is not recognizable.
$\overline{A_{TM}}$ is not decidable.

Wednesday: Computable functions and mapping reduction

Mapping reduction

Motivation: Proving that $A_{TM}$ is undecidable was hard. How can we leverage that work? Can we relate the decidability / undecidability of one problem to another?

If problem $X$ is no harder than problem $Y$

…and if $Y$ is easy,

…then $X$ must be easy too.

If problem $X$ is no harder than problem $Y$

…and if $X$ is hard,

…then $Y$ must be hard too.

“Problem $X$ is no harder than problem $Y$” means “Can answer questions about membership in $X$ by converting them to questions about membership in $Y$”.

Definition: For any languages $A$ and $B$, $A$ is mapping reducible to $B$ means there is a computable function $f : \Sigma^* \to \Sigma^*$ such that for all strings $x$ in $\Sigma^*$, \[x \in A \qquad \qquad \text{if and only if} \qquad \qquad f(x) \in B.\] Notation: when $A$ is mapping reducible to $B$, we write $A \leq_m B$.

Intuition: $A \leq_m B$ means $A$ is no harder than $B$, i.e. that the level of difficulty of $A$ is less than or equal the level of difficulty of $B$.

TODO

What is a computable function?
How do mapping reductions help establish the computational difficulty of languages?

Computable functions

Definition: A function $f: \Sigma^* \to \Sigma^*$ is a computable function means there is some Turing machine such that, for each $x$, on input $x$ the Turing machine halts with exactly $f(x)$ followed by all blanks on the tape

Examples of computable functions:

The function that maps a string to a string which is one character longer and whose value, when interpreted as a fixed-width binary representation of a nonnegative integer is twice the value of the input string (when interpreted as a fixed-width binary representation of a non-negative integer) \[f_1: \Sigma^* \to \Sigma^* \qquad f_1(x) = x0\]

To prove $f_1$ is computable function, we define a Turing machine computing it.

High-level description

“On input $w$

1. Append $0$ to $w$.

2. Halt.”

Implementation-level description

“On input $w$

1. Sweep read-write head to the right until find first blank cell.

2. Write 0.

3. Halt.”

Formal definition $(\{q0, qacc, qrej\}, \{0,1\}, \{0,1,\textvisiblespace\},\delta, q0, qacc, qrej)$ where $\delta$ is specified by the state diagram:

The function that maps a string to the result of repeating the string twice. \[f_2: \Sigma^* \to \Sigma^* \qquad f_2( x ) = xx\]

The function that maps strings that are not the codes of NFAs to the empty string and that maps strings that code NFAs to the code of a DFA that recognizes the language recognized by the NFA produced by the macro-state construction from Chapter 1.

The function that maps strings that are not the codes of Turing machines to the empty string and that maps strings that code Turing machines to the code of the related Turing machine that acts like the Turing machine coded by the input, except that if this Turing machine coded by the input tries to reject, the new machine will go into a loop. \[f_4: \Sigma^* \to \Sigma^* \qquad f_4( x ) = \begin{cases} \varepsilon \qquad&\text{if $x$ is not the code of a TM} \\ \langle (Q \cup \{q_{trap} \}, \Sigma, \Gamma, \delta', q_0, q_{acc}, q_{rej} ) \rangle \qquad&\text{if $x = \langle (Q, \Sigma, \Gamma, \delta, q_0, q_{acc}, q_{rej} )\rangle$}\end{cases}\] where $q_{trap} \notin Q$ and \[\delta'( (q,x) ) = \begin{cases} (r,y,d) &\text{if $q \in Q$, $x \in \Gamma$, $\delta ((q,x)) = (r,y,d)$, and $r \neq q_{rej}$} \\ (q_{trap}, \textvisiblespace, R) & \text{otherwise} \end{cases}\]

Definition: $A$ is mapping reducible to $B$, $A \leq_m B$m means there is a computable function $f : \Sigma^* \to \Sigma^*$ such that for all strings $x$ in $\Sigma^*$, \[x \in A \qquad \qquad \text{if and only if} \qquad \qquad f(x) \in B.\] In this case, we say the function $f$ witnesses that $A$ is mapping reducible to $B$.

Making intutition precise …

Theorem (Sipser 5.22): If $A \leq_m B$ and $B$ is decidable, then $A$ is decidable.

Theorem (Sipser 5.23): If $A \leq_m B$ and $A$ is undecidable, then $B$ is undecidable.

Friday: The Halting problem

Recall definition: $A$ is mapping reducible to $B$ means there is a computable function $f : \Sigma^* \to \Sigma^*$ such that for all strings $x$ in $\Sigma^*$, \[x \in A \qquad \qquad \text{if and only if} \qquad \qquad f(x) \in B.\] Notation: when $A$ is mapping reducible to $B$, we write $A \leq_m B$.

Intuition: $A \leq_m B$ means $A$ is no harder than $B$, i.e. that the level of difficulty of $A$ is less than or equal the level of difficulty of $B$.

Example: $A_{TM} \leq_m A_{TM}$ $A_{TM} = \{ \langle M, w \rangle \mid M \text{ is a TM and } w \text{ is a string and } w \in L(M) \}$

To prove, need a witnessing function $f: \Sigma^* \to \Sigma^*$ that is (1) computable and (2) for each $x \in \Sigma^*$, $x \in A_{TM}$ iff $f(x) \in A_{TM}$

Corollary: For any language $L$, $L \leq_m L$, as witnessed by

Example: $A_{DFA} \leq_m \{ ww \mid w \in \{0,1\}^* \}$

To prove, need a witnessing function $f: \Sigma^* \to \Sigma^*$ that is (1) computable and (2) for each $x \in \Sigma^*$, $x \in A_{DFA}$ iff $f(x) \in \{ ww \mid w \in \{0,1\}^* \}$

Corollary: For any language decidable language $X$ and any set $Y$ with at least one string string in $Y$ and at least one string not in $Y$, $X \leq_m Y$, as witnessed by

Next: consider mapping reductions between potentially undecidable languages.

Halting problem \[HALT_{TM} = \{ \langle M, w \rangle \mid \text{$M$ is a Turing machine, $w$ is a string, and $M$ halts on $w$} \}\]

We know $A_{TM}$ is undecidable. If we could prove that $A_{TM} \leq_m HALT_{TM}$ then we could conclude that $HALT_{TM}$ is undecidable too.

Could we adapt our approach from before by tweaking the identity map?

Define $F: \Sigma^* \to \Sigma^*$ by \[F(x) = \begin{cases} const_{out} \qquad &\text{if $x \neq \langle M,w \rangle$ for any Turing machine $M$ and string $w$ over the alphabet of $M$} \\ \langle M'_x, w \rangle \qquad & \text{if $x = \langle M, w \rangle$ for some Turing machine $M$ and string $w$ over the alphabet of $M$.} \end{cases}\] where $const_{out} = \langle %\includegraphics[width=1.5in]{../../resources/machines/Lect22TM1.png} \begin{tikzpicture}[->,>=stealth',shorten >=1pt, auto, node distance=2cm, semithick] \tikzstyle{every state}=[text=black, fill=none] \node[initial,state] (q0) {$q0$}; \node[state,accepting] (qacc) [right of=q0, xshift=20pt] {$q_{acc}$}; \path (q0) edge [loop above] node {\parbox{1cm}{$0; \square, R$\newline$1; \square, R$\newline $\square; \square, R$}} (q0) ; \end{tikzpicture}, \varepsilon \rangle$ and $M'_x$ is a Turing machine that computes like $M$ except, if the computation of $M$ ever were to go to a reject state, $M'_x$ loops instead.

$F( \langle \begin{tikzpicture}[->,>=stealth',shorten >=1pt, auto, node distance=2cm, semithick] \tikzstyle{every state}=[text=black, fill=none] \node[initial,state] (q0) {$q0$}; \node[state] (q1) [right of=q0, xshift=80pt] {$q1$}; \node[state,accepting] (qacc) [above of=q0] {$q_{acc}$}; \path (q0) edge [bend left=20] node {\parbox{1cm}{$0; 0,R$\newline $1; 1, R$}} (q1) (q1) edge [bend left=20] node {\parbox{1cm}{$0; 0,R$\newline $1; 1, R$}} (q0) (q0) edge [bend left=0] node {$\square; \square, R$} (qacc) ; \end{tikzpicture}, \varepsilon \rangle)$ =

To use this function to prove that $A_{TM} \leq_m HALT_{TM}$, we need two claims:

Claim (1): $F$ is computable

Claim (2): for every $x$, $x \in A_{TM}$ iff $F(x) \in HALT_{TM}$.

Acceptance problem
for Turing machines	\(A_{TM}\)	\(\{ \langle M,w \rangle \mid \text{$M$ is a Turing machine that accepts input string $w$}\}\)
Language emptiness testing
for Turing machines	\(E_{TM}\)	\(\{ \langle M \rangle \mid \text{$M$ is a Turing machine and $L(M) = \emptyset$\}}\)
Language equality testing
for Turing machines	\(EQ_{TM}\)	\(\{ \langle M_1, M_2 \rangle \mid \text{$M_1$ and $M_2$ are Turing machines and $L(M_1) =L(M_2)$\}}\)