Week1

We will work together to build an environment in CSE 105 that supports your learning in a way that respects your perspectives, experiences, and identities (including race, ethnicity, heritage, gender, sex, class, sexuality, religion, ability, age, educational background, etc.). Our goal is for you to engage with interesting and challenging concepts and feel comfortable exploring, asking questions, and thriving.
If you are skipping and stretching meals, or having difficulties affording or accessing food, you may be eligible for CalFresh, California’s Supplemental Nutrition Assistance Program, that can provide up to $292 a month in free money on a debit card to buy food. Students can apply at benefitscal.com/r/ucsandiegocalfresh. The Hub Basic Needs Center empowers all students by connecting them to resources for food, stable housing and financial literacy. Visit their site at basicneeds.ucsd.edu.

Financial aid resources, the possibility of emergency grant funding, and off-campus housing referral resources are available: see your College Dean of Student Affairs.

If you find yourself in an uncomfortable situation, ask for help. We are committed to upholding University policies regarding nondiscrimination, sexual violence and sexual harassment. Here are some campus contacts that could provide this help: Counseling and Psychological Services (CAPS) at 858 534-3755 or http://caps.ucsd.edu; OPHD at 858 534-8298 or ophd@ucsd.edu , http://ophd.ucsd.edu; CARE at Sexual Assault Resource Center at 858 534-5793 or sarc@ucsd.edu , http://care.ucsd.edu.

Please reach out (minnes@ucsd.edu) if you need support with extenuating circumstances affecting CSE 105.

Introductions

CSE 105’s Big Questions

In this context, a problem is defined as: “Making a decision or computing a value based on some input"

In Computer Science, we operationalize “hardest” as “requires most resources”, where resources might be memory, time, parallelism, randomness, power, etc.

To be able to compare “hardness” of problems, we use a consistent description of problems

Output: Yes/ No, where Yes means that the input string matches the pattern or property described by the problem.

Weeks 0 and 1 at a glance

Textbook reading: Chapter 0, Sections 1.3, 1.1

Notice: we are jumping to Section 1.3 and then will come back to Section 1.1 next week.

Before Friday, read Definition 1.52 (definition of regular expressions) on page 64.

For Week 2 Monday: Figure 1.4 and Definition 1.5 (definition of finite automata) on pages 34-35.

Textbook references: Within a chapter, each item is numbered consecutively. Example 1.51 is the fifty-first numbered item in chapter one.

We will be learning and practicing to:

TODO:

#FinAid Assignment on Canvas (complete as soon as possible) and read syllabus on Canvas

Schedule your Test 1 Attempt 1, Test 2 Attempt 1, Test 1 Attempt 2, and Test 2 Attempt 2 times at PrairieTest (http://us.prairietest.com)

Review Quiz 1 on PrairieLearn (http://us.prairielearn.com), complete by 1/15/25

Create a homework group, possibly by using the Piazza (https://piazza.com/) find-a-teammate tool

Homework 1 submitted via Gradescope (https://www.gradescope.com/), due 1/16/25

Week 1 Monday: Terminology and Notation

The CSE 105 vocabulary and notation build on discrete math and introduction to proofs classes. Some of the conventions may be a bit different from what you saw before so we’ll draw your attention to them.

Term	Typical symbol	Meaning
	or Notation

Alphabet	$\Sigma$, $\Gamma$	A non-empty finite set
Symbol over $\Sigma$	$\sigma$, $b$, $x$	An element of the alphabet $\Sigma$
String over $\Sigma$	$u$, $v$, $w$	A finite list of symbols from $\Sigma$
(The) empty string	$\varepsilon$	The (only) string of length $0$
The set of all strings over $\Sigma$	$\Sigma^*$	The collection of all possible strings formed from symbols from $\Sigma$
(Some) language over $\Sigma$	$L$	(Some) set of strings over $\Sigma$
(The) empty language	$\emptyset$	The empty set, i.e. the set that has no strings (and no other elements either)


The power set of a set $X$	$\mathcal{P}(X)$	The set of all subsets of $X$
(The set of) natural numbers	$\mathcal{N}$	The set of positive integers
(Some) finite set		The empty set or a set whose distinct elements can be counted by a natural number
(Some) infinite set		A set that is not finite.


Reverse of a string $w$	$w^\mathcal{R}$	write $w$ in the opposite order, if $w = w_1 \cdots w_n$ then $w^\mathcal{R} = w_n \cdots w_1$. Note: $\varepsilon^\mathcal{R} = \varepsilon$
Concatenating strings $x$ and $y$	$xy$	take $x = x_1 \cdots x_m$, $y=y_1 \cdots y_n$ and form $xy = x_1 \cdots x_m y_1 \cdots y_n$
String $z$ is a substring of string $w$		there are strings $u,v$ such that $w = uzv$
String $x$ is a prefix of string $y$		there is a string $z$ such that $y = xz$
String $x$ is a proper prefix of string $y$		$x$ is a prefix of $y$ and $x \neq y$


Shortlex order, also known as string order over alphabet $\Sigma$		Order strings over $\Sigma$ first by length and then according to the dictionary order, assuming symbols in $\Sigma$ have an ordering

A string over an alphabet $\Sigma$ is an element of $\Sigma^*$ OR a subset of $\Sigma^*$.

A language over an alphabet $\Sigma$ is an element of $\Sigma^*$ OR a subset of $\Sigma^*$.

With $\Sigma_1 = \{0,1\}$ and $\Sigma_2 = \{a,b,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z\}$ and $\Gamma = \{0,1,x,y,z\}$

True or False: $\varepsilon$ is a prefix of some string over $\Sigma_1$

True or False: There is a string over $\Sigma_1$ that is a proper prefix of $\varepsilon$

The first five strings over $\Sigma_1$ in string order, using the ordering $0 < 1$:

The first five strings over $\Sigma_2$ in string order, using the usual alphabetical ordering for single letters:

Week 1 Wednesday: Regular expressions

Our motivation in studying sets of strings is that they can be used to encode problems. To calibrate how difficult a problem is to solve, we describe how complicated the set of strings that encodes it is. How do we define sets of strings?

How would you describe the language that has all strings over $\{0,1\}$ as its elements?

**This definition was in the pre-class reading** Definition 1.52: A regular expression over alphabet $\Sigma$ is a syntactic expression that can describe a language over $\Sigma$. The collection of all regular expressions over $\Sigma$ is defined recursively:

The semantics (or meaning) of the syntactic regular expression is the language described by the regular expression. The function that assigns a language to a regular expression over $\Sigma$ is defined recursively, using familiar set operations:

The language described by the regular expression $\varepsilon$ is $L(\varepsilon) = \{ \varepsilon \}$

The language described by the regular expression $\emptyset$ is $L(\emptyset) = \emptyset$

The language described by the regular expression $1^* \circ 1$ is $L(1^* \circ 1) =$

The language described by the regular expression $(~(0 \cup 1) \circ (0 \cup 1) \circ (0 \cup 1) ~)^*$ is $L(~(~(0 \cup 1) \circ (0 \cup 1) \circ (0 \cup 1) ~)^*~) =$

Week 1 Friday: Regular expressions conventions

Review: Determine whether each statement below about regular expressions over the alphabet $\{a,b,c\}$ is true or false:


Assuming $\Sigma$ is the alphabet, we use the following conventions

$\Sigma$	regular expression describing language consisting of all strings of length $1$ over $\Sigma$
$*$ then $\circ$ then $\cup$	precedence order, unless parentheses are used to change it
$R_1R_2$	shorthand for $R_1 \circ R_2$ (concatenation symbol is implicit)
$R^+$	shorthand for $R^* \circ R$
$R^k$	shorthand for $R$ concatenated with itself $k$ times, where $k$ is a (specific) natural number

Caution: many programming languages that support regular expressions build in functionality that is more powerful than the “pure” definition of regular expressions given here.

Software tools and languages often have built-in support for regular expressions to describe patterns that we want to match (e.g. Excel/ Sheets, grep, Perl, python, Java, Ruby).

Under the hood, the first phase of compilers is to transform the strings we write in code to tokens (keywords, operators, identifiers, literals). Compilers use regular expressions to describe the sets of strings that can be used for each token type.

Next time: we’ll start to see how to build machines that decide whether strings match the pattern described by a regular expression.

For example: Which regular expression(s) below describe a language that includes the string $a$ as an element?

Page references are to the 3rd edition of Sipser’s Introduction to the Theory of Computation, available through various sources for approximately $30. You may be able to opt in to purchase a digital copy through Canvas. Copies of the book are also available for those who can’t access the book to borrow from the course instructor, while supplies last (minnes@ucsd.edu)↩︎

Term	Typical symbol	Meaning
	or Notation

Alphabet	\(\Sigma\), \(\Gamma\)	A non-empty finite set
Symbol over \(\Sigma\)	\(\sigma\), \(b\), \(x\)	An element of the alphabet \(\Sigma\)
String over \(\Sigma\)	\(u\), \(v\), \(w\)	A finite list of symbols from \(\Sigma\)
(The) empty string	\(\varepsilon\)	The (only) string of length \(0\)
The set of all strings over \(\Sigma\)	\(\Sigma^*\)	The collection of all possible strings formed from symbols from \(\Sigma\)
(Some) language over \(\Sigma\)	\(L\)	(Some) set of strings over \(\Sigma\)
(The) empty language	\(\emptyset\)	The empty set, i.e. the set that has no strings (and no other elements either)


The power set of a set \(X\)	\(\mathcal{P}(X)\)	The set of all subsets of \(X\)
(The set of) natural numbers	\(\mathcal{N}\)	The set of positive integers
(Some) finite set		The empty set or a set whose distinct elements can be counted by a natural number
(Some) infinite set		A set that is not finite.


Reverse of a string \(w\)	\(w^\mathcal{R}\)	write \(w\) in the opposite order, if \(w = w_1 \cdots w_n\) then \(w^\mathcal{R} = w_n \cdots w_1\). Note: \(\varepsilon^\mathcal{R} = \varepsilon\)
Concatenating strings \(x\) and \(y\)	\(xy\)	take \(x = x_1 \cdots x_m\), \(y=y_1 \cdots y_n\) and form \(xy = x_1 \cdots x_m y_1 \cdots y_n\)
String \(z\) is a substring of string \(w\)		there are strings \(u,v\) such that \(w = uzv\)
String \(x\) is a prefix of string \(y\)		there is a string \(z\) such that \(y = xz\)
String \(x\) is a proper prefix of string \(y\)		\(x\) is a prefix of \(y\) and \(x \neq y\)


Shortlex order, also known as string order over alphabet \(\Sigma\)		Order strings over \(\Sigma\) first by length and then according to the dictionary order, assuming symbols in \(\Sigma\) have an ordering


Assuming \(\Sigma\) is the alphabet, we use the following conventions

\(\Sigma\)	regular expression describing language consisting of all strings of length \(1\) over \(\Sigma\)
\(*\) then \(\circ\) then \(\cup\)	precedence order, unless parentheses are used to change it
\(R_1R_2\)	shorthand for \(R_1 \circ R_2\) (concatenation symbol is implicit)
\(R^+\)	shorthand for \(R^* \circ R\)
\(R^k\)	shorthand for \(R\) concatenated with itself \(k\) times, where \(k\) is a (specific) natural number

Let’s get started