CS30 - Spring 2016 - Class 5
Example code in this lecture
while.py
scores-lists.py
Lecture notes
admin
- assignment 2 out
isprime
- recall that last time we wrote a function called isprime (see
while.py code
)
- take as input a number and returns a boolean (True or False) whether or not the number is a prime
>>> isprime(5)
True
>>> isprime(6)
False
>>> isprime(100)
False
>>> isprime(101)
True
- how could we use this to print out the first 10 (100, 1000, etc) prime numbers?
- like to do some sort of loop
- will a for loop work?
- we don't know when we're going to stop
- we'd like to keep a count of how many we've seen and only stop when we've reached the number we want
while loop
- another way to do repetition
while <bool expression>:
statement1
statement2
...
statement3
as long as the <bool expression> evaluates to True, it continues to repeat the statements, when it becomes False, it then continues on and executes statement3, etc.
- specifically:
evaluates the boolean expression
- if it's False
- it "exits" the loop and goes on to statement3 and continues there
- if it's True
- executes statement1, statement2, ... (all statements inside the "block" of the loop, just like a for loop)
- go back to beginning and repeat
- how could we use a while loop for our prime numbers problem?
- keep a count of how many primes we've found (initially starts at 0)
- start count from 1 and work our way up
- check each number
- if it's prime
- print it out
- increment the counter of how many primes we've found
- keep repeating this as long as (while) the number of primes we've printed is less than the number we want
can you emulate a for loop with a while loop?
- yes!
for i in range(10):
...
is equivalent to writing:
i = 0
while i < 10:
...
i = i + 1
look at firstprimes function in
while.py code
- current += 1 every time through the loop we increment the number we're examining
- if that current number happens to be prime, we increment count
- the loop continues "while" count < num, that is as long as the number we've found is less than the number we're looking for
infinite loops
- what would the following code do?
while True:
print "hello"
- will never stop
- in this case you should see some output
- sometimes, it will look like the program just froze if you're not actually printing anything out
- you can stop this by selecting "reset shell"
- be careful about these with your program. They're called an infinite loop.
- if you think you might have an infinite loop
- put in some print statements to debug
- think about when the boolean expression will become False and make sure that is going to happen in your loop
run
scores-lists.py code
- First, prompts the user to enter a list of scores one at a time
- how is this done?
- while loop
- what is the exit condition?
- checks to see if the line is empty
while line != ""
- then, calculate various statistics based on what was entered
- how are we calculating these statistics?
- average?
- could keep track of the sum and the number of things entered
- divide at the end
- max?
- keep track of the largest seen so far
- each time a new one is entered, see if it's larger, if so, update the largest
- min?
- same thing
- median?
- the challenge with median is that we can't calculate it until we have all of the scores
- need to sort them and then find the middle score
- why can't we do this using int/float variables?
- we don't know how many scores are going to be entered
- even if we did, if we had 100 students in the class, we'd need 100 variables!
lists
- lists are a data structure in Python
- what is a data structure?
- a way of storing and organizing data
- lists allow us to store multiple values with a single variable
creating lists: we can create a new list using square brackets
>>> [7, 4, 3, 6, 1, 2]
[7, 4, 3, 6, 1, 2]
>>> 10 # not a list
10
>>> [10]
[10]
>>> l = [7, 4, 3, 6, 1, 2]
>>> l
[7, 4, 3, 6, 1, 2]
>>> type(l)
<type 'list'>
lists are a type and represent a value, just like floats, ints, bools and strings. We can assign them to variables, print them, etc.
- what do you think [] represents?
- empty list
>>> []
[]
accessing lists
- we can get at particular values in the list by using the [] to "index" into the list
>>> l = [7, 4, 3, 6, 1, 2]
>>> l[3]
6
notice that indexing starts counting at 0, not at 1!
>>> l[0]
7
- What do you think l[20] will give us?
>>> l[20]
Traceback (most recent call last):
File "<string>", line 1, in <fragment>
IndexError: list index out of range
we can only index from 0 up to the length of the list minus 1
- What do you think l[-1] will give us?
>>> l[-1]
2
if the index is negative it counts back from the end of the list
- notice that the type thing in the list is as you'd expect:
>>> type(l[3])
<type 'int'>
storing other things in lists
- draw the list representation
- a list is a contiguous set of spaces in memory
- we can store anything in each of these spaces
>>> ["this", "is", "a", "list", "of", "strings"]
['this', 'is', 'a', 'list', 'of', 'strings']
>>> list_of_strings = ["this", "is", "a", "list", "of", "strings"]
>>> list_of_strings[0]
'this'
>>> list_of_strings[1:4]
['is', 'a', 'list']
>>> [1, 5.0, "my string"]
[1, 5.0, 'my string']
>>> l = [1, 5.0, "my string"]
>>> type(l[0])
<type 'int'>
>>> type(l[1])
<type 'float'>
>>> type(l[2])
<type 'str'>
- In general, it's a good idea to have lists be homogeneous, i.e. be of the same type
slicing
- sometimes we want more than just one item from the list (this is called "slicing")
- We can specify a range in the square brackets, [], using the colon (:)
>>> l = ["this", "is", "a", "list", "of", "strings"]
>>> l[0:3]
['this', 'is', 'a']
>>> l[1:5]
['is', 'a', 'list', 'of']
>>> l[1:1]
[]
>>> l[-3:-1]
['list', 'of']
- generates a *new* list
- that includes the items from the list starting at the first number and up to, but not including, the second number
range function
- we've used the range function in loops to iterate over numbers, e.g.
for i in range(10):
# do something
- what does the range function actually do?
>>> range(10)
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
>>> range(15,20)
[15, 16, 17, 18, 19]
>>> range(-5, 0)
[-5, -4, -3, -2, -1]
looping over lists
- all of our for loops so far have actually just been looping over lists
for i in range(10):
# do something
is really the same as:
some_list = range(10)
for i in some_list:
# do something
- the way to read this list is:
- for each element in the list, do something
- for each iteration of the loop, i (or whatever variable you put there) will get the next value in the list
>>> my_list = [4, 1, 8, 10, 11]
>>> for i in my_list:
... print i
...
4
1
8
10
11
back to our stats program... how could we write average given what we know so far, that is a function that takes a list as a parameter and calculates the average?
- look at the inelegant_average function in
scores-lists.py code
- loop over each of the elements in the list
- accumulate a sum
- accumulate a count
- divide the sum by the count
- look at the average function in
scores-lists.py code
built-in functions over lists: there are also some built-in functions that take a list as a parameter
- we can get the length of a list
>>> len(l)
3
>>> len([1, 2, 3, 4, 5])
5
>>> len([])
0
- max
>>> l = [5, 3, 2, 1, 10]
>>> max(l)
10
- min
>>> min(l)
1
- sum
>>> sum(l)
21
lists are objects and therefore have methods. Any guesses?
- append: add a value on to the end of the list
>>> my_list = [15, 2, 1, 20, 5]
>>> my_list.append(100)
>>> my_list
[15, 2, 1, 20, 5, 100]
- notice that append does NOT return a new list, it modifies the existing list!
- try some out on your own:
http://docs.python.org/tutorial/datastructures.html
- pop: remove a value off of the end of the list and return it
>>> my_list.pop()
100
>>> my_list
[15, 2, 1, 20, 5]
- notice that it both modifies the list and returns a value
- if you want to use this value, you need to store it!
>>> x = my_list.pop()
>>> x
5
- pop also has another version where you can specify the index
>>> my_list = [15, 2, 1, 20, 5]
>>> my_list.pop(2)
1
>>> my_list
[15, 2, 20, 5]
- insert: inserts a value at a particular index
>>> my_list = [15, 2, 1, 20, 5]
>>> my_list.insert(2, 100)
>>> my_list
[15, 2, 100, 1, 20, 5]
- again, lists are mutable, so insert does not return a new list, but modifies the underlying one
- sort
>>> my_list = [15, 2, 1, 20, 5]
>>> my_list.sort()
>>> my_list
[1, 2, 5, 15, 20]
>>> my_list = ["these", "are", "some", "words", "to", "sort"]
>>> ["these", "are", "some", "words", "to", "sort"].sort()
>>> my_list = ["these", "are", "some", "words", "to", "sort"]
>>> my_list.sort()
>>> my_list
['are', 'some', 'sort', 'these', 'to', 'words']
back to our grades program: look at
scores-lists.py code
- there is a function called get_scores. That gets the scores and returns them as a list. How?
- starts with an empty list
- uses append to add them on to the end of the list
- returns the list when the loop finishes
- average function
- has a single parameter, but this parameter will represent a list
- inelegant_average
- calculates the sum and divides by the number of entries
- uses a for loop to iterate over the values
- often, we'll use something besides "i" as a variable name that makes our program more readable
- is there a better way to do this?
- look at fancy_average
- us the sum function over lists
- median function
- sorts the values
- notice again that sort does NOT return a value, but sorts the list that it is called on
- returns the middle entry
lists are mutable
- what does that mean?
- we can change (or mutate) the values in a list
- notice that many of the methods that we call on lists change the list itself
- we can mutate lists with methods, but we can also change particular indices
>>> my_list = [15, 2, 1, 20, 5]
>>> my_list
[15, 2, 1, 20, 5]
>>> my_list[2] = 100
>>> my_list
[15, 2, 100, 20, 5]
sequences
- lists are part of a general category of data structures called sequences that represent a sequence of things
- *all* sequences support a number of shared behavior
- the ability to index using []
- the ability to slice using [:]
- a number of built-in functions:
- len
- max
- min
- the ability to iterate over in with a for loop
- We've actually seen one other sequence?
- strings!
strings as sequences
- notice that we can do all the sequence-like things with strings
>>> s = "banana"
>>> s[4]
'n'
>>> s[2:5]
'nan'
>>> len(s)
6
>>> for letter in s:
... print letter
b
a
n
a
n
a
- strings, however, are immutable
>>> s[4] = "c"
Traceback (most recent call last):
File "<string>", line 1, in <fragment>
TypeError: 'str' object does not support item assignment
- no matter how hard you try, you cannot mutate a string