CS 111: Introduction to Computer Science

Takehome exam. Due 9:50AM Monday, 8 March 2010.

This is an exam. You may not speak with anybody other than Jeff Ondich about its contents. You may, however, use your book, your notes, the books in the library, the Internet, and divine guidance (should any be offered to you). If you obtain information from some source other than your own brain, cite your source and use quotation marks where appropriate.

Hand your exam in on paper.

  1. (8 points) Sections 11.6.1 through 11.6.3 of your textbook include a discussion of an important Python structure called a dictionary. Your job for this problem is to read about dictionaries, and then write a Python program that does the following:

    • Opens the file cities.txt, which consists of lines of the form "City,State". (These cities happen to be the US cities that Wikipedia says have population greater than 100,000, but your program will work with any similarly formatted file.)
    • Reads the file one line at a time, keeping track of how many times each state name appears in the file. (This part uses a dictionary.)
    • Closes the file.
    • Prints the state names and counts in descending order of the number of times they appeared in the file. That is, the state with the largest number of cities with population > 100,000 will appear at the top of the list, while the one with the fewest will appear at the bottom (and some, such as the relatively sparsely populated North Dakota, will not appear at all).

    Hand in a copy of your program on paper, but also hand it in electronically in the usual way. Please call it dictionary.py.

  2. (8 points) I have a list of integers, and I want to know the distance between the two integers that are furthest apart. For example, if the list is [3, 5, -2, 8, 10, 6], then the distance I'm looking for is 12 (i.e. the distance between 10 and -2). Elmo suggests the following code, which does the job:

    
        def elmo(theList):
            N = len(theList)
            biggestDistance = 0
            for k in range(N):
                for j in range(N):
                    if theList[j] - theList[k] > biggestDistance:
                        biggestDistance = theList[j] - theList[k]
    
            return biggestDistance
        
    • When I used Elmo's code on an array of 10000 integers on my laptop, it took 19.5 seconds (I tried it several times--19.5 seconds every time). Approximately how long will Elmo's code take to run if N is 25000 (assuming I use the same computer)? Justify your answer. (It is not sufficient to tell me that you ran the code yourself and it took a certain amount of time. I want you to explain why it will take as long as you say it will.)
    • Zoe says she thinks Elmo's code is silly. Describe an algorithm that will run significantly faster than Elmo's, and explain why it's faster.
  3. (8 points) On the day I introduced Insertion Sort in class, somebody asked whether it could be performed recursively, which was, I think, an interesting question. In this problem, you will sorter.py to help investigate the different behaviors of iterative and recursive Insertion Sorts, with further comparison to Merge Sort.

    As we discussed in class, Insertion sort and Merge Sort are members of a class of sorting algorithms whose performance is usually measured by counting the number of list element comparisons that need to be done to complete the sorting of the list. That is, we count the number of times something like "a[j] < a[k]" gets executed during the run of the algorithm. When we count comparisons, we are counting the number of iterations of the inner loop, and by counting inner loop iterations, we are counting the most time-consuming portion of the algorithm.

    To see how this plays out, try running "python sorter.py insertion 10 verbose" or "python sorter.py merge 1000" to see a report of the sorting times and number of comparisons for a 10-element list or a 1000-element list. (The "verbose" causes the program to print out the list before and after sorting, to help you believe that sorting is actually taking place.)

    In what follows, you're going to count comparisons and measure running times for the two algorithms and several values of N.

    • Fill in the following chart.

      N Comp. count for Ins. Sort Running time for Ins. Sort Comp. count for Merge Sort Running time for Merge Sort
      3000
      6000
      9000
      12000

    • Can you use the comparison counts to predict the running times? If so, how? If not, why not?
    • When we analyzed Insertion Sort in class, we concluded that its running time is approximately proportional to N^2. Do the numbers in your chart support this conclusion?
    • What happens if you try to fill in a column of this chart for Recursive Insertion Sort? Explain why this problem occurs for Recursive Insertion Sort but not Merge Sort, even though they're both recursive algorithms.
  4. (2 points) I'm going to do a little relaxing over break. Could you recommend a book for me to read?
  5. (8 points) Steganography is the art of hiding messages in places where people will be unaware there is a message at all, let alone an encoded message. In this exercise, you will work with a form of steganography that involves hiding messages in digital images.

    The image ox-with-message.png is my usual Babe the Blue Ox photograph, with a slight modification. The blue values of the pixels in the leftmost column have been altered to hold a message consisting of a sequence of 8-bit ASCII characters. Each character is represented by 8 pixels, where an even blue value represents a 0 bit, and an odd blue value represents a 1 bit. For example, if the top 8 pixels in the left column of the photograph have blue values equal to 50, 51, 48, 48, 50, 52, 52, 51 (i.e. even, odd, even, even, even, even, even, odd), then the first character in the hidden message is the character whose binary ASCII representation is 01000001 (that is, the capital letter A).

    Your job is to write a program to extract the hidden message from the modified ox photograph. Write the message on your test paper, and submit your program as usual. Please call the program problem5.py.