Python辅导 | Artificial Intelligence Summer 2019 Homework 1: Search

PROGRAMMING

In this assignment you will create an agent to solve the N-puzzle game. You will implement and compare several search algorithms, and collect some statistics related to their performances. Visit ​mypuzzle.org/sliding​ for the game’s rules. Please read all sections carefully:

I. Introduction
II. Algorithm Review
III. What You Need To Submit
IV. What Your Program Outputs
V. Implementation and Testing
VI. Before You Finish

Note on Python 3​: We will only accept homeworks from now and on in python 3 (python 2 is being discontinued).

Note on skeleton code​: The skeleton code is provided to help you get started if you need it but you don’t have to use it. If you do, feel free to change the function parameters, and adapt it to your needs.

Note on FAQs​: There is also a FAQs document in addition to this spec. I. Introduction
The N-puzzle game consists of a board holding N = m^2 − 1 distinct movable tiles, plus one empty space. There is one tile for each number in the set {1, …, m^2 −1}. In this assignment, we will represent the blank s​ pace with the number 0 and focus on the m = 3 case (8-puzzle).

In this combinatorial search problem, the aim is to get from any initial board state to the configuration with all tiles arranged in ascending order ⟨0, 1,…, m^2 − 1⟩ — this is the goal state with​ the empty tile at top left and the rest of the tiles in ascending order.​ The search space is the set of all possible states reachable from the initial state. Each move consists of swapping the empty space with a component in one of the four directions {‘Up’, ‘Down’, ‘Left’, ‘Right’}. Give each move a cost of one. Thus, the total cost of a path will be equal to the number of moves made.

Recall from lecture that search begins by visiting the root node of the search tree, given by the initial state. Three main events occur when visiting a node:
– First, we remove a node from the frontier set.
– Second, we check if this node matches the goal state.

– If not, we then expand the node. To expand a node, we generate all of its immediate successors and add them to the frontier, if they (i) are not yet already in the frontier, and (ii) have not been visited yet.

This describes the life cycle of a visit, and is the basic order of operations for search agents in this assignment—(1) remove, (2) check, and (3) expand. We will implement the assignment algorithms as described here. Please refer to lecture notes for further details, and review the lecture pseudo-code before you begin.

IMPORTANT:​ You may encounter implementations that attempt to short-circuit this order by performing the goal-check on successor nodes immediately upon expansion of a parent node. For example, Russell & Norvig’s implementation of BFS does precisely this. Doing so may lead to edge-case gains in efficiency, but do not alter the general characteristics of complexity and optimality for each method. For simplicity and grading purposes in this assignment, do not make such modifications to algorithms learned in lecture.

Your job in this assignment is to write ​driver.py​, which solves any 8-puzzle board when given an arbitrary starting configuration. ​Note that we are only asking to implement the 8-puzzle, not the general case​. The program will be executed as follows:
$ python driver.py <method> <board>
The method argument will be one of the following. You must implement all three of them: bfs​ (Breadth-First Search)
dfs​ (Depth-First Search)
ast​ (A-Star Search)
The board argument will be a comma-separated list of integers containing no spaces. For example, to use the bread-first search strategy to solve the input board given by the starting configuration {0,8,7,6,5,4,3,2,1}, the program will be executed like so (with no spaces between commas):
$ python driver.py bfs 0,8,7,6,5,4,3,2,1

Your program will create and/or write to a file called ​output.txt​, containing the following statistics:
path_to_goal​: the sequence of moves taken to reach the goal
cost_of_path​: the number of moves taken to reach the goal
nodes_expanded​: the number of nodes that have been expanded
search_depth​: the depth within the search tree when the goal node is found max_search_depth​: the maximum depth of the search tree in the lifetime of the algorithm running_time​: the total running time of the search instance, reported in seconds

max_ram_usage​: the maximum RAM usage in the lifetime of the process as measured by the ​ru_maxrss​ attribute in the ​resource​ module, reported in megabytes
Example #1: Breadth-First Search
Suppose the program is executed for breadth-first search as follows:
$ python driver.py bfs 1,2,5,3,4,0,6,7,8

This should result in the solution path:
The output file will contain ​exactly​ the following lines:
path_to_goal: [‘Up’, ‘Left’, ‘Left’] cost_of_path: 3 nodes_expanded: 10 search_depth: 3 max_search_depth: 4 running_time: 0.00188088 max_ram_usage: 0.07812500
Example #2: Depth-First Search
.
Suppose the program is executed for depth-first search as follows:
$ python driver.py dfs 1,2,5,3,4,0,6,7,8
This should result in the solution path:
The output file will contain ​exactly​ the following lines:
path_to_goal: [‘Up’, ‘Left’, ‘Left’] cost_of_path: 3 nodes_expanded: 181437 search_depth: 3 max_search_depth: 66125 running_time: 5.01608433 max_ram_usage: 4.23940217 .
More test cases are provided in ​the FAQs​.
Note on Correctness

All variables​, except ​running_time​ and m​ ax_ram_usage​, have ​one
answer when running BFS and DFS. A* ​nodes_expanded​might vary depending on implementation details (e.g., how you break ties…). You’ll be fine as long as your algorithm follows all specifications listed in these instructions.

As ​running_time​ and ​max_ram_usage​ values vary greatly depending on your machine and implementation details, there is no “correct” value to look for. They are for you to monitor time and space complexity of your code, which we highly recommend. A good way to check the correctness of your program is to walk through small examples by hand, like the ones above.

For your first programming project, we are providing hints and explicit instructions. Before posting a question on the discussion board, make sure your question is not already answered here or in the FAQs.

1. Implementation

You will implement the following three algorithms as demonstrated in lecture. In particular:

● Breadth-First Search​. Use an explicit queue, as shown in lecture.
● Depth-First Search​. Use an explicit stack, as shown in lecture.
and only one ​correct
● A-Star Search​. Use a priority queue, as shown in lecture. For the choice of heuristic, use the ​Manhattan priority function​; that is, the sum of the distances of the tiles from their goal positions. Note that the blanks space is not considered an actual tile here.

2. Order of Visits

In this assignment, where an arbitrary choice must be made, we always ​visit​ child nodes in the “​UDLR​” order; that is, [‘Up’, ‘Down’, ‘Left’, ‘Right’] in that exact order. Specifically:

● Breadth-First Search​. Enqueue in UDLR order; de-queuing results in UDLR order.
● Depth-First Search​. Push onto the stack in reverse-UDLR order; popping off results in
UDLR order.
● A-Star Search​. Since you are using a priority queue, what happens with duplicate keys?
How do you ensure nodes are retrieved from the priority queue in the desired order? To handle ties, explore nodes by key value, then ​UDLR order, and then time put in queue.

3. Submission Test Cases

Run all three of your algorithms on the following test cases:
Test Case #1
python driver.py bfs 3,1,2,0,4,5,6,7,8 python driver.py dfs 3,1,2,0,4,5,6,7,8 python driver.py ast 3,1,2,0,4,5,6,7,8
Test Case #2
python driver.py bfs 1,2,5,3,4,0,6,7,8 python driver.py dfs 1,2,5,3,4,0,6,7,8 python driver.py ast 1,2,5,3,4,0,6,7,8
Make sure your code passes at least these test cases and follows our formatting exactly​. The results of each test are assessed by 8 items: 7 are listed in ​Section IV. What Your Program Outputs​. The last point is for code that executes and produces any output at all. Each item is worth 0.75 point.

4. Grading and Stress Tests

We will grade your project by running ​additional test cases​ on your code. In particular, there will be five test cases in total, each tested on all three of your algorithms, for a total of ​15​ distinct tests. Similar to the submission test cases, each test will be graded by 8 items, for a total of 90 points. ​Plus, we give 10 points for code completing all 15 test cases within 10 minutes​. If you implement your code with reasonable designs of data structures, your code will solve all 15 test cases within a minute in total. We will be using a wide variety of inputs to stress-test your algorithms to check for correctness of implementation. So, we recommend that you test your own code extensively.

Don’t worry about checking for malformed input boards, including boards of non-square dimensions, other sizes, or unsolvable boards.
You will not be graded on the absolute values of your running time or RAM usage statistics.​ The values of these statistics can vary widely depending on the machine. ​However, we recommend that you take advantage of them in testing your code.​ Try batch-running your algorithms on various inputs, and plotting your results on a graph to learn more about the space and time complexity characteristics of your code. Just because an algorithm provides the correct path to goal does not mean it has been implemented correctly.

5. Tips on Getting Started

Begin by writing a class to represent the ​state​ of the game at a given turn, including parent and child nodes. We suggest writing a separate s​ olver​ class to work with the state class. Feel free to experiment with your design, for example including a ​board​ class to represent the low-level physical configuration of the tiles, delegating the high-level functionality to the state class.

You will not be graded on your design, so you are at a liberty to choose among your favorite programming paradigms. Students have successfully completed this project using an entirely object-oriented approach, and others have done so with a purely functional approach. Your submission will receive full credit as long as your driver program outputs the correct information.

● Make sure​ your code passes at least the submission test cases.
● Make sure​ your algorithms generate the correct solution for an arbitrary solvable
problem instance of 8-puzzle.
● Make sure​ your program always terminates without error, and in a reasonable amount of
time. ​You will receive zero points from the grader if your program fails to terminate. Running times of more than a minute or two may indicate a problem with your implementation.​ If your implementation exceeds the time limit allocated (20 minutes for all test cases), your grade may be incomplete.
● Make sure​ your program output follows ​the specified format exactly.​ In particular, for the path to goal, use square brackets to surround the list of items, use single quotes
around each item, and capitalize the first letter of each item. Round floating-point numbers to 8 places after the decimal. You will not receive proper credit from the grader if your format differs from the provided examples above.