Skip to content

Latest commit

 

History

History
177 lines (135 loc) · 15.1 KB

README.md

File metadata and controls

177 lines (135 loc) · 15.1 KB

COMP 211 Final Project: Simple Shell

Introduction

Now that you've familiarized yourself with the command line interface over the course of the semester, you will have the opportunity to build a simple shell in C! But don't fret - when we say simple, we mean simple. The shell specified in this assignment is merely capable of changing directories and executing system programs such as pwd and ls. The goal of this assignment is to familiarize you with system-related library functions and to give you the pride of knowing that something you've been working with all semester is something that you could have built all along.

Background Reading

Project Structure

Four files are included in this project:

  • Makefile- contains information used to compile your program with the make command. Do not modify.
  • shell.h- includes declarations and specifications for all of the functions in shell.c. Do not modify.
  • shell.c- contains function definitions for all functions in shell.c. In particular, your goal for this assignment will be to implement the following functions: parse, find_fullpath, and execute.
  • driver.c- contains the main function, which is just a loop that reads in a command and uses the functions written in shell.c to determine whether the command is valid and handle it appropriately. Reading this file before you begin will help you understand the usage of functions you must implement. Do not modify.

Honor Code Header

Please include the honor code header at the top of the shell.c file. Since we do grade manually for style we do not include names on code listings to avoid biasing the grading.

// PID: 9DigitPidNoSpacesOrDashes
// I pledge the COMP211 honor code.

Shell Fundamentals

A shell command takes the form <command_name> <arg_1> <arg_2> ... with an arbitrary number of arguments and an arbitrary amount of space in between each term. <command_name> is the name of an executable file, the rest of the arguments are parameters passed to the executable. When a command is entered, a shell will find the absolute path to <command_name> and spawn a new process with the path to the executable and supply its parameters.

How does the shell locate the exact location of this command?

This is where the concept of the PATH environment variable comes in. Environment variables basically amount to variables that are globally available to your system. The purpose of the PATH variable is to include every possible path where executable binaries might be located. For example, try entering the command echo $PATH in the learncli shell. You will see something like:

/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/mnt/learncli/bin

The PATH variable is essentially a list containing every path that the shell should search for when looking for the command entered. For example, if I want to run the C debugger, gdb, I might enter the command gdb. The shell will then see if there is a path leading to this program. First it will try /usr/local/sbin/gdb, then /usr/local/bin/gdb and so on, until it searches /usr/bin/gdb and determines that this is where gdb's binaries are located.

Note: You can use the which command to see where individual commands are stored on the system. For example which ls returns something like /bin/ls meaning that the binary for ls is located in the directory /bin. This may be helpful in debugging part 2.

All of the binary files for every command on your unix system exist in one of the paths listed in $PATH (take a look!). So for this lab, you don't need to implement any commands yourself, you're just writing an interface to run these commands like a shell. However, note that cd and exit are special built-in commands that have already been implemented for you. cd is often included as a built-in in shells because a forked process inherits the directory that the parent process was in. Since the purpose of cd is to move us to a new working directory, we have to treat it specially. (You'll notice that which cd doesn't return anything because cd isn't technically a command in the same sense that other commands are.)

In Part 1, you will parse the raw string entered on the command line into it's component parts. In Part 2 you will implement a function to find the location of the <command_name> file and in Part 3, you will finally spawn the new process to execute the command!

Part 0: System Calls

This project differs from previous lab assignments in the sense that it is not self-contained - it must be able to understand and manipulate the system it is running on. For example, it must have a way to know whether a program exists or not, and if a program does exist then it must have a way to create a process for that program's execution. C is a very systems-driven language, and as such, contains a number of standard library functions that let you interact with the system!

Below is a list of all the functions that you may need to use in the project. Please ensure that you are familiar with their syntax and usage. It is strongly recommended that you read the documentation for each function before proceeding. Detailed information about each function listed below can be found using the man command from the console: i.e. man chdir, will show the man page (short for manual page) for the chdir function. Additionally, the documentation in the GNU C Library Reference Manual is a good repository for this information.

  • Change Directory: int chdir(const char* path)
  • Fork a Process: pid t fork(void)
  • Execute External Command: int execv(const char* file, char* const argv[])
  • Get Environment Variable: char getenv(const char* name)
  • Memory Allocation: void* malloc(size t size)
  • Free Memory: void free(void* ptr)
  • File/Directory Status: int stat(const char* path, struct stat* buffer)
  • Blocking Operation: pid t wait(int *status)

Once you have a good understanding of how these functions are used, it'll be time to start using them! At a high level, this assignment can be divided into three tasks, each of which correspond with a function to implement in shell.c. Each of these functions will be described below, but many more details about their specification are available in shell.h.

Part 1: A Problem with Parsing

A good shell does what it is told, so your first task is to write a parser in the parse function that will populate a command_t struct that represents the format of a command. The definition of the command_t struct and prototype for the parse function are as follows:

typedef struct {
	
	char* path;		// fully qualified path to the executable or built-in cmd (must malloc)
	int argc;		// total number of arguments (0 if no command provided, -1 if command not found)
	char** argv;		// argument values (must malloc)
	
} command_t;
void parse( char* line, command_t* p_cmd );

Your goal is to use the information from the raw command string line to assign the values in the p_cmd struct according to criteria defined in shell.h.

For example, if the user enters the command cd /mnt/cdrom at the shell prompt, the parse function would take cd /mnt/cdrom as the line parameter. Using the space character as the delimiter, the fields in the command_t struct would be:

	path = "cd"
	argc = 2
	argv = {"cd", "/mnt/cdrom", NULL}  

Note that for non-builtin commands, path should be an absolute path. Further examples are included in shell.h.

For this function you will make use of the is_builtin and do_builtin functions to handle the special cases. Read the do_builtin function to understand the behavior of the cd built-in.

Complete parse such that the argc and argv fields in the command_t struct are populated correctly and the built-in functions are handled. You will implement find_fullpath in part 2 and return to finish the parse function.

Part 2: Searching the System

Once Part 1 is complete, the parse function will be able to populate a command_t struct with enough information to determine the name of the program you are trying to run (based on the first element of argv).

Your task is to complete the find_fullpath function, which takes in a command name and a command_t struct, populating the command_t's path field with the complete path to the command's executable binaries if they exist, and populating path with the original command otherwise. In simple terms, you must write a loop that parses each folder defined in the PATH environment variable, then use this folder along with the stat function to determine whether the command's binaries are within that folder. Additional information about the usage of stat is available below.

Once you have implemented the find_fullpath function, you should call it within the parse function to determine the absolute path of the command. The end result should be such that after parse is called, all fields of the command_t struct are populated correctly.

File / Directory Status

The stat function can be used to determine the existence of a file, or directory, on the file system. The following incomplete code segment can be used in the find_fullpath function to determine if the file or directory is on the file system. Additionally, this pattern is already used in the do_builtin function.

struct stat buffer;
int exists;
// string that represents the fully qualified
// path of a file or directory on the file system
char* file_or_dir;
exists = stat( file_or_dir, &buffer );
if ( exists == 0 && ( S_IFDIR & buffer.st_mode ) )  {
          // Directory exists
} else if ( exists == 0 && ( S_IFREG & buffer.st_mode ) ) {
          // File exists
} else { }
// Not a valid file or directory

Part 3: Execute Order 211

With parse and find_fullpath completed, we have all the information we need to actually execute our desired command! To this end, you must complete the definition of the execute function, which takes a command_t struct as an argument and executes the corresponding command using the fork-exec-wait paradigm learned in class. If the child process either cannot be forked or terminates in an error condition then you can emit a descriptive error with perror or just printf. The return value of the execute function should be either SUCCESSFUL or ERROR (defined in shell.h). A sketch of the fork-exec-wait can be seen below:

Executing a Command

When a command is executed using the execv function, a child process will be created. The following incomplete code segment can be used as a guide when implementing the execute function defined in shell.c. Take another look at the man pages for fork and execve to make sure you understand what parameters and return values are expected for these functions.

if ( fork() == 0 ) {
      // This is the child
      // Execute in same environment as parent
      execv( ... );
      perror("Execute terminated with an error condition!\n");
      exit( 1 );
}
// This is the parent - wait for the child to terminate
wait( ... );

Part 4: Testing

If everything has been correct up to this point then congratulations, you are finished! But now that you've implemented a shell, you'll want to actually try using it. Use the make command and execute the compiled code with ./driver. You should be greeted with a shell prompt that allows you to enter commands, so try it out! Here is an example trace of what it looks like to execute a few commands with the completed shell:

image

This shell should be able to run commands like a normal shell! Try using ls, mkdir, pwd, etc to make sure it works as expected.

Note: Where it says "null", it is outputting the value stored in the USER environment variable, which happens to be null in the learncli environment. If you run this program in a shell on your personal machine, you should see your computer username show up instead!

Submit your assignment

  1. Use git to push your finished code to this Github repository.
  2. Go to the COMP 211 course in GradeScope and click on the assignment called Final Project.
  3. Click on the option to Submit Assignment and choose GitHub as the submission method.
  4. You should see a list of your public repositories. Select the one named final-project-yourname and submit it.
  5. Your assignment should be autograded within a few seconds and you will receive feedback for the autograded portion.
  6. If you receive all the points, then you have completed this lab! Otherwise, you are free to keep pushing commits to your GitHub repository and submit for regrading up until the deadline of the lab.

Grading

Even though this project is graded for the same amount of points as other labs, it will be weighted more than the rest of your assignments when entered in the Sakai gradebook. This project cannot count towards one of the "dropped coding assignments" and is worth 5% of the final grade.

Note that the autograder tests have been roughly ordered in a way such that the later tests are dependent on the earlier tests working (i.e. some tests for parse will not work until find_fullpath works correctly). When you debug your code, it will be worthwhile to focus on passing the lower-numbered test cases first.

Autograder tests (8 pts)

Style (2pts)

  • Variable Names (.5 pts)
    • Single-character variable names are only used for counting/indexing, or when used to represent a variable whose type has only one instance.
    • All "magic numbers" are defined as constants.
    • Variable names are either related to the usage of the variable, or the meaning is commented.
    • No unused variables are declared.
  • Readability (.75 pts)
    • Proper indentation (use the following Vim trick: (1G=G) )
    • Consistent whitespace theme used throughout.
      • Always put consecutive curly braces one line apart (or two, depending on preference).
    • Logically distinct blocks of code are separated by a whitespace.
    • No more than two consecutive lines of empty whitespace.
    • No old debug code is present (including in the comments).
  • Correctness of Code (.75 pts)
    • For all functions (including main if applicable), every path of execution leads to a return statement.
    • No libraries are included that weren't provided in the starter code or mentioned in the readme.

We reserve the right to deduct points from the autograder if we notice that you have hard-coded any test cases and not actually fully implemented the functions.