当前位置:服务支持 >  软件文章 >  CS 471/571课程:操作系统详解

CS 471/571课程:操作系统详解

阅读数 2131
点赞 49
article_banner

my wechat:Yooo932851

Don't hesitate to contact me

1. Introduction

Your current OS/161 system has minimal support for running executables - nothing that could be

considered a true process. This project involves the transformation of OS/161 into a true multi‐tasking operating system. The project is left open-ended deliberately; there are several design options that you can consider and commit to. After this assignment, your OS/161 kernel will be capable of running multiple processes (as long as those processes use the system calls you implement) at once from actual compiled programs stored in your account. These programs will be loaded into OS/161 and executed in user mode by System/161; this will occur under the control of your kernel. In other words, in this project, you will implement some of the system calls we discussed in the beginning of the semester, including fork,  exec, and waitpid, etc.

This project has two parts with two distinct deadlines: Part 1 is due on 4/14 and it consists of your answers to code reading questions, declaration of your project partner (if any), and a short progress report. Part 2 is due on 5/5 and it consists of your coding solutions – the programming part, and your design document.

In this project, you have to work onzeus.vse.gmu.edu. Before starting, you will need to complete a few steps (as in OS/161 Asst 1). Follow the steps from the previous assignment (OS 161/PA1) in the sections “Tag your Git Repository”, “Configure OS/161 for ASST1”, and “Building for ASST1”, but using ASST2 in place of ASST1, and build-asst2.php in place of build-asst1.php. Please see the previous assignment’s (OS 161/Asst 1) hand out for instructions on how to do that. When you run your kernel configured for this assignment, you will again see a menu with various options (including test options).

As a first step, you must implement the interface between user-mode programs ("userland") and the kernel. As usual, we provide part of the code you will need. Your job is to design and build the missing pieces. You will also be implementing the subsystem that keeps track of the multiple tasks you will have in the future. You must decide what data structures you will need to hold the data pertinent to a "process" (hint: look at kernel include files of your favorite operating system for suggestions, specifically the proc structure.)

In order to do that you will have to read and understand the parts of the system that are written for you. This is an essential component of this assignment. The existing code can run one user-level C program at a time as long as it doesn't want to do anything but shut the system down. We have provided sample user programs that do this (reboot, halt, poweroff), as well as others that make use of features you will be adding in this assignment.

So far, all the code you have written for OS/161 has only been run within, and only been used by, the operating system kernel. In a real operating system, the kernel's main function is to provide support for user-‐level programs. Most such support is accessed via "system calls." The system gives you one system call, reboot(), which is implemented in the function sys_reboot() in main.c. In GDB, if you put a breakpoint on sys_reboot and run the "reboot " program, you can use "backtrace" to see how it got there.

User--level programs

The System/161 simulator can run normal programs compiled from C. The programs are compiled with a cross--‐compiler, cs161-gcc. This compiler runs on the host machine and produces MIPS executables; it is the same compiler used to compile the OS/161 kernel. To create new user programs, you will need to edit the Makefile in bin, sbin, or testbin (depending on where you put your user programs) and then create a directory similar to those that already exist. Use an existing program and its Makefile as a template.

Design

Please note that yourdesign documents become an important part of the work you submit in this project. The design documents should clearly reflect the development of your solution. They should not merely explain what you programmed.

Note that it can often be hard to write (or talk) about new software design - you are facing problems that you have not seen before, and therefore even finding terminology to describe your ideas can be difficult. There is no magic solution to this problem; but it gets easier with practice. Now is a really good time to work closely with your partner. As you design this assignment, you may have to ”invent” terminology and notation - this is fine (just be sure to explain it to your TA in your design document). If you do this, by the time you have completed your design, you will find that you have the ability to efficiently discuss problems that you have never seen before.

To help you get started, we have provided the following questions as a guide for reading through the code. We recommend that you divide up the code and have each partner answer questions for different modules. After reading the code and answering questions, get together and exchange summations of the code you each reviewed. Once you have done this, you should be ready to discuss strategy for designing your code for this assignment. As always, make sure to comply with the GMU and CS honor codes – the only person you can cooperate with is your partner (if you have one).

2. Part 1 (20 points)

Include the answers to the code walk-through questions in a file calledcode-reading.txt. You will need to create this file at your convenient directory on Zeus.

kern/userprog: This directory contains the files that are responsible for the loading and running of user-level programs. Currently, the only files in the directory are loadelf.c, runprogram.c, and uio.c, although you may want to add more of your own during this assignment. Understanding these files is the key to getting started with the implementation of multiprogramming. Note that to answer some of the questions, you will have to look in files outside this directory.

loadelf.c: This file contains the functions responsible for loading an ELF executable from the filesystem and into virtual memory space. (ELF is the name of the executable format produced by cs161-gcc.) Of course, at this point this virtual memory space does not provide what is normally meant by virtual memory -- although there is translation between the addresses that executables "believe" they are using and physical addresses, there is no mechanism for providing more memory than exists physically.

runprogram.c: This file contains only one function, runprogram(), which is responsible for running a program from the kernel menu. It is a good base for writing the execv() system call, but only a base -- when writing your design doc, you should determine what more is required for execv() that runprogram() does not concern itself with. Additionally, once you have designed your process system, runprogram() should be altered to start processes properly within this framework; for example, a program started by runprogram() should have the standard file descriptors available while it's running.

uio.c: This file contains functions for moving data between kernel and user space. Knowing when and how to cross this boundary is critical to properly implementing user-level programs, so this is a good file to read very carefully. You should also examine the code in lib/copyinout.c

Questions

1. What are the ELF magic numbers?

2. What is the difference between UIO_USERISPACE and UIO_USERSPACE? When should one use UIO_SYSSPACE instead?

3. Why can the struct uio that is used to read in a segment be allocated on the stack in load_segment() (i.e., where does the memory read actually go)?

4. In runprogram(), why is it important to call vfs_close() before switching to usermode?

5. What function forces the processor to switch into usermode? Is this function machine dependent?

6. In what file are copyin, copyout and memmove are defined? Why can't copyin and copyout be implemented simply as memmove?

7. What is the purpose of userptr_t? Explain briefly.

kern/arch/mips/mips: traps and syscalls

Exceptions are the key to operating systems; they are the mechanism that enables the OS to regain control mof execution and therefore do its job. You can think of exceptions as the interface between the processor and the operating system. When the OS boots, it installs an "exception handler" (carefully crafted assembly code) at a specific address in memory. When the processor raises an exception, it invokes this, which sets up a "trap frame" and calls into the operating system. Since "exception" is such an overloaded term in computer science, operating system lingo for an exception is a "trap" -- when the OS traps execution. In OS 161 terminology, interrupts are exceptions, and more significantly for this assignment, so are system calls. (The terminology difference between OS/161 and many OS textbooks are unfortunate; but it is not uncommon at all – recall our class discussion about “interrupts” and “exceptions”). Specifically, syscall.c handles traps that happen to be syscalls. Understanding at least the C code in this directory is key to being a real operating systems programmer, so we highly recommend reading through it carefully.

trap.c: mips_trap() is the key function for returning control to the operating system. This is the

C function that gets called by the assembly exception handler. md_usermode() is the key function for returning control to user programs. kill_curthread() is the function for handling broken user

programs; when the processor is in usermode and hits something it can't handle (say, a bad instruction), it raises an exception. There's no way to recover from this, so the OS needs to kill off the process. Part of this assignment will be to write a useful version of this function.

syscall.c: mips_syscall() is the function that delegates the actual work of a system call to the kernel function that implements it. Notice that reboot() is the only case currently handled. You will also find a function, md_forkentry(), which is a stub where you will place your code to implement the fork() system call. It should get called from mips_syscall().

Questions

8. What is the numerical value of the exception code for a MIPS system call?

9. Why does mips_trap() set curspl to SPL_HIGH "manually", instead of using splhigh()?

10. How many bytes is an instruction in MIPS? (Answer this by reading mips_syscall() carefully, not by looking somewhere else.)

11. Why do you "probably want to change" the implementation of kill_curthread()?

12. What would be required to implement a system call that took more than 4 arguments? lib/crt0: This is the user program startup code. There's only one file in here, mips-crt0.S, which contains the MIPS assembly code that receives control first when a user-level program is started. It calls the user program's main(). This is the code that your execv() implementation will be interfacing to, so be sure to check what values it expects to appear in what registers and so forth.

lib/libc: This is the user-level C library. There's obviously a lot of code here. We don't expect you to read it all, although it may be instructive in the long run to do so. For present purposes you need only look at the code that implements the user-level side of system calls, which we detail below.

errno.c: This is where the global variable errno is defined.

syscalls-mips.S: This file contains the machine-dependent code necessary for implementing the user-level side of MIPS system calls.

syscalls.S: This file is created from syscalls-mips.S at compile time and is the actual file assembled into the C library. The actual names of the system calls are placed in this file using a script called callno-parse.sh that reads them from the kernel's header files. This avoids having to make a second list of the system calls. In a real system, typically each system call stub is placed in its own source file, to allow selectively linking them in. OS/161 puts them all together to simplify the makefiles.

Questions

13. What is the purpose of the SYSCALL macro?

14. What is the MIPS instruction that actually triggers a system call? (Answer this by reading the source in this directory, not looking somewhere else.)

Once you finish the code reading questions, create a second file named “progress-report.txt

In that file, first give the full names and G numbers of the team members who decided to collaborate in project PA2 (see the “Working with a partner” paragraph in Section 1 of the PA1 specification for the rules governing partner selection – you can’t change partner from one project to another). Then, in the same file, give a short (around one-page) summary of the progress you made on the design and coding part of your project. While you are not required tosubmit any code in Part 1, you are expected to begin at least the design part and possibly the coding during the first two weeks. In the progress report, you should summarize your efforts towards that goal (for example, you can describe your efforts towards designing/implementing specific system calls.)

Finally, briefly explain the responsibilities of the team members in Part 2 of the project (the coding and design part). Failure to submit the progress report may result in deduction of points.

You can work with a partner in Part 2 (Coding Part) below only if that student’s name is explicitly stated in Part 1 submission in a timely manner.

Submission of Part 1:

Create a directory called ‘asst2-part1’, inside the ‘~/os161/’ directory.

% cd ~/os161/

% mkdir asst2-part1

When both code-reading.txt and progress-report.txt files are ready, put these two files in it.

Next, tar and compress your asst2-part1 directory.

% cd ~/os161

% tar -czf uid1_uid2-asst2-part1.tar.gz asst2-part1

You should replace uid1 and uid2 with you and your partner's GMU email IDs. If you are working alone, the last line should read tar -czf uid-asst2-part1.tar.gz. Submit the compressed Part 1 file on Blackboard. All members of a group must submit separately the same compressed file, and before the Part 1 deadline 4/14 11:59 PM. Make sure to coordinate. Make sure to double check that you are submitting the correct file -- if you submit wrong, corrupt, or empty file you are likely to get zero.

Late penalty for Part 1 will be 15% for each day. Submissions that are late by 3 or more days will not be accept. Please plan in advance.

You can make multiple submissions; we will consider ONLY the last submission that you make: so in case you need to re-submit, make sure the compressed file contains all the components listed above.

Note: The contents of Part 1 should be submitted separately from Part 2, and in a timely manner.

3. Part 2 -- Design (20 points) and Implementation (60 points) - (80 points total)

Download necessary files

As in the previous ASST1, you need to copy two files and put them in their appropriate directory. You need to update your source code with these two files before you begin your assignment:

• cd ~/os161/os161-1.11/

• cp /usr/local/shared/cs471/PA2/callno.h kern/include/kern/

• cp /usr/local/shared/cs471/PA2/unistd.h include/

After copying the files accurately, tag your repository as asst2-begin, if you have not already done

so. Also, create a directory asst2-part2 inside ~/os161/ in which you will put the submission items.

System calls and exceptions

Implement system calls and exception handling. The full range of system calls that we think you might want over the course of the semester is listed in kern /include/kern/callno.h. For this assignment you should implement:

● getpid

● getppid

● fork

● execv

● waitpid

● _exit

It's crucial that your syscalls handle all error conditions gracefully (i.e., without crashing OS/161.) You should consult the OS/161 man pages included in the distribution (in ~/os161/os161- 1.11/man/syscall directory) and understand fully the system calls that you must implement.You must return the error codes as described in those man pages. We also recommend that you look at the comments in syscall.c and the examples within the mips_syscall() function that you can find there.

Additionally, your syscalls must return the correct value (in case of success) or error code (in case of failure) as specified in the man pages. Some of the grading scripts rely on the return of appropriate error codes; adherence to the guidelines is as important as the correctness of the implementation. The file include/unistd.h contains the user-level interface definition of the system calls that you will be writing for OS/161 (including ones that are relevant for another potential assignment). This interface is different from that of the kernel functions that you will define to implement these calls. You need to design this interface and put it in kern/include/syscall.h. As you may have discovered in OS 161/Asst0, the integer codes for the calls are defined in kern/include/kern/callno.h. You need to think about a variety of issues associated with implementing system calls. Perhaps the most obvious one is: can two different user-level processes (or user-level threads, if you choose to implement them) find themselves running a system call at the same time? Be sure to argue for or against this and explain your final decision in the design document.

For any given process, the first file descriptors (0, 1, and 2) are considered to be standard input (stdin), standard output (stdout), and standard error (stderr). These file descriptors should start out attached to the console device ("con:").

An important design consideration is to decide where to put the code for these system calls. A good place would be inside the kern/userprog directory. For the 6 system calls in this assignment, create 6 files named getpid.c, getppid.c, fork.c, execv.c, waitpid.c, and exit.c, inside the kern/userprog directory. Make sure to add the file names to your kern/conf/conf.kern file in an appropriate place, as you have done in Assignment-0 for the hello.c file. By convention, the names of the systems call functions will be as follows: sys_getpid, sys_getppid, sys_fork, sys_execv, sys_waitpid, sys_exit. These names must also be added as declarations in the kern/include/syscall.h file.

getpid()

A pid, or process ID, is a unique number that identifies a process. The implementation of getpid() is not terribly challenging, but pid allocation and reclamation are the important concepts that you must implement. It is not OK for your system to crash because over the lifetime of its execution you've used up all the pids. Design your pid system; implement all the tasks associated with pid maintenance, and only then, implement getpid().

getppid()

This system call works exactly like getpid(), except that it returns the ‘parent process’ pid. Note that, the getppid() system call is not included in the OS/161 man pages, and you can assume the following spec:

getppid() will return the process ID of the parent of the calling process. This will be either the ID of the process that created this process using fork() (if the parent hasn’t terminated yet), or -1 (if the parent has already terminated).

fork(), execv(), waitpid(), _exit()

These system calls are probably the most challenging part of the assignment, but also the most rewarding.

They enable multiprogramming and make OS/161 a much more useful entity.

fork() is the mechanism for creating new processes. It should make a copy of the invoking process and make sure that the parent and child processes each observe the correct return value (that is, 0 for the child and the newly created pid for the parent). You will want to think carefully through the design of fork() and consider it together with execv() to make sure that each system call is performing the correct functionality. You would also need to make a copy of the parent's address space; you can do that by calling as_copy in the current dumbvm system (see kern/arch/mips/mips/dumbvm.c). You can ignore kern/vm/addrspace.c for this assignment, it is not used in ASST2.

execv(), although "only" a system call, is really the heart of this assignment. It is responsible for taking newly created processes and make them execute something useful (i.e., something different than what the parent is executing). Essentially, it must replace the existing address space with a brand new one for the new executable (created by calling as_create in the current dumbvm system) and then run it. While this is similar to starting a process straight out of the kernel (as runprogram() does), it's not quite that simple. Remember that this call is coming out of userspace, into the kernel, and then returning back to userspace. You must manage the memory that travels across these boundaries very carefully. (Also, notice that runprogram() doesn't take an argument vector -- but this must of course be handled correctly in execv()).

While you have to make a number of design decisions, you could consider the following main steps when implementing execv(): i.) Copy arguments from user space into kernel buffer; ii.) Open the executable file, create a new address space and load the elf into it; iii.) Copy the arguments from kernel buffer into user stack; iv.) Return to user mode.

Although it may seem simple at first, waitpid() requires a fair bit of design. Read the specification in the OS 161 waitpid() man page carefully to understand the semantics and consider these semantics from the ground up in your design. You may also wish to consult the general UNIX man page, though keep in mind that you are not required to implement all the things UNIX waitpid() supports, nor is the UNIX parent/child model of waiting the only valid or viable possibility.

The implementation of _exit() is intimately connected to the implementation of waitpid(). They are essentially two halves of the same mechanism. Most of the time, the code for _exit() will be simple and the code for waitpid() more complicated, but it's perfectly viable to design it the other way around as well. If you find both are becoming extremely complicated, it may be a sign that you should rethink your design.

A note on errors and error handling of system calls:

The man pages in the OS/161 distribution contain a description of the error return values that you must return. If there are conditions that can happen that are not listed in the man page, return the most appropriate error code from kern/errno.h. If none seems particularly appropriate, consider adding a new one. If you're adding an error code for a condition for which Unix has a standard error code symbol, use the same symbol if possible. If not, feel free to make up your own, but note that error codes should always begin with E, should not be EOF, etc. Consult Unix man pages to learn about Unix error codes; on Linux systems "man errno" will do the trick. Note that if you add an error code to kern/include/kern/errno.h, you need to add a corresponding error message to the file kern/include/kern/errmsg.h

kill_curthread()

Feel free to write kill_curthread() in as simple a manner as possible. Just keep in mind that essentially nothing about the current thread's userspace state can be trusted if it has suffered a fatal exception -- it must be taken off the processor in as judicious a manner as possible, but without returning execution to the user level.

Design Considerations

Here are some additional questions and thoughts to aid in writing your design document. They are not, by any means, meant to be a comprehensive list of all the issues you will want to consider. You do not need to explicitly answer or discuss these questions in your design document, but you should at least think about them.

Your system must allow user programs to receive arguments from the command line. In addition, it must also work with the “p” menu option (e.g., it must work for p testbin/add 2 5.) To accomplish

that, you need to edit the file menu.c. See “Testing Your System Call Implementation” section below as well.

For instance, you should be able to run the following program:

char *filename = "/testbin/add";

char *args[4];

pid_t pid;

args[0] = "add";

args[1] = "3";

args[2] = "4";

args[3] = NULL;

pid = fork();

if (pid == 0) execv(filename, args);

which will load the executable file testbin/add, install it as a new process, and execute it by computing the sum (7).

Some questions to think about:

- Passing arguments from one user program, through the kernel, into another user program, is a bit of a chore. What form does this take in C? You will probably need several iterations to get this right.

- What primitive operations exist to support the transfer of data to and from kernel space? Do you want to implement more on top of these?

- How will you determine: (a) the stack pointer initial value; (b) the initial register contents; (c) the return value; (d) whether you can exec the program at all?

- You will need to "bullet-proof" the OS/161 kernel from user program errors. There should be nothing a user program can do to crash the operating system (with the exception of explicitly asking the system to halt).

- What new data structures will you need to manage multiple processes?

- What relationships do these new structures have with the rest of the system?

Testing your system call implementations

To test your system call implementations, the best way would be to write small programs that make use of those system calls (fork, exec, waitpid, etc.), compile with cs161-gcc compiler, and run it under os161.

Specifically, when you build and invoke your kernel, on the sys161 menu, choose first the “Operations Menu”, and then use the “[p]” option to invoke the executable. For example, typing:

p testbin/forktest would invoke the forktest executable which is already provided in the “testbin” directory. The “testbin” directory has other executables; and some of them may be useful for debugging. By reading the first few lines of C programs there, you may be able to see whether the test program is suitable for this project or not.

But more importantly, do not restrict your tests with the programs in the testbin directory. In fact, it may be wiser to first use very small programs (like the program with fork() and exec() whose code is given on the previous page). For example, you can store that program in a separate directory under testbin and compile (by preparing and running a Makefile similar to that available for “forktest” in testbin ). Those makefiles call cs161-gcc compiler with proper flags. And then

you can again use p testbin/myprogram if the name of your new executable is myprogram. In

your submission, include in your asst2-part2 directory the scripts for the tests that you designed so that we can evaluate them as well.

In this project, it is NOT necessary to implement the filesystem related system calls (such as read() and write()). You can certainly do that if you wish; but it is not required. However, you will still need a “shortcut” in order to execute library calls like putchar(), printf() to print to the screen; because

those calls issue a call to “write()”. If you don’t want to implement “write()” separately, you can

do the following:

Go to the kern/arch/mips/mips directory, and open for editing the file syscall.c. Within the mips_syscall() function, you will see a switch statement. To handle the calls to “write()”, add the following to the switch statement:

case SYS_write:

for (i = 0; i < (size_t) tf->tf_a2; ++i) {

kprintf("%c", ((char *) tf->tf_a1)[i]);

}

break;

You would also need to add a declaration for the variable i as an unsigned int, and put it in the beginning of the mips_syscall()function. Save syscall.c, build your kernel again, and this should be sufficient for simply printing to the screen. In fact, even the “forktest” program will need this “patch” in addition to the correct implementation of fork(), and exec(), for proper printing. But as we mentioned, it is best to test your implementations first with very short programs, before moving to “forktest” which is more complicated. You may also find some of the programs from the bin directory useful, such as false, and true. For getppid(), write your own (short) test program.

4. Submission and Grading

In this project, you can continue to work with your partner from Project 1. If you worked alone in Project 0 and Project 1, you could pair up with another CS 471/571 student from your section who also worked alone in the same projects.

You will need to (again) put significant time on reading the existing OS 161 kernel code. Make sure to allocate sufficient time to debugging and extensive testing.

On the assignment due date:

1. Commit all your final changes to the system. Make sure your partner has committed everything as well. Make sure you have the most up-to-date version of everything. Re-run make on both the kernel and userland to make sure all the last-minute changes get incorporated.

2. Run the tests and generate scripts. If anything breaks, repeat step 1.

3. Tag your Git repository; prepare the diff and submission source tree.

4. Make your submission to the Blackboard by following the guidelines below.

Submission for Part 2

(Part 1 must be submitted separately and by an earlier deadline. See p. 5 for details.)

Your final submission should be a .tar.gz file. Please name this file uid1_uid2-asst2- part2.tar.gz, where uid1 and uid2 are you and your partner's GMU email IDs. If you work alone, it should be named as uid-asst2-part2.tar.gz where uid is your GMU e-mail id. Before you submit, make sure you commit your latest working copy first:

% cd ~/os161/os161-1.11/

% git commit -am “Completed project-2”

Tag your latest working copy and run a diff:

% git tag asst2-end

% git diff asst2-begin asst2-end > ../asst2-part2/asst2.diff

In the ‘asst2-part2’ directory, you should place the following:

Your design document. Please use only .pdf, or.txt file formats.

● A “diff” between asst2-begin and asst2-end.

● A script of OS/161 running the varioustt* tests successfully.

● A script of the new test or tests you added for testing your system call implementations.

Note: Your design document is worth 20% of the grade for this assignment. It should contain:

● A high-level description of how you are approaching the problem.

● A detailed description of the implementation (e.g., new structures, why they were created, what

they are encapsulating, what problems they solve).

● A discussion of the pros and cons of your approach.

● Alternatives you considered and why you discarded them.

Next, tar and compress your asst2-part2 directory AND your entire source tree.

% cd ~/os161

% tar -czf uid1_uid2-asst2-part2.tar.gz os161-1.11 asst2-part2

On Blackboard, under the Projects/Project 2 folder you will find a link to submit your compressed file. All members of a group must submit separately the same compressed file, and before the part 2 deadline 5/5 11:59 PM. Make sure to coordinate.

You can make as many submissions as you like; we will consider ONLY the last submission that you make. So in case you need to re-submit, you must make sure the compressed file contains all the components listed above.

Late penalty for part 2 of this assignment will be 15% for each day. Submissions made on 5/8 or later will not be accepted.

Warning: The design and coding effort needed for this project is not trivial; and that is wh

相关文章
QR Code
微信扫一扫,欢迎咨询~

联系我们
武汉格发信息技术有限公司
湖北省武汉市经开区科技园西路6号103孵化器
电话:155-2731-8020 座机:027-59821821
邮件:tanzw@gofarlic.com
Copyright © 2023 Gofarsoft Co.,Ltd. 保留所有权利
遇到许可问题?该如何解决!?
评估许可证实际采购量? 
不清楚软件许可证使用数据? 
收到软件厂商律师函!?  
想要少购买点许可证,节省费用? 
收到软件厂商侵权通告!?  
有正版license,但许可证不够用,需要新购? 
联系方式 155-2731-8020
预留信息,一起解决您的问题
* 姓名:
* 手机:

* 公司名称:

姓名不为空

手机不正确

公司不为空