CS 111

Scribe Notes for 1/23/13

by Alexander Chiou, Derek Ng

Orthogonality

In Emacs version 24.2, there is code that reads in a file:


1  (directory-files-and attributes "/usr/src/kernel/fs/")
2  result ("ufs" 209361271 "eggert")
  	      ("README" 209361271 "eggert")

At the moment, it does so slowly, because:


1  DR *d = opendir ("/usr/src/kernel/fs/");
2  while (dp = readdir (d ...)) {
3    dp -> d_name = "ufs";
4    char buf[1024];
5    strcpy (buf, "/usr/sr/kernel/fs/");
6    strcat (buf, dp->d_name);
7  }

The cost of this is O(DN), where D is the length of the filename and N is the number of files in the directory. We can speed it up by taking the strcpy call out of the loop and reusing the char buffer.


1  DR *d = opendir ("/usr/src/kernel/fs/");
2  char buf[1024];
3  strcpy (buf, "/usr/sr/kernel/fs/");
4  while (dp = readdir (d ...)) {
5    dp -> d_name = "ufs";
6    strcpy (buf + dirlen, dp->d_name);
7    // dirlen is length of directory name
8	  }

And so the code continues, but we still have to traverse the length of the string that holds the file's full name.


8  struct stat sb;
9  lstat (buf, &st)
10 ...convert to Lisp form...

We want O(N) cost, so we use another function, fstatat, instead of lstat:


1  int fd = dirfd (d) // this file gets the file descriptor
2  fstatat (fd, dp -> d_name, &st, O_SYMLINK);
3  // fd* points to the directory the file is in

fstatat allows us to get file status relative to a directory's file descriptor. Using it, we no longer have to traverse over the full length of a directory's name.

As the above graph shows, we want to be able to choose z freely without affecting x or y. Were the axes not orthogonal, moving along one axis would affect its coordinates along the other axes. In other words, we should choose axes that are orthogonal. In the previous example, the x-axis would be the file name while the y-axis would be the file contents. Since the length of the file name affects how easily the system is able to get information about the file's contents, this is not an orthogonal setup. In the end, everything ends up colliding. It's our job to figure out how to minimize these collisions.

The Classic Unix Model

The classic Unix model for orthogonality attempts to make the following factors orthogonal: processes, file names, device access, and file systems.

Processes

A process is a program that's executing in an isolated domain. To specify how processes work, we build an API for dealing with processes using our associated intuition visualizing them as data structures.

Forking

Forking is, put simply, cloning a process. The current process is called the parent process, and the new process is called the child process.


1  pid_t fork();
2  pid_t c = fork();

The function fork() takes in a void and returns a value of type pid_t, which is a signed int type. fork() clones the current process, spawning a child process. The child starts from where fork() was called in the parent. The return value of fork(), or c in the code segment above, is the pid of the child if we are in the original process, 0 if we are in the child process, -1 if there are not enough resources to create a child. For example:


1  pid_t c = fork();
2  if (c == 0) {
3    // do child stuff
4  } else {
5    // do parent stuff
6  }

In the example above, the return value of fork() is 0 in the child and something that isn't 0 in the parent. After fork() is called, in the parent, c will the child's pid. The parent will fail the else case and move to where the above code says "do parent stuff." In the child, however, since c is equal to 0, it will move to where the code says "do child stuff."


1  while (fork())
2    continue;
3  exit();

This code will attempt to make lots of children that immediately all exit. The parent will never have 0 as its return value, while the children fail out of the while loop and proceed to exit.


1  while (1)
2    fork();

This code, on the other hand, makes an exponential amount of children. The parent will fork, and then the parent and its child will fork, and the parent and its child and the two new grandchildren will fork, and so on and so forth. Running this code is ill-advised.

Suppose we want to print the date in an incredibly roundabout manner, forking the process and then having the child print the date. This is what the function would look like:


1  bool printdate (void) {
2    pid_t p = fork()
3    if (p < 0) return 0;
4    if (p == 0) {
5      execvp ("/usr/bin/date", (char *) {"date", "-u", 0});
6      exit(126);
7    }
8    // back in the parent
9    return 1;
10 }

int exec* (...) is a function that retuses a process and always returns -1. void exit(int) destroys a process.

The printdate() function forks itself, then checks if the fork completed successfully. The parent returns true, while the child prints the date and exits.

waitpid


1  pid_t waitpid (pid_t, int *, int);

waitpid is a function that waits on a process until it exits. It returns -1 if the first argument doesn't exist. In the printdate example above, the following code can be used instead:


1  int status;
2  while (waitpid (p, &status, 0) < 0)
3    continue;
4  return WIFEXITED (status) && WEXITSTATUS (status) == 0;

However, waitpid can cause deadlock. For example, if process a waits on process b and process b also waits on process a, this causes perpetual waiting. Unix convention circumvents this by allowing processes to only wait on their children (but not other descendents, like grandchildren).

Suppose a child is exited before waitpid is called on the child. This would be problematic. To solve this, we have zombie processes, processes that have exited but is still kept alive in the system. This means that any wait calls on finished processes won't crash.

If we want to produce lots of zombies, we can do so with the following code:


1  while ((p = fork()) >= 0)
2    if (!p)
3      exit(); // zombies!

We fork the current process, then the parent and child enter the loop. The parent fails out of the if statement and produces more children, and the child passes the if statement and immediately exits. These exited child processes are zombies.

Here is some example code that "reaps the zombies", forcing any processes in a zombified state to exit.


1  init.c
2  while (waitpid(-1, ...)) // This waits on any child.
3  {
4	  continue;
5  }

A Problem With Our Date Implementation

Suppose 'date' loops?

To fix this, we implement a 5 second time limit for processes, using a system call named alarm.

Here is some example code which uses alarm:


1  alarm(5); // If our process is still running after 5 minutes, it gets a signal
2  while (waitpid(p, &status, 0) < 0)
3  {
4      continue;
5  }
6  
7  signal (SIGALARM, (donothing)); // Implementation of donothing: void donothing {}
8  alarm(5);
9  if (waitpid(p, &status, 0) < 0)
10 {
11     // alarm went off
12     kill(p, SIGNIT) // kills child process
13     waitpid (p, &status, 0);
14 }

From line 12 of the above code, we can see that we have a new API member, kill. Here is its function declaration


1  int kill (pid_t, int sigNumber)

The Process Table

The process table is an object that the kernel looks at to make decisions concerning processes. It contains information about each of the processes that are currently running. Below is a rough example process table:

Pid	Exit Status	State	Parent Pid	RAM Start	RAM Size	Registers	User_ID	Group_ID	Instruction Pointer
1	126	ZOMBIE	9	0x0000	1024	%eax = 5	Bob	Students	0x00AA
2	0	EXITED	8	0x0B05	2048	%eax = 23	Billy	Students	0x0D6A

However, not all of the information in a process table may be accurate, especially for running processes. For example, let's say that Process 3 is running out of 10 possible process on a single core machine. Process 3 is "special", because it can have a lot of incorrect values. One of the most common values to be incorrect is the instruction pointer. In order for it to be right, the instruction pointer would have to update each time it ran an instruction and that is simply too much of a hassle to maintain. Because of dilemmas similar to the one above, many values in the process table are actually static when they have to be dynamic.

Files

Now let's move on from processes to file. Similar to processes, let's construct an API that deals with files. Here is the function declaration for open, a function that opens a file:


1  int open (char const* filename, int flags, extra info)

The above function returns a file descriptor object, but from the point of view of the user's application, the return value function is simply another non-negative integer.

Now let's implement the opposite of open, which is close. Here's the function declaration for this function:


1  int close (int fd)

This function takes in file descriptor fd, which points to the file that we want to close. As a side note, passing in a raw integer to this function crashes it.

Here is an analogy between processes and files to illustrate the roles of open and close.

Processes:	fork	exit
Files:	open	close

Lastly, we have dup, which is short for duplicate. This function creates a new file descriptor.


1  int dup (int);

dup "clones" files similar to fork() for processes, but it isn't entirely similar to fork(), because both the old and new file descriptors refer to the same file, which is aliasing.

Process Table Alteration

To accomodate the files of processes, we add a files section to each record in the process table, containing file descriptors to each of the files each process is working with. Each file descriptor points to another table, which contains information about the file itself. All of this is illustrated in the following diagram:

Final Question of the Day

Q: Does fork clone file descriptors?

A: To be answered in the next lecture

CS111 Operating Systems Principles, UCLA. Paul Eggert. January 30, 2013.