CS111: Scribes Notes for April 26, 2013

Scribe: Phillip Woytowitz

Preemptive Sceduling (Round Robin)

The basic idea here is that with preemptive scheduling (also known as "Round Robin" scheduling), each task that comes in gets a time slot of, lets say, 10 ms. A task gets to run for 10 ms, and once that time interval is finished, another process runs for its 10 ms time slot. Each of the incoming tasks get run in spurts and take their turns.

If A, B, C, and D are incoming tasks, then their execution with preemptive sceduling may look like this:

ABCDABCDACDACDBADAC

What's nice about this is that the average wait time for any task coming in is low.

There are two approaches with regards to Round Robin Sceduling. There is:

"Polite" Round Robin, where new arrivals (tasks) get first dibs via an earlier time slot, and...
"Rude" Round Robin, where new arrivals go to the end of the line

The "Rude" Round Robin would be easier to implement since we have a queueing system already in place. The "Polite" Round Robin approach, in addition to being somewhat more difficult to implement, suffers from the possibility of being able to starve its processes.

Threads, Race Conditions, and Critical Sections

The main problem with threads in general is that they can share memory and step on each others' toes. As software engineers, the problem we must solve is how to handle synchronization, i.e., how to arrange thread execution such that no one steps on anyone else's stuff. Because by default, threads are unsynchronized and WILL stomp of each others' work.

Let's look at some code for a program that manipulates Bill Gates's bank account:

	long long int balance = 0;	//big enough for Bill Gates

	//now for some primitives	
	void deposit (long long int amt)
	{
	  balance += amt;
	}

	bool withdraw(long long int amt)
	{
	  if(amt <= balance)
	  {
	    balance -= amt;
	    return 1;
	  }
	  else
	    return 0;
	}

Let's say that thread 1 wants to call withdraw(10000) and thread 2 wants to call deposit(50000). One thing that could go wrong is that we could lose track of the $50 dollar deposit. If we look at the machine code:

	bool withdraw(long long int amt)	
	{			
 	  if(amt <= balance)	//movq balance, %rax; cmpq %rax, amt
  	{
    	  balance -= amt;	//subq amt, %rax; movq %rax, balance	
      		return 1;
    	}
    	else
     	 return 0;
  	}

The deposit thread can run after the withdraw thread moves its perceived value of balance into its register (movq balance, %rax). By the time it writes back the balance value from its register, it will have returned an outdated one that never saw a $50 dollar deposit! This sort of issue is often referred to as a race condition.

We could try to fix this by declaring balance as a volatile variable, but even this wont prevent the compiler from trying to cache it. Here's an idea:


[[if(amt <= balance)
    {
      balance -= amt;   movq balance %rax]]

Let's try to run this without colliding with another thread

One approach we could take would be to add a machine instruction for every action that want to do that might collide with some other thread (we'll call these actions critical sections). Another option would be to add 2 machine instructions, one to indicate the beginning of the critical senction and one to indicate the end of the critical section. Both can work and both have been used, but the 2nd approach seems more general, so we'll start with this.

Suppose we have these two instructions. Here are some potential problems that can occur while using this approach:

We could begin a critical section but never end it.
- We want to enforce a bounded wait condition on critical sections. This means that if a thread wants to enter a critical section, but it can't, then the amount of time it has to wait is bounded (e.g. by 1 ms).
We could forget to start the critical section entirely. In our earlier example, we actually forgot to put the brackets around the code that needed it(the deposit primitive more specifically).
- We must maintain mutual exclusion. That is, one thread should exclude other threads from accessing the shared resource when accessing the shared resource (in our case, the balance variable).
We could exit ]] a critical section when we're not inside of one (programming error).
Then there's this issue: [[ [[ ]] ]] . Can we nest critical sections? This issue is a bit iffy and debatable, so for now we'll assume it is not allowed.

Let's Play Plumber

Pipe() is usually a system call. These pipes we're making will live in user space (no kernel involvement). Once we have a our data structures and primitives defined for this pipe, we'll have threads operate on them and figure out where the critical sections are in our code.

	enum {N = 1024}	//size of a pipe

	struct pipe {	//1 byte I/O
	  char buf[N];	//buffer
	  unintmax_t R, W;	//total number of reads and writes
	};

	void writec( struct pipe *p, char c)
	{
	  while(pipe_full(p)) continue;
	  p->buf[p->W++%N] = c;
	}

	char readc( struct pipe *p )
	{
	  while(pipe_empty(p) continue;
	  return p->buf[p->R++%N];
	}

	bool pipe_full(struct pipe* p)
	{
		return p->W - p->R == N;
	}

If we have threads operate on this as is. We could probably pinpoint a few bugs. One thing that could happen is that 2 writers may be trying to write to the last byte. Once one reader gets through the pipe_full loop, before he updates the W variable, the other read can also get past the pipe_full loop and write to the buffer, even though it should already be full.

To synchronize our code, we'll bookend critical sections using this datatype and these two primitives:

	typedef ... spinlock_t
	
	void lock (spinlock_t* l);
	void unlock (spinlock_t* l);

When we use these primitives to counter race conditions, we must follow a few rules:

Obey bounded wait condition. We write in the pattern:
- lock()
- small stuff
- unlock()
Can't lock something twice.
- lock() lock()
Can't unlock a resource in case you've already locked it.
- ... no lock done... unlock()

The general strategy is going to be:

look at a bunch of code.
identify a race condition.
fight it by inserting critical sections around the code to avoid race conditions.

Here are a couple problems we must keeping mind when marking critical sections:

code: ----[[========================]]---- making the critical section too long. The other threads will have to wait a long time before they can execute.
code: ---[[====]]==------------[[===]]----- making the critical section too small. This fails to fix the race condition.

We must mark our critical sections to be big enough to prevent the race condition, but not so big that it hurts performance. In effect we are obeying a Goldilocks principle.

So how do we right-size a critical section?

find shared writes. Code: ----[[w]]------[[w]]---[[w]]-------
for each shared write, look for dependent reads. Code: ----[[w====r]]--------[[w====r]]---

Here are some shared writes in our pipe example in the writec function:

W++ incrementing W.
= c; assigning the new buffer value.

And a here is a dependent read:

p->W++ reading the write index of the pipe.

But just putting the lock around the line p->buf[p->W++%N] = c; won't fix the race condition, for the while loop above it is actually a dependent read from the perspective of another thread. We could place another lock around the while loop, but this violates the bounded wait condition and will kill performance. What we must do to fix this is place the lock around the pipe_full check so that it locks and unlocks on each iteration of the loop:

	spinlock_t l;

	void writec(struct pipe* p, char c)
	{
	  for(;;)
	  {
	    lock(&l);
	    if(!pipe_full(p)) 
	      break;	//will maintain the lock needed for the writes
	    unlock(&l);
	  }
	  p->buf[p->w++%N] = c;
	  unlock(&l);
	}

The only problem with this approach is that if we have several threads running different pipes, they'll be bottlenecked by the fact that we have a single lock handling all pipe synchronization. A single lock that handles access to a lot of data is called a coarse-grained lock. To fix this issue, we should use fine-grained locks, where we have more locks each controlling less data. To accomplish this, let's just give each pipe its own lock:

	struct pipe {
	  ...
	  spinlock_t l;
	}

and update our existing lock and unlock calls with lock(&p->l) and unlock(&p->l).

So we've achieved fine-grain locking with our piping mechanism, but there's still an issue. If a pipe is full, we'll eat up a lot of our CPU polling for the lock's availability. We could replace the first unlock with a yield(), allowing another thread to get to work.

But now what we really want is a locking primitive that blocks if the resource isn't available. We'll call it a blocking mutex.

	typedef ... bmutex_t

	void acquire( bmutex_t* b)
	  get lock, but wait if already has lock (give up CPU)
	
	void try_acquire( bmutex_t* b)

	void release(bmutex_t* b)

So let's try to implement this:

	void acquire( bmutex_t* b){
	  while(!try_acquire(b)
	    continue;	//but wait... this in effect is still spinning!
	}

So how do we build these things? Next time...