File System Robustness:
CS111 Lecture 14

By Christian Nersesyan, David Antonio Garcia, Victoria Bernard

Robustness in File Systems

Berkeley Fast File System (BSD)

Ways to evaluate a file system

Say you are the product manager of an operating system. How do you go about choosing a file system for your OS? How can you make sure the FS you choose is robust? One way is to test invariants: boolean expressions that should always be true for the FS.

Invariants for a FS

1. Each block in the FS is used for only one purpose
   Ex. of purposes: superblock, free bitmap, inodes, etc.
2. All referenced blocks are initialized to data appropriate for that type of block.
   Never leave data in an invalid state. For example, don't initialize an inode with a negative file size.
3. All referenced data blocks are marked used in the bitmap.
4. All unreferenced data blocks are marked free in the bitmap.

Let's refer to the above invariants as the rules of our FS.
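
As a rough illustration, a checker for rules 3 and 4 could compare the free bitmap against the set of blocks actually referenced by inodes. This is only a sketch: the in-memory arrays below are hypothetical stand-ins for the on-disk structures, not the real FFS layout.

    /* Sketch: verify rules 3 and 4 for a toy file system.
     * bitmap[b] is true if block b is marked used in the free bitmap;
     * referenced[b] is true if some inode actually points at block b.
     * Both arrays are hypothetical in-memory copies of on-disk data. */
    #include <stdbool.h>
    #include <stdio.h>

    #define NBLOCKS 1024

    bool bitmap[NBLOCKS];
    bool referenced[NBLOCKS];

    void check_bitmap_invariants(void)
    {
        for (int b = 0; b < NBLOCKS; b++) {
            if (referenced[b] && !bitmap[b])
                printf("rule 3 violated: block %d referenced but marked free\n", b);
            if (!referenced[b] && bitmap[b])
                printf("rule 4 violated: block %d marked used but unreferenced\n", b);
        }
    }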

Consequences of breaking these rules

1. Disaster
2. Disaster
3. Disaster
4. We lose space in our FS (a storage leak): a block is permanently marked as used even though nothing references it, so that disk space is wasted.

Rules 1-3 tend to get the highest priority from developers, since breaking them can corrupt the entire FS. Rule 4, although important, has less devastating effects.

Performance Issues

Problem 1 : Cache Coherence

Definition: the contents of the cache unintentionally differ from the data on disk
    write(fd, "hello", 5);
    read(fd, buffer, 5);
Most OSes use a technique called dallying: the kernel delays writing cached data back to disk, so later operations may complete before earlier writes actually reach the disk.
Suppose fd is a regular file on the disk. If the kernel only updates the cache and the system crashes before the cache is flushed, the data is never copied back to the disk.
Why do we care so much?
   - the cache may differ from the disk, causing invalid reads
   - the user is told a write succeeded when it may not actually have reached the disk
   - some side effects may not be saved due to out-of-order writes
Sources of this problem
- Power Loss
- Disk crash
- Sudden disconnection
Potential Solutions
1. Maintain a backup power supply in case of power loss to ensure cache always gets copied to disk.
2. Syncing functions
    close(int fd)
    closes the file referred to by fd, so that fd is no longer a valid reference to that file
    Note: close does not "flush" data buffers, so prior writes to fd are not guaranteed
    to have reached the disk
    fsync(int fd)
    "flushes" data by ensuring any cached data for the file referred to by fd is copied to the disk
    (or other permanent storage device); it does so for the file's data blocks and its inode metadata
    fdatasync(int fd)
    same as fsync, but improves performance by not copying metadata unless that metadata is
    needed for a subsequent data retrieval to be handled correctly
    sync(void)
    same as fsync, but for every file in the cache; everything cached at the time of the call is
    scheduled to be written to disk
Example
  if (fsync(fd) != 0) error();
  if (write(fd, buf, 8192) != 8192) error();
  if (fsync(fd) != 0) error();
The code above has to write out all of the file's cached data blocks plus the inode metadata, slowing performance significantly
  if (fsync(fd) != 0) error();
  if (write(fd, buf, 8192) != 8192) error();
  if (fdatasync(fd) != 0) error();
Now only 8192 / 1024 (the block size) = 8 data blocks have to be copied over, since the inode metadata is skipped
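
A minimal, self-contained version of the fdatasync pattern above might look like this (the file name "data.tmp" and the 8 KiB buffer are just placeholders):

    #include <fcntl.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <unistd.h>

    int main(void)
    {
        char buf[8192] = { 0 };   /* 8192 bytes of data we want to persist */
        int fd = open("data.tmp", O_WRONLY | O_CREAT | O_TRUNC, 0644);
        if (fd < 0) { perror("open"); exit(1); }

        if (write(fd, buf, sizeof buf) != sizeof buf) { perror("write"); exit(1); }
        if (fdatasync(fd) != 0) { perror("fdatasync"); exit(1); }   /* flush data blocks only */
        if (close(fd) != 0) { perror("close"); exit(1); }
        return 0;
    }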

Problem 2 : Renaming

Consider the following situation
    rename("a","b");
    if rename fails because the system crashes, we cannot assume the file was not renamed.
Another situation
  rename("x/a","y/b");
  Renaming across directories must follow these steps (see the sketch after this list):
   i. read the file's inode
   ii. read x's data
   iii. read y's data
   iv. write the file's inode (link count = 2)
   v. write y's new data (the directory now contains b)
   vi. write x's new data (the entry a is removed)
   vii. write the inode again (link count back to 1)
  This still does not preserve all of the file system's invariants, because the link count is
  temporarily wrong. However, this is acceptable: the only consequence is that if we crash while
  the count is 2 and "b" is deleted later, the link count drops to 1 instead of 0, so the file's
  space is never reclaimed and remains forever unused (a storage leak).
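A sketch of this ordering in code, with hypothetical structures and helpers standing in for the real inode and directory-block I/O:

    /* Sketch of the rename("x/a", "y/b") ordering listed above.
     * struct inode, struct dirblock, and all helper functions are
     * hypothetical stand-ins, not a real file system API. */
    struct inode    { int link_count; };
    struct dirblock { char entries[512]; };

    struct inode    *read_inode_of(const char *path);
    struct dirblock *read_dir_data(const char *dir);
    void write_inode(struct inode *ino);
    void write_dir_data(struct dirblock *d);
    void dir_add_entry(struct dirblock *d, const char *name, struct inode *ino);
    void dir_remove_entry(struct dirblock *d, const char *name);

    void rename_x_a_to_y_b(void)
    {
        struct inode    *ino = read_inode_of("x/a");  /* i.   read the file's inode        */
        struct dirblock *x   = read_dir_data("x");    /* ii.  read x's data                */
        struct dirblock *y   = read_dir_data("y");    /* iii. read y's data                */

        ino->link_count = 2;                          /* iv.  write inode, link count = 2  */
        write_inode(ino);
        dir_add_entry(y, "b", ino);                   /* v.   write y's new data (adds b)  */
        write_dir_data(y);
        dir_remove_entry(x, "a");                     /* vi.  write x's new data (drops a) */
        write_dir_data(x);
        ino->link_count = 1;                          /* vii. link count back to 1         */
        write_inode(ino);
    }

A crash between steps iv and vii leaves the link count at 2, which is exactly the temporary invariant violation discussed above.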
Potential Solutions
fsck
  a program that repairs storage leaks: it scans the file system, checks which inodes are in use
  and whether their link counts are correct, and updates the free bitmap accordingly.
  Note: fsck must be idempotent, so if the system crashes while it runs, it can simply be run again.
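
A rough sketch of the link-count part of such a check; the arrays and the repair policy here are illustrative, not fsck's actual implementation:

    /* Sketch of one fsck-style pass: recompute each inode's true link count
     * by walking every directory, compare it to the count stored in the inode,
     * and repair mismatches. The arrays are hypothetical in-memory copies. */
    #define NINODES 1024

    int stored_link_count[NINODES];   /* link count recorded in each on-disk inode   */
    int counted_links[NINODES];       /* directory entries found pointing at inode i */

    void repair_link_counts(void)
    {
        for (int i = 0; i < NINODES; i++) {
            if (stored_link_count[i] != counted_links[i])
                stored_link_count[i] = counted_links[i];  /* trust the directory tree */
        }
    }

Running this pass twice produces the same result as running it once, which is what the idempotence note above requires.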

Robustness Terminology and Theory

3 main goals of a file system
 1. Durability
   Survive failures in the underlying hardware, e.g., losing power
 2. Atomicity
   Changes either all made or not made at all
 3. Performance
   Throughput and latency

Atomicity

How to implement atomicity atop a device where writes aren't atomic?
Golden Rule of Atomicity
Never overwrite the last copy of your data. To follow this rule, the drive keeps 2 copies of the data, along with a commit record that tells us which copy to use.
Trying to Implement Atomicity with 2 commit records.
 Ex: 2 Commit Records
  Both records start with the same content:
   Record 1: A
   Record 2: A
  Begin to write the new value into record 1:
   Record 1: A ?
   Record 2: A A
  Continue:
   Steps:    0 1 2 3 4
   Record 1: A ? B B B
   Record 2: A A A ? B
  Issue: If we lose power (and reboot) at step 3, where one commit record says A
        and the other says B, we cannot tell which copy is garbage and which one
        holds the current data.
Solution:
In order to correctly implement atomicity we need 3 copies of the commit record. Choose the copy of the data that the majority of the commit records point to. In some cases all 3 may disagree (hence the need for 3 copies): this can happen if the computer loses power while the value of the first commit record is being copied to the second. The first record could point to file A, the second would be indeterminate, and the third could point to file B. In that case we go with the first record and thus choose file A.
Ex: 3 Commit logs
    Steps:    0 1 2 3 4 5 6
    Record 1: A ? B B B B B
    Record 2: A A A ? B B B
    Record 3: A A A A A ? B
  Fixed Issue: We always choose the majority, and if all three disagree, as in step 3,
        we choose the first one (explained in more detail above in "Solution:").
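
A sketch of the decision rule; the char values simply name which copy of the data each commit record points to:

    /* Sketch: choose among 3 commit records by majority vote;
     * if all three disagree (as in step 3), trust the first one. */
    char pick_current_copy(char r1, char r2, char r3)
    {
        if (r1 == r2 || r1 == r3)
            return r1;          /* record 1 agrees with another record */
        if (r2 == r3)
            return r2;          /* records 2 and 3 form the majority   */
        return r1;              /* all three disagree: take the first  */
    }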

Lampson-Sturgis Assumptions

  1. Storage writes may fail, or may corrupt other blocks.
  2. A later read can detect the bad block. Thus the computer can avoid reading a block
   with corrupted information, which helps avoid system crashes/glitches (see the sketch below).
  3. Storage can spontaneously decay.
  4. Errors are rare (not everything is going to crash, and not all the storage will
   spontaneously decay at the same time; there are no quick fixes if that happens).
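
Assumption 2 is typically realized by storing a checksum alongside each block and verifying it on every read; a rough sketch (the checksum function here is arbitrary, chosen only for illustration):

    /* Sketch: detect a corrupted block on read (Lampson-Sturgis assumption 2)
     * by comparing a stored checksum against one recomputed from the data. */
    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>

    uint32_t block_checksum(const uint8_t *data, size_t len)
    {
        uint32_t sum = 0;
        for (size_t i = 0; i < len; i++)
            sum = sum * 31 + data[i];     /* simple illustrative checksum */
        return sum;
    }

    bool block_is_valid(const uint8_t *data, size_t len, uint32_t stored)
    {
        return block_checksum(data, len) == stored;
    }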

2 main methods to create a file system based on these assumptions

Idea #1 Commit Record

  We of course keep the golden rule of atomicity in mind when implementing this method. Here are the steps involved when you issue a write command (see the sketch after this list).

  1. Write the data to a temporary file (if we lose power at this stage, it is as if the write never happened).
  2. Once all the data has been written to the temp file, atomically write "COMMIT" to the commit record. We now have 2 copies of the data: the one in the temp file is the new data, and the other is the old data. If we lose power after writing COMMIT, recovery can see that we still need to copy the new data over the old.
  3. Replace the old data with the new. If we lose power here, we start this step all over.
  4. Write "DONE" to commit record, meaning that the write is complete.
Idea #2 Journal (Introduction)

  A journal is a separate file that tracks every change to the file system. Any change to the FS is first logged to the journal. If power is lost while actually writing to the FS itself, we can go back to the journal and restore the file system. Journals are usually implemented as circular buffers: when the buffer gets close to full, the pending journal entries are committed to the FS and the old entries are overwritten with new ones.
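
A minimal sketch of this write-ahead idea: the change is appended to the journal and forced to disk before the corresponding file system block is modified. The entry format and the 512-byte block size are invented for illustration:

    /* Sketch of write-ahead journaling. The journal entry layout is
     * hypothetical; error checks are omitted for brevity. */
    #include <fcntl.h>
    #include <unistd.h>

    struct journal_entry {
        long block_no;          /* which FS block is being changed */
        char new_data[512];     /* its new contents                */
    };

    void journaled_write(int journal_fd, int fs_fd, struct journal_entry *e)
    {
        /* 1. log the intended change; it must reach the disk first */
        write(journal_fd, e, sizeof *e);
        fsync(journal_fd);

        /* 2. apply the change to the file system itself; if we crash here,
         *    replaying the journal redoes this step on recovery */
        lseek(fs_fd, e->block_no * 512, SEEK_SET);
        write(fs_fd, e->new_data, 512);
    }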

References

Fall 2014 Ryosuke Takeuchi scribe notes

http://linux.die.net/man/2/fsync