counter stats

Scheduling for Routers and OSs:
How to Schedule when you're Blind


People

Adam Wierman
Misja Nuyens
Mor Harchol-Balter
Nikhil Bansal


Motivation

While many computer applications, e.g. web servers, have knowledge of the service demands of requests, routers and operating systems have no knowledge about the remaining lengths of the network flows they schedule. As a result, policies that use job size information, such as Shortest-Remaining-Processing-Time (SRPT), cannot be applied in these applications. However, even when the scheduler cannot use job size information to prioritize, it can use other statistics in order to prioritize jobs that are likely to have small remaining service demands. One such statistic is the age or attained service of a job, which is defined as the amount of service already given to a job. By giving priority to the jobs with the Least Attained Service (LAS), it is possible to achieve good performance under heavy-tailed service distributions (which are common across computer systems). This is because, under the heavy tailed distributions that are characteristic of IP flow durations, jobs that have received a large amount of service are likely to be very large, and thus have remaining sizes that are still large. LAS has been studied since the early 1960s, however there remain many unanswered questions about the policy. Our work is motivated by the following question: How close can the performance of LAS be to the performance of SRPT? In other words, how much does LAS suffer because it is blind?

Results and Impact

We have shown that, under some distributions, LAS loses very little. In particular, when the service distribution is Pareto (or more generally regularly varying), LAS is constant competitive, i.e. has mean response time within a constant of that of SRPT regardless of the system load. Further, in this setting, the tail of the distribution of response times under LAS is asymptotically equivalent to that under SRPT. Due to the prevalence of heavy-tailed distributions in computer systems, results such as these have led to LAS being applied in both routers, e.g. the work of Rai, Urvoy-Keller, and Biersack, and operating systems, e.g. the work of Feng, Misra, and Rubenstein. However, our results also illustrate that one must be careful when applying LAS: Under light-tailed distributions LAS is no longer constant competitive; and further, the tail of the response time distribution is asymptotically heavier than that of SRPT.  In addition, one must be careful about fairness when using LAS.  We have shown that LAS is unfair to medium-large jobs under all service distributions with finite variance, though the degree of unfairness is small as long as the load is not too high.

Publications