Approximation Algorithms for 2-Stage Stochastic 
Scheduling Problems 

David B. Shmoys1⋆ and Mauro Sozio2⋆⋆ 

1 School of ORIE and Dept. of Computer Science, Cornell University, Ithaca, NY 14853 

shmoys@cs.cornell.edu 

2 Dept. of Computer Science, University of Rome “La Sapienza”, Italy. 

sozio@di.uniroma1.it 

Abstract. There has been a series of results deriving approximation algorithms 
for 2-stage discrete stochastic optimization problems, in which the probabilistic 
component of the input is given by means of “black box”, from which the algorithm 
“learns” the distribution by drawing (a polynomial number of ) independent 
samples. The performance guarantees proved for such problems, of course, 
is generally worse than for their deterministic analogue. We focus on a 2-stage 
stochastic generalization of the problem of fnding the maximum-weight subset 
of jobs that can be scheduled on one machine where each job is constrained to 
be processed within a specifed time window. Surprisingly, we show that for this 
generalization, the same performance guarantee that is obtained for the deterministic 
case can be obtained for its stochastic extension. 
Our algorithm builds on an approach of Charikar, Chekuri, and P´al: one frst designs 
an approximation algorithm for the so-called polynomial scenario model 
(in which the probability distribution is restricted to have the property that there 
are only a polynomial number of possible realizations of the input that occur with 
positive probability); then one shows that by sampling from the distribution via 
the “black box” to obtain an approximate distribution that falls in this class and 
approximately solves this approximation to the problem, one nonetheless obtains 
a near-optimal solution to the original problem. Of course, to follow this broad 
outline, one must design an approximation algorithm for the stochastic optimization 
problem in the polynomial scenario model, and we do this by extending a 
result of Bar-Noy, Bar-Yehuda, Freund, Naor, and Schieber. 
Furthermore, the results of Bar-Noy et al. extend to a wide variety of resource-
constrained selection problems including, for example, the unrelated parallel-

P 


machine generalization R|rj | wj Uj and point-to-point admission control routing 
in networks (but with a different performance guarantee). Our techniques can 
also be extended to yield analogous results for the 2-stage stochastic generalizations 
for this class of problems. 

⋆ Research supported partially by NSF grants CCR-0635121 & DMI-0500263. 

⋆⋆ This work was done while this author was a visiting student at Cornell University. The work 
was partially supported by NSF grant CCR-0430682 and by EC project DELIS. 


1 Introduction 

Consider the following 2-stage stochastic optimization problem: there are n users, each 
of whom might request a particular communication channel, which can serve at most 
one user at a time, for a specifed length of time within a specifed time interval; for 
a given planning period, it is not known which of the n users will actually make their 
request – all that is known is a probability distribution over the subsets of users indicating 
which subset might be active; each user has an associated proft for actually being 
scheduled on the channel; alternatively, the manager of the channel can redirect the 
user to other providers, thereby obtaining a specifed (but signifcantly smaller) proft; 
the aim is to decide which users to defer so as to maximize the expected proft over 
the two stages (where the expectation is with respect to the probability distribution over 
subsets of active users). Thus, this is a stochastic generalization of the (maximization 
version) of the single machine scheduling problem that is denoted in the notation of 

P 


[4] as 1|rj | wj Uj and we shall refer to this generalization as the 2-stage stochas-

P 


tic 1|rj | wj Uj . For the deterministic version of this problem, Bar-Noy, Bar-Yehuda, 
Freund, Naor, & Schieber give a ρ-approximation algorithm for any constant ρ> 2; 
rather surprisingly, we show that the exact same result holds for the stochastic generalization. 
(A ρ-approximation algorithm for an optimization problem is a (randomized) 
polynomial-time algorithm that fnds a feasible solution with (expected) cost within a 
factor of ρ of optimal.) 

Recently, there has been a series of results for 2-stage discrete stochastic optimization 
problems with recourse, starting with the work of Dye, Stougie, and Tomasgard[3] 
that addressed a knapsack-like single-node network provisioning problem. That paper 
made the simplifying assumption of the polynomial scenario model in which there are 
(only) a polynomial number of scenarios that can be realized in the second stage, and 
thereby derived the frst worst-case performance guarantees for polynomial-time algorithms 
for models of this type. Kong & Schaefer [8] gave an 2-approximation algorithm 
for a 2-stage variant of the the maximum-weight matching problem, again in a polynomial 
scenario model. Later, Immorlica, Karger, Minkoff, and Mirrokni [7], and also 
Ravi and Sinha [9] addressed analogous questions based on deterministic problems 
such as the vertex cover problem, the set covering problem, the uncapacitated facility 
location problem, and network fow problems. The former paper also considered 
the situation when the probability distribution conformed to an independent activation 
model which, in our setting for example, would mean that there is a probability associated 
with each user and the active set is drawn by assuming that these are independent 
Bernoulli random events. However, for these latter results they introduced the proportionality 
assumption in which the corresponding costs for an element in the two stages 
had constant ratio λ for all elements. Gupta, P´al, Ravi, and Sinha [5] proposed a much 
more general mechanism for specifying the probability distribution, in which one has 
access to a black box from which to generate independent samples according to the 
distribution, and thereby make use of a polynomial number of samples in the process 
of computing the frst-stage decisions. They gave constant approximation algorithms 
for a number of 2-stage stochastic optimization problems in this model, most notably 
the minimum-cost rooted Steiner tree problem and the uncapacitated facility location 
problem, but they also require the proportionality assumption. 


Shmoys & Swamy [10] gave an LP-rounding technique, and showed that one could 
derive a polynomial-time approximation scheme for the exponentially-large linear programming 
relaxations in order to derive the frst approximation algorithms in the black 
box model without the proportionality assumption, in particular for a variety of set 
covering-related problems, the uncapacitated facility location problem, and multi-commodity 
fow problems. Swamy & Shmoys [11] extend this to constant-stage models, 
and also show that the so-called sample average approximation yields a polynomial 
approximation scheme for the LP relaxations. Charikar, Chekuri, and Pal´ [2] gave a 
general technique based on the sample average approximation that, for a broad class of 
2-stage stochastic minimization problem with recourse, in effect reduced the problem 
of obtaining a good approximation algorithm for the black box model, to the problem 
of obtaining the analogous result in the polynomial scenario setting. 

We build on these results, by frst constructing an approximation algorithm for our 
maximization problem in the polynomial scenario model, and then derive a maximization 
variant of the result of [2] (but still specialized to our class of problems) to obtain 
approximation algorithms in the black box probability model. 

We focus on the central model in the class proposed by Bar-Noy, Bar-Yehuda, Freund, 
Naor, and Schieber [1], who gave primal-dual algorithms for a rich class of deterministic 
resource allocation and scheduling problems. In their terminology, there is 
a set of activities, {A1,..., An}; let N = {1,...,n} index this set. For each activity 
Aj , j ∈N , there is a set of possible instances Aj that specify the various ways 
in which the activity might be handled (so, in the description above, assuming integer 
data for the input times, for each user we have one instance for each possible integer 
starting time that would have it complete by the deadline). This approach appears to 
convert the original input to a new input in which there are a pseudopolynomial number 
of instances for each activity. However, Bar-Noy et al. also show how to convert their 
pseudopolynomial-time algorithm into a polynomial-time one, while losing only a 1+ǫ 
factor in the performance guarantee. 

Our algorithm is a rather natural extension of the approach of Bar-Noy et al. We 
frst run their algorithm on each of the polynomially many scenarios, where the proft 
of selecting an instance is its contribution to the overall expected second stage proft. 
For each scenario (which is, after all just an ordinary deterministic input), this generates 
a feasible dual solution. The deterministic dual variables are of two types: those that are 
dual to the constraint that says that each activity is scheduled in at most one way (that 
is, at most one instance of each activity is selected); and those that correspond to the 
constraint that at each time at most one instance (over all activities) is active. The usual 
interpretation of dual variables leads us to view the former as providing the marginal expected 
proft attainable by having this activity on hand in a particular scenario. Thus, we 
decide to defer an activity Aj , if the total of the corresponding dual variables, summed 
over all scenarios, is less than the proft collected by actually deferring that activity. 
This gives the stage I actions. The stage II actions for each scenario are computed by 
adapting the algorithm of Bar-Noy et al.; we frst compute a dual solution that includes 
even the deferred activities, but then does not select any instance of a deferred activity 
in constructing the primal solution. 


The analysis of our algorithm is also surprisingly simple, and is based on a primal-
dual approach using an integer programming formulation of the 2-stage problem. We 
show that the dual solutions constructed in each scenario can be pieced together to 
yield a feasible solution for the dual to the linear programming relaxation, and can 
then show that the expected proft of the primal solution constructed is at least half the 
value of the feasible dual solution found. This yields that the resulting algorithm is a 
2-approximation algorithm. Like the algorithm of Bar-Noy et al., this is a pseudopolynomial-
time algorithm, but an approach identical to the one they employed yields a 
polynomial-time algorithm, while losing a factor of 1+ ǫ in the performance guarantee. 
Although we focus on this single-machine scheduling model, our approach can be 
generalized to yield analogously strong results for 2-stage stochastic generalization of 
the class of problems for which the framework of Bar-Noy et al. applies. This will be 
discussed in detail in the full version of this paper. 

There are other potential 2-stage stochastic extensions of the problem of computing 
a maximum-weight subset of jobs that can be feasible scheduled. One other natural 
approach is to use the frst stage to make initial decisions about which users to service 
(but to commit to serve them if they are active), and then to allow the possibility of 
serving additional users in the second stage, once the probabilistic choice of scenario 
has been made (with correspondingly lesser proft). We show that the maximum independent 
set problem can be reduced to an extremely restricted special case of this 
model in an approximation-preserving way, and hence we cannot hope to obtain a good 
approximation algorithm for this setting (unless P = NP). There are few (if any) such 
strong inapproximability results known for stochastic optimization problems for which 
their deterministic analogue is relatively easily approximable. 

2 IP & LP formulations: 2-stage stochastic models 

We start by giving a natural integer (linear) programming formulation (and its dual) for 

P 


the 2-stage stochastic version of 1|rj | j wj Uj , in its pseudopolynomial-sized variant. 

Let S be a collection of explicitly given scenarios {S1,...,Sm} that occur with 
positive probability; in each scenario S, for each activity Aj , there is an associated set 
of available instances Aj (S) ⊆ Aj . For each instance I, there is an associated starting 
time s(I), and an associated ending time e(I). For each scenario S ∈S, there is an 

P 


associated probability q(S), where q(S) ≥ 0 and S2S q(S)=1. In stage I, we must 

I

decide which activities to defer, and thereby obtain a (small) proft of pj , or else retain 

II 

for stage II, in which for each scenario S we can obtain a proft pj (I,S) for assigning 
this activity using instance I ∈ Aj (S). We give an integer programming formulation 
of this problem. For each activity Aj , we have a 0-1 variable xj that indicates whether 
activity Aj is deferred in the frst phase or not (where xj =1 means that it is deferred). 
For each instance I of activity Aj (S), we have a variable yj (I,S) whose value is 1 if 
and only if instance I of this activity is scheduled. Let T be the set of all start-times and 
end-times of all instances belonging to all activities and let TI = {t ∈T|s(I) ≤ t< 
e(I)} for each instance I. Moreover, let f(I) ∈T be maximal such that f(I) <e(I). 

We can formulate the 2-stage problem of maximizing the total expected proft as 
follows: 


X 
I XX 
X 
II 

max pj xj + q(S)pj (I,S)yj (I,S) (SIP) 

j2N j2N S2S I2Aj (S) 

X 


s.t. xj + yj (I,S) 1 8j 2N,S 2S, (1) 

I2Aj (S) 

XX 


yj (I,S) 1 8S 2S,t 2T, (2) 

j2N I2Aj (S):t2TI 

xj ,yj (I,S) 2{0, 1}, 8j 2N,S 2S,I 2Aj (S). (3) 

Let (SLP) be the LP obtained by replacing (3) by non-negativity constraints for 
these variables. If we let uj (S) be the dual variables corresponding to the constraints 
(1), and let vt(S) denote the dual variables corresponding to the constraints (2), then 
we can write the LP dual of (SLP) as: 

X 
X 
X 
X 
min uj (S) + vt(S) (SD) 
j2N S2S S2S t2T 
X 
Is.t. uj (S) pj , 8j 2N, (4) 
S2S 
X 
II uj (S) + vt(S) q(S)pj (I, S), 8j 2N, S 2S, I 2Aj (S), (5) 
t2TI 
uj (S), vt(S) 0. (6) 

It is important to note that our algorithm will not need to solve any of these linear 
programs! We will simply apply an algorithm for the deterministic variant (for which a 
performance guarantee relative the optimal value of the deterministic LP is known) to 
an input based on each scenario S ∈S, and then use the linear programs to analyze the 
performance of the resulting algorithm. 

3 An algorithm for the polynomial scenario model 

We shall show how to adapt the primal-dual algorithmic framework of Bar-Noy, Bar-
Yehua, Freund, Naor, & Schieber [1] to yield an approximation algorithm with the 

P 


identical performance guarantee for the 2-stage stochastic variant of 1|rj | wj Uj , in 
the polynomial scenario model. For this model, it is straightforward to derive a constant 
approximation algorithm. The simplest approach is to randomize, and with probability 
1/2 to defer all jobs, and otherwise, to run the 2-approximation algorithm of Bar-Noy 
et al. on the active jobs in the second stage; this is a randomized 4-approximation algorithm. 
In the polynomial scenario model, one can improve upon this by comparing the 
beneft of deferring all users with the expected proft obtained by the Bar-Noy algorithm 
based on not deferring anyone, and then selecting the better of the two. This is easily 
shown to be a 3-approximation algorithm (and can be extended to the black box model 
while losing only a factor of 1+ ǫ). Thus, the surprising aspect of our result is that it is 


in fact possible to obtain an algorithm for the 2-stage generalization without degrading 
the performance guarantee at all. 

The framework of Bar-Noy et al. works in two phases: a pushing phase in which 
a dual solution is constructed along with a stack of instances that might be selected 
to be scheduled; and a popping phase in which elements of the stack are popped off, 
and accepted for scheduling provided that they do not confict with activities already 
scheduled by this procedure. 

The algorithm for the 2-stage problem proceeds as follows. For each scenario S ∈ 

II 

S, the deterministic proft pj (I) is q(S)pj (I,S) for each j ∈N , and each I ∈ Aj (S). 
We execute the pushing procedure of the algorithm proposed in Bar-Noy et al. for each 
scenario S ∈S. Algorithm 1 shows the pseudocode for this procedure. We let uj (S) 
denote the dual variable corresponding to the deterministic analogue of (1) computed 
by this procedure. Then, for each activity Aj , j ∈N , we check if 

X 


I 

pj ≥ uj (S), (7) 

S2S 
and defer each activity Aj that satisfes this condition. This completes the frst stage 
action. We shall also denote this solution by setting x¯ j =1 for each deferred activity 
Aj , and setting x¯ j =0 otherwise. 

In what follows, we shall say that an instance I is uncovered if constraint (5) for 
instance I is not satisfed and we say that I is tight if this constraint is satisfed with 
equality. 

For the second stage, for a given scenario S ∈S, we recompute the execution of the 
pushing procedure. Then we compute a feasible schedule by executing the popping procedure 
of the algorithm of Bar-Noy et al., but we delete each activity that was deferred 
in the frst phase. We denote this solution by setting y¯ j (I,S)=1 for each scheduled 
instance I, and setting y¯ j (I,S)=0 otherwise. Algorithm 2 shows the pseudocode for 
the second phase for a given scenario. 

The main intuition behind the deferring rule is the following. Suppose at the end of 
the pushing phase the total value of variables u of an activity Aj is “small”. There are 

I

two possible reasons for this. The total proft of all instances of Aj is smaller than pj . 
In this case, it is clear that deferring the activity is the best we can do. If the total proft 

I

P of instances of Aj is greater than pj , then since u is “small”, there are many other 
instances of other activities which are in confict with instances of Aj . Hence, P can 
be “replaced” by the proft of these instances, and we can gain other proft by deferring 
Aj . More generally, the value of the sum refects the total expected marginal value of 
the activity Aj ; if this is less than the (sure) proft gained by deferring it, then certainly 
deferring it is a good thing to do. 

We shall prove that the performance guarantee of the two-phase algorithm is 2. The 
main idea behind this proof is the following. Each instance increases the total value 
of the dual variables by some amount 2δ. For instances that belong to a non-deferred 
activity, we are able to charge δ to a scheduled instance. For instances that belong to a 
deferred activity, we charge this amount to the proft gained by deferring that activity. 

Given a scenario S we say that I ∈ Aj (S) and Iˆ ∈ Al(S) are incompatible if j = l 
or their time intervals overlap. For each instance I ∈ Aj (S), we refer to the variables 
which occur in the constraint (5) for I, as “the variables of I”. 


Algorithm 1 Pushing procedure for the frst phase in scenario S 

1: Stack(S)=;; 
2: uj (S)0 8j 2N; 
3: vt(S)0 8t 2T; 

4: while no uncovered instance is left do 

5: select an uncovered instance I 2Aj (S),j 2Nwith minimum end-time; 

6: push(I,Stack(S)); 

7: let (I,S)=(q(S)pj 
II(I,S) − P 
vt(S))/2;

uj (S) − t2TI 

8: uj (S) uj (S)+ (I,S); 

9: vf (I)(S) vf(I)(S)+ (I,S); 

10: end while 

Algorithm 2 The algorithm for the second phase in scenario S 

1: /* pushing procedure */ 

2: Stack(S)=;; 
3: uj (S)0 8j 2N; 
4: vt(S)0 8t 2T; 

5: while no uncovered instance is left do 

6: select an uncovered instance I 2Aj (S),j 2Nwith minimum end-time; 

7: push(I,Stack(S)); 

8: let (I,S)=(q(S)pj 
II(I,S) −uj (S) − P 
t2TI 
vt(S))/2; 

9: uj (S) uj (S)+ (I,S); 

10: vf (I)(S) vf(I)(S)+ (I,S); 

11: end while 

12: /* scheduling procedure */ 

13: while Stack(S) is not empty do 

14: I=pop(Stack(S)); 

15: Let j 2N: I 2Aj (S); 

16: if Aj is not deferred and I is not in confict with other scheduled instances then 

17: schedule I and set y¯ j (I,S)=1; 

18: end if 

19: end while 

P

Theorem 1. For the 2-stage stochastic maximization version of 1|rj | wj Uj , there is 
a (2 + ǫ)-approximation algorithm in the polynomial scenario model. 

Proof. We shall consider only the version of the problem in which we have a pseudopolynomial 
representation of the input: that is, for each activity, we have an explicitly 
given set of allowed starting times. However, for each scenario, this is exactly the 
algorithm of Bar-Noy et al. (on a carefully constructed input), who show that it can be 

P 


converted to run in polynomial time for 1|rj | wj Uj , while losing a factor of 1+ ǫ 
in the performance guarantee. This will thereby yield the theorem in the form stated 
above. 

Let u¯ j (S) and v¯ t(S) be the value of the dual variables u and v at the end of the 
algorithm. First consider the constraints (5); the algorithm ensures that these are satisfed 
by the dual solution computed. This is a consequence of the fact that as long as 
there exists an uncovered instance, the algorithm pushes an instance in the stack and 


increases its dual variables making a constraint (5) tight. Hence, at the end of the algorithm, 
there does not exist an uncovered instance, and each constraint (5) is satisfed. 
On the other hand, constraint (4) can be violated by any deferred activity. In order to 
satisfy this constraint, we increase the value of dual variables in the following way. Let 

X 


I

δj = pj − u¯ j (S) j =1,...,n 
S2S 

¯

and let S ∈S, be an arbitrarily chosen scenario. For each activity Aj , we increase the 
value of u¯ j (S) by δj . Clearly, this maintains that the other constraints are satisfed, and 
ensures that constraint (4) is satisfed now as well. 

We now prove the performance guarantee of the algorithm is 2. The essence of the 
proof is as follows. In each scenario S, for each instance I of a non-deferred activity, 
we charge δ(I,S) to some scheduled instance. For each instance I of a deferred activity 

I

Aj , we charge δj and δ(I,S) to the proft pj . Hence, at the end of the algorithm, all 
amounts δ are “charged” to some proft. Moreover, the sum of all these δ, multiplied by 
2, gives a bound on the total value of the dual variables. The theorem then follows from 
weak duality. 

Consider a scenario S. Let Iˆ ∈ Aj (S) be an instance scheduled in S such that Aj 
is not deferred, j ∈N . Let Bˆ(S) be a set which contains Iˆ and as well as instances 

I 

that are: 

– incompatible with Iˆ and 
– pushed onto Stack(S) before Iˆ. 


Consider each instance I in Bˆ(S). When I is placed on the stack, there are two 

I 

dual variables that are increased by δ(I,S). For each such I, one of these two variables 
are variables of Iˆ. If I ∈ Aj (S), then the variable uj (S) occurs in constraint (5) for Iˆ. 
Otherwise, since e(Iˆ) ≥ e(I), then the variable vf (I)(S) occurs in this constraint. Let 

uˆ and vˆ be the value of dual variables u and v at the time Iˆ is pushed in the stack. We 
have that: 

XX 


II 

δ(I,S) ≤ uˆj (S)+ vˆt(S) ≤ qS pj (I,Sˆ) (8) 

ˆ 

ˆ

ˆ

(S)

II 

where last inequality follows from the fact that Iˆ is uncovered before being pushed on 
the stack and after that, its variables are increased in order to make constraint (5) tight. 

Note that each instance I of a non-deferred activity belongs to the set Bˆ(S) for 

I 

some instance Iˆ. This follows from the fact that either I is scheduled or there is another 
instance Iˆ pushed after I in the stack, which has been scheduled instead of I. This 
implies that for each scenario S ∈S 

XX 
XXX 


δ(I,S)= δ(I,S) 

(S)

I 

I2B 

t2T 

j2N : I2Aj (S) j2N :ˆ I2B

I2Aj (S): 

x¯ j =0 x¯ j =0 

yj (ˆ

I,S)=1 

XX 


II 

≤ qS pj (ˆ I,S)

I,S)¯yj (ˆ (9) 

j2N :ˆ

I2Aj (S) 

x¯ j =0 


For each deferred activity Aj , we have that: 

XX 
X 


I

δj + δ(I,S)= u¯ j (S)= pj (10) 

S2S I2Aj (S) S2S 

By combining Equation (9) and Equation (10), we obtain 

 
 


XX 
XXX 
XX 


 
 


 


 j + (I,S)= (I,S)+ j + (I, S) 

 
 


j2N S2S S2S j2N : I2Aj (S) j2N : S2S 
I2Aj (S) x¯ j =0 x¯ j =1 I2Aj (S) 

XXX 
X 


 qS pj 
II(I,S)¯yj (I,S)+ pj 
I 

S2S j2N : I2Aj (S) j2N : 
x¯ j =0 x¯ j =1 

X 
XX 


 pj 
I x¯ j + q(S)pj 
II(I, S)¯yj (I,S) (11) 

j2N j2N S2S 
I2Aj (S) 

Since the initial value of each dual variable is zero, and each instance I ∈ Aj (S) 
increases the total value of the dual variables by at most 2δ(I,S), we can sum over all 
such δ to bound the total value of the dual variables: 

 
 
XX 
XX 
XXX 
u¯ j (S)+ v¯ t(S) 2  j + (I,S) 
(12) 

j2N S2S S2S t2T j2N S2S I2Aj (S) 

Equations (11) and (12), together with the weak duality theorem, immediately imply 
the claimed result. 

4 An algorithm for the black box model 

We show next that we can adapt the algorithm derived in the previous section for the 
polynomial scenario setting to the black box model, where the probability distribution is 
specifed only by allowing access to an oracle from which independent samples according 
the distribution can be drawn. We show that applying the previous algorithm to an 
approximate version of the distribution based on sampling can be shown to still yield 
the same performance guarantee. Our analysis uses the structure of the analysis used 
for the previous algorithm, and builds on the general result for minimization 2-stage 
stochastic problems derived by Charikar, Chekuri, and P´

al [2]. 

We shall make use of the following version of the Chernoff bound. 

Lemma 1. Let X1,...XN be independent random variables with Xi ∈ [0, 1] and let 

PN

X = Xi. Then, for any ǫ ≥ 0, we have Pr [|X − E[X]| > ǫN] ≤ 2 exp(−ǫ2N).

i=1 

II 

We assume that there is an infation factor λ ≥ 1 such that pj (I,S) ≤ λpj 
I, ∀j ∈ 
N , ∀S ∈S, ∀I ∈ Aj (S). 


The algorithm frst takes a polynomial-sized sample from the set of scenarios and 
then proceeds just as the Algorithm 1 in Section 3 while using a slightly different deferring 
rule. 

More precisely, it takes N = Θ(λ2 
log n ) independent random samples S1,...,SN

ǫ2 γ 

from the black box, where n is the number of activities, ǫ will be the allowed additional 
relative error, and γ is the confdence parameter (that is, we shall obtain that the desired 
approximation is found with probability at least 1− γ). Then the algorithm executes the 
pushing procedure (see Algorithm 1) for each scenario that occurs in the polynomial 
sample. Observe that the data used by this algorithm for scenario S is described to 
be q(S)pj 
II(I,S). At frst glance, this might be worrying, but of course the value q(S) 
is just a uniform scalar multiple for all profts, and so it makes sense to defne u˜ and 

II 

v˜ as the dual variables computed after executing this algorithm with inputs pj (I,S). 
Observe that the values u¯ and v¯ for a scenario S from our exact distribution are equal 
to q(S)˜u and q(S)˜v, respectively. Given ǫ> 0, we shall defers an activity Aj , j ∈N , 
if and only if: 

N 

X 


I

(1 + ǫ)pj ≥ 
1 
u˜j (Si) (13) 

N 

i=1 

This is the deferring rule for the black box model. 

This concludes the description of the frst stage action. For the second stage, for 
a given scenario S ∈S, we execute Algorithm 2 for scenario S. (Again, note that 

II 

the linearity effect of q(S) implies that we can run the algorithm with inputs pj (I,S) 
instead.) 

Let us analyze the performance guarantee of this algorithm. The proof proceeds by 
showing that, under the assumption that there is an infation factor λ, equation (13) is a 
good approximation for equation (7). This approach is inspired by the proof in [2] for 
“low scenarios”. 

Theorem 2. For any ǫ> 0 and γ> 0, with probability at least 1 − γ, the proposed 
deferring rule is a (2 + ǫ)-approximation algorithm for the 2-stage stochastic variant 

P 


of the problem 1|rj | wj Uj in the black box model. 

Proof. Suppose we run Algorithm 1 in each of the exponentially-many scenarios and 
let u¯ and v¯ be the value of dual variables computed in this way. Consider activity Aj . 
Let 

N 

XX 
X

1 

r = u¯ j (S)= q(S)˜uj (S) rˆ= u˜j (Si). 

N 

S2S S2S i=1 

We will prove that, with “high” probability, rˆ is “close” to r. We can view rˆ as the arithmetic 
mean of N independent copies Q1,...,QN of the random variable Q defned as 

Q = u˜j (S). 

P 


Note that E[Q]= r. Let Yi be the variable Qi/M where M = λpj 
I and let Y = i Yi. 
Note that for each activity Aj and for each scenario S ∈S, there exists some I ∈ Aj (S) 

P

II N

such that u˜j (S) ≤ pj . This implies that Yi ∈ [0, 1]. Moreover, Y = Qi/M = rˆ

iM 


P 


N

and E[Y ]= E[Qi]/M = r. By applying the Chernoff bound, we obtain the 

iM 

following: 

 


hi

ǫǫ2 
γ 

Pr |Y − E[Y ]| >N ≤ 2 exp − N ⇔ Pr |r − rˆ| > ǫpj 
I ≤ , (14) 

λλ2 n 

where the last inequality follows from the choice of the value of N. By taking the 
union bound over all activities, we obtain that r is “close” to rˆ for all activities, with 
probability at least 1 − γ. 

We use the same argument as we used in the polynomial scenario model to show that 
constraint (5) is satisfed. Consider constraint (4) for some scenario; it may be violated 
by any activity. We show that it is satisfed, with high probability, by a non-deferred 
activity. For a deferred activity, we shall increasing the value of its dual variables, as 
we did in the polynomial scenario model so that the corresponding constraint is also 
satisfed with high probability. (It is important to note that this increase in the dual 
variables is not performed by the algorithm; it is only used for the analysis.) 

For each deferred activity Aj , let 

X 


δj = pj 
I − u¯ j (S) j =1,..., N 
S2S 

and let S ∈S be an arbitrarily selected scenario. We increase the value of u¯ j (S) by 
δj for each deferred activity Aj . From the fact that r is a good approximation of rˆ, it 
follows that, for each activity Aj , if 

N 

X

1 

I 

u˜j (Si) ≤ (1 + ǫ)pj ,

N 

i=1 

then with probability at least 1 − γ, 

X 


I 

u¯ j (S) ≤ (1 + 2ǫ)pj . (15) 

S2S 

This implies that with high probability, for each deferred activity Aj 

XX 
X 


I

δj + δ(I,S)= u¯ j (S) ≤ (1 + 2ǫ)pj (16) 

S2S I2Aj (S) S2S 

In a similar way, if for an activity Aj 

N 

X

1 
u¯ j (Si) > (1 + ǫ)p I 

j

N 

i=1 

then with probability at least 1 − γ, it follows that 

X 


I 

u¯ j (S) >pj . 
S2S 


Hence, the new solution is dual feasible with high probability. Note that Equation (16) 
is an approximation to Equation (10). This implies that by replacing this new equation 
in the previous proof we obtain 

XXXX 
X 


I

¯¯

uj (S) + vt(S) ≤ 2(1 + 2ǫ) j ¯p xj + 
j2N S2S S2S t2T j2N 
X 
X 
+ 2(1 + 2ǫ) X 
II q(S)pj (I, S)¯yj (I, S), (17) 

j2N S2S I2Aj (S) 

which completes the proof. 

5 An NP-hardness of approximation result 

We show that, in contrast to the results of the previous sections, another natural 2-stage 

P 


stochastic generalization of the problem 1|rj | wj Uj (even in a very simple case) can 
not be approximated. Suppose that in the frst phase, we select a set of activities that 
we are committed to serve. In the second phase, for a given scenario, we must schedule 
exactly one instance of each activity selected in the frst phase, and we may augment 
this solution by scheduling other instances of additional activities. We wish to maximize 
is the total expected proft (where it is now natural to assume that the proft obtained 
for an instance in the second phase is less than the corresponding proft in the frst). We 

P 


will refer to this problem as the augmentation 2-stage stochastic 1|rj | wj Uj . 

An integer programming formulation for this problem is obtained by changing (SIP) 
in the following way: a 0-1 variable xj indicates (with value 1) that activity Aj is 
selected in the frst phase; constraint (1) is replaced by the following two constraints: 

X 


yj (I,S) ≥ xj ∀S ∈S,j ∈N : Aj (S) 6= ∅ (18) 

I2Aj (S) 

X 


yj (I,S) ≤ 1 ∀j ∈N ,S ∈S (19) 

I2Aj (S) 

Unfortunately, it is straightforward to show that selecting a feasible set of activities 
in the frst phase can be used to model the maximum independent set problem. This is 
formalized in the following lemma. 

Lemma 2. If there is a ρ-approximation algorithm for the augmentation 2-stage sto-

P 


chastic 1|rj | wj Uj , then there is a ρ-approximation algorithm for maximum independent 
set problem. 

Proof Sketch. We give an approximation-preserving reduction from the maximum independent 
set problem. Given a graph G, we build the following input for the aug-

P 


mentation 2-stage stochastic 1|rj | wj Uj . For each vertex vj , there is an activity Aj , 
j =1,...,n, each activity is always released at time 0, has deadline time 1, and takes 
one time unit to complete; each activity has frst-stage proft 1, and second-stage proft 


0. For each edge ei =(vj ,vk), there is a scenario Si in which only the activities Aj 
and Ak are active. Each scenario Si occurs with positive probability, and hence our 
frst stage selection must contain at most one of the endpoints of ei. Thus, there is a 
one-to-one correspondence between independent sets in G and feasible frst-stage decisions. 
Furthermore, the objective function value of any frst-stage decision is exactly the 
number of activities selected (since the second stage does not contribute any expected 
proft). Hence, we see that the two optimization problems are identical. 

From Lemma (2) and the result in [6] we obtain the following theorem. 

Theorem 3. For any ǫ> 0, there does not exist a polynomial-time algorithm that ap-

P 


1/2−ǫ

proximates the augmentation 2-stage stochastic 1|rj | wj Uj within a factor n , 
unless P = NP. 

References 

1. A. Bar-Noy, R. Bar-Yehuda, A. Freund, J. Naor, and B. Schieber. A unifed approach to approximating 
resource allocation and scheduling. Journal of the ACM, 48:1069–1090, 2001. 
2. M. Charikar, C. Chekuri, and M. Pal.´ Sampling bounds for stochastic optimization. In 
Proceedings of APPROX-RANDOM 2005, pages 257–269, 2005. 
3. S. Dye, L. Stougie, and A. Tomasgard. The stochastic single resource service-provision 
problem. Naval Research Logistics, 50:869–887, 2003. 
4. R. L. Graham, E. L. Lawler, J. K. Lenstra, and A. H. G. Rinnooy Kan. Optimization and 
approximation in deterministic sequencing and scheduling: A survey. Ann. Discrete Math., 
5:287–326, 1979. 
5. A. Gupta, M. P´


al, R. Ravi, and A. Sinha. Boosted sampling: approximation algorithms for 

stochastic optimization. In Proceedings of the 36th Annual ACM Symposium on Theory of 

Computing, pages 265–274, 2004. 

6. J. H° Clique is hard to approximate within n 1−ǫ Acta Mathematica, 182:105–142, 
astad. . 
1999. 


7. N. Immorlica, D. Karger, M. Minkoff, and V. S. Mirrokni. On the costs and benefts of procrastination: 
approximation algorithms for stochastic combinatorial optimization problems. 
In Proceedings of the 16th ACM-SIAM Symposium on Discrete Algorithms, pages 691–700, 
2004. 
8. N. Kong and A. J. Schaefer. A factor 1/2 approximation algorithm for two-stage stochastic 
matching problems. European Journal of Operational Research, 172:740–746, 2006. 
9. R. Ravi and A. Sinha. Hedging uncertainty: Approximation algorithms for stochastic optimization 
problems. In D. Bienstock and G. Nemhauser, editors, Integer Programming and 
Combinatorial Optimization: 10th International IPCO Conference, number 3064 in Lecture 
Notes in Computer Science, pages 101–115. Springer-Verlag, 2004. 
10. D. B. Shmoys and C. Swamy. Stochastic optimization is (almost) as easy as deterministic 
optimization. In Proceedings of the 45th Annual Symposium on Foundations of Computer 
Science, pages 228–237, 2004. 
11. C. Swamy and D. B. Shmoys. The sampling-based approximation algorithms for multi-stage 
stochastic optimization. In Proceedings of the 46th Annual Symposium on Foundations of 
Computer Science, pages 357–366, 2005.