Require: p(T) : distribution over tasks Require: α, β : step size hyperparameters 1: randomly initialize θ 2: while not done do 3: Sample batch of tasks 4: for all do 5: Evaluate with respect to K examples 6: Compute adapted parameters with gradient descent: 7: end for 8: Update 9: end while |