Hi, thanks for your great work!
I’m wondering whether the UPGrad aggregator can incorporate a gradient normalization step. Specifically, is it possible to normalize the task gradients to unit norm(rescale task gradients to have equal magnitudes before aggregation) before aggregation? If yes, could you please suggest how to implement this in the current code framework?