Rank pooling sounds amazing! I've been reading this while trying to work out how to state what the derivative of the argmin with respect to a parameter actually represents. The mean example gives good intuition. It is like adding a hyperparameter search in the middle of the network!
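
To make the mean example concrete for myself, here is a rough sketch in plain Python. It assumes the inner problem is y*(x) = argmin_y sum_i (y - x_i)^2, whose solution is the mean of x, and it compares the implicit-function-theorem derivative dy*/dx_i = -f_xy / f_yy against a finite-difference estimate. The function names (`argmin_y`, `implicit_grad`, `finite_diff_grad`) are my own, not from the post, and the gradient-descent inner solver is only a stand-in for whatever solver the network would actually use.

```python
def argmin_y(x, steps=200, lr=0.1):
    """Inner problem: y* = argmin_y sum_i (y - x_i)^2, solved by gradient descent."""
    n = len(x)
    y = 0.0
    for _ in range(steps):
        grad = 2.0 * sum(y - xi for xi in x)  # df/dy
        y -= lr * grad / n                    # step scaled by n for stable convergence
    return y


def implicit_grad(x):
    """dy*/dx_i = -f_xy / f_yy, evaluated without differentiating through the solver."""
    n = len(x)
    f_yy = 2.0 * n  # second derivative of f with respect to y
    f_xy = -2.0     # mixed derivative d^2 f / (dx_i dy), identical for every i
    return [-f_xy / f_yy for _ in x]


def finite_diff_grad(x, eps=1e-6):
    """Numerical check: nudge each x_i and re-solve the inner problem."""
    base = argmin_y(x)
    grads = []
    for i in range(len(x)):
        bumped = list(x)
        bumped[i] += eps
        grads.append((argmin_y(bumped) - base) / eps)
    return grads


x = [1.0, 2.0, 6.0]
print(argmin_y(x))          # close to the mean of x
print(implicit_grad(x))     # each entry is 1/n
print(finite_diff_grad(x))  # should roughly agree with the line above
```

Each input pulls the minimizer by 1/n, which is exactly what you would get by differentiating the mean directly. The point is that the implicit formula never looks inside the solver loop, so the same idea should carry over to inner problems without a closed form.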

