I found this paper when implementing TSMixer in darts and thought it was worth sharing.
As far as I know, it may not be relevant to non-timeseries problems. That being said, does anybody know of instance normalization outside of timeseries? It is obviously similar to batch normalization but is slightly different.
I added RIN to an N-HiTS model using the Energy benchmark dataset and got 21% and 22% reduction in MSE and MAE, respectively. This required two lines of code to be added to the model definition and increased compute time in no noticeable manner.
I read that as hot pickles and got realllllllly confused until I reread it.