Speed up data loading for HOP
- The files to read from are now opened only once, instead of being reopened at each iteration.
- A bug in some versions of numpy makes fromfile read far ahead of the requested data, which greatly increases I/O during training. fromfile is replaced by low-level reads.
On our cluster, we observed speedups by a factor of 2 to 5, depending on the machine's I/O load.
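
For illustration, the two changes can be sketched as follows. This is a minimal sketch and not the code in this change: the class name `RecordReader`, the `record_len` parameter, and the file layout (a flat binary file of fixed-length float32 records) are assumptions made for the example. The file is opened once with Python-level buffering disabled, and each record is read directly into a preallocated array instead of going through `numpy.fromfile`.

```python
import numpy as np


class RecordReader:
    """Hypothetical reader for a flat binary file of fixed-length records."""

    def __init__(self, path, record_len, dtype=np.float32):
        self.dtype = np.dtype(dtype)
        self.record_len = record_len
        self.record_bytes = record_len * self.dtype.itemsize
        # Open once and keep the handle for the lifetime of the reader.
        # buffering=0 returns a raw FileIO object with no Python-level buffer.
        self.f = open(path, "rb", buffering=0)

    def read(self, index):
        """Return record number `index` as a 1-D array."""
        out = np.empty(self.record_len, dtype=self.dtype)
        self.f.seek(index * self.record_bytes)
        # Read exactly the bytes of one record into the array's memory.
        n = self.f.readinto(out)
        if n != self.record_bytes:
            raise IndexError(f"record {index} is out of range or truncated")
        return out

    def close(self):
        self.f.close()
```

A reader like this would be created once before the training loop and reused at every iteration, for example `reader = RecordReader(path, record_len=4)` followed by `reader.read(i)` inside the loop.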