Here is performance comparison of the three most upvoted answers using Jupyter notebook. The input is a 1M x 100K random sparse matrix with density 0.001, containing 100M non-zero values:
io.mmwrite / io.mmread
(note that the format has been changed from csr to coo). np.savez / np.load
|