Pinboard (jm)
https://pinboard.in/u:jm/public/
recent bookmarks from jmSorting out graph processing2015-08-25T13:09:14+00:00
https://github.com/frankmcsherry/blog/blob/master/posts/2015-08-15.md
jmIf you wanted to do an iterative graph computation like PageRank, it would literally be faster to sort the edges from scratch each and every iteration, than to use unsorted edges. If you want to do graph computation, please sort your edges.
Actually, you know what: if you want to do any big data computation, please sort your records. Stop talking sass about how Hadoop sorts things it doesn't need to, read some papers, run some tests, and then sort your damned data. Or at least run faster than me when I sort your data for you.
]]>algorithms graphs coding data-processing big-data differential-dataflow radix-sort sorting x-stream counting-sort pagerankhttps://pinboard.in/https://pinboard.in/u:jm/b:00002f1d196e/