Pinboard (jm)
https://pinboard.in/u:jm/public/
recent bookmarks from jmCrowdView2023-07-31T16:27:10+00:00
https://crowdview.ai/
jmsearch forums searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:ef9a25fcca15/k-Nearest Neighbor (k-NN) similarity search engine with Amazon Elasticsearch2020-03-04T11:48:22+00:00
https://aws.amazon.com/about-aws/whats-new/2020/03/build-k-nearest-neighbor-similarity-search-engine-with-amazon-elasticsearch/
jmAmazon Elasticsearch Service now offers k-Nearest Neighbor (k-NN) search which can enhance search by similarity use cases like product recommendations, fraud detection, and image, video and semantic document retrieval. Built using the lightweight and efficient Non-Metric Space Library (NMSLIB), k-NN enables high scale, low latency nearest neighbor search on billions of documents across thousands of dimensions with the same ease as running any regular Elasticsearch query.
]]>elasticsearch aws knn algorithms similarity searching search nmslibhttps://pinboard.in/https://pinboard.in/u:jm/b:1ac9f1d202a0/Everybody lies: how Google search reveals our darkest secrets | Technology | The Guardian2017-07-10T13:00:53+00:00
https://www.theguardian.com/technology/2017/jul/09/everybody-lies-how-google-reveals-darkest-secrets-seth-stephens-davidowitz?CMP=fb_gu
jmWhat can we learn about ourselves from the things we ask online? US data scientist Seth Stephens‑Davidowitz analysed anonymous Google search results, uncovering disturbing truths about [America's] desires, beliefs and prejudices
Fascinating. I find it equally interesting how flawed the existing methodologies for polling and surveying are, compared to Google's data, according to this]]>science big-data google lying surveys polling secrets data-science america racism searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:368c1e21158e/At the cost of security everywhere, Google dorking is still a thing | Ars Technica2017-02-24T14:46:20+00:00
https://arstechnica.com/security/2016/05/google-dorking-when-pii-and-exploitable-bugs-are-only-a-search-away/
jmdorking google security searching webhttps://pinboard.in/https://pinboard.in/u:jm/b:7fa259dd7da1/The Bkd Tree2016-01-04T10:44:17+00:00
https://medium.com/@nickgerleman/the-bkd-tree-da19cf9493fb#.2z8fzib60
jmsearch lucene bkd-trees searching data-structureshttps://pinboard.in/https://pinboard.in/u:jm/b:d4a21d001bf0/Efficient substring searching2014-03-31T13:44:45+00:00
http://blog.phusion.nl/2010/12/06/efficient-substring-searching/
jmTurbo Boyer-Moore is disappointing, its name doesn’t do it justice. In academia constant overhead doesn’t matter, but here we see that it matters a lot in practice. Turbo Boyer-Moore’s inner loop is so complex that we think we’re better off using the original Boyer-Moore.
A good demo of how large values of O(n) can be slower than small values of O(mn).]]>algorithms search strings coding big-o string-search searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:cad2a9fdecec/How the search for flight AF447 used Bayesian inference2014-03-12T15:33:10+00:00
http://www.bea.aero/fr/enquetes/vol.af.447/metron.search.analysis.pdf
jmmetron bayes bayesian-inference machine-learning statistics via:jgc air-france disasters probability inference searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:e7c127ca54da/feedback loop n-gram analyzer2011-09-29T21:10:15+00:00
http://petermblair.com/fbl-n-gram-analyzer/
jmanti-spam spam fbl feedback filtering n-grams similarity hashing redis searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:00bea3b79665/Dutch grepping Facebook for welfare fraud2011-09-10T13:34:07+00:00
http://www.irishtimes.com/newspaper/world/2011/0910/1224303851410.html
jmgrep dutch holland via:tjmcintyre privacy facebook twitter linkedin welfare dole fraud false-positives searchinghttps://pinboard.in/https://pinboard.in/u:jm/b:6616dc33ebe2/