Pinboard (jm)
https://pinboard.in/u:jm/public/
recent bookmarks from jm_Building Machine Learning Models Like Open-Source Software_2023-02-08T20:33:59+00:00
https://dl.acm.org/doi/pdf/10.1145/3545111
jm
'This Viewpoint advocates for tools
and research advances that will allow
pre-trained [machine learning] models to be built in the
same way that we build open source
software. Specifically, models should
be developed by a large community of
stakeholders that continually updates
and improves them. Realizing this goal
will require porting many ideas from
open source software development to
the building and training of pre-trained
models, which motivates many new research problems and connections to
existing fields.'
]]>training machine-learning ml ai acm open-sourcehttps://pinboard.in/https://pinboard.in/u:jm/b:cf5e3c746c3e/Scarr2018-07-10T08:38:00+00:00
http://kevinkuchta.com/_site/2018/06/scarr/
jmThere are a bunch of free/cheap options for hosting static sites (just html/css/js) out there: github pages, netlify, firebase hosting - but when I want to build a bulletproof static site "for real", my go-to toolset is S3 for hosting with Cloudfront caching in front of it. I figured that after a few times doing this, I'd automate it. There are a few pre-existing tools for parts of this, but none I could find that did the whole thing from registration through uploading and Cloudfront invalidation.
]]>cli acm aws s3 cloudfront route53 static-sites web html hostinghttps://pinboard.in/https://pinboard.in/u:jm/b:70af85c43c37/'DolphinAttack: Inaudible Voice Commands' [pdf]2018-01-25T13:49:35+00:00
https://arxiv.org/pdf/1708.09537.pdf
jm 20 kHz) to achieve inaudibility. By leveraging the nonlinearity of the microphone circuits, the modulated low frequency audio commands can be successfully demodulated, recovered, and more importantly interpreted by the speech recognition systems. We validate DolphinAttack on popular speech recognition systems, including Siri, Google Now, Samsung S Voice, Huawei HiVoice, Cortana and Alexa. By injecting a sequence of inaudible voice commands, we show a few proof-of-concept attacks, which include activating Siri to initiate a FaceTime call on iPhone, activating Google Now to switch the phone to the airplane mode, and even manipulating the navigation system in an Audi automobile. We propose hardware and software defense solutions. We validate that it is feasible to detect DolphinAttack by classifying the audios using supported vector machine (SVM), and suggest to re-design voice controllable systems to be resilient to inaudible voice command attacks.'
via Zeynep (https://twitter.com/zeynep/status/956520320504123392)]]>alexa siri attacks security exploits google-now speech-recognition speech audio acm papers cortanahttps://pinboard.in/https://pinboard.in/u:jm/b:8d9b9d7c9782/A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications - All Things Distributed2017-10-09T15:29:38+00:00
http://www.allthingsdistributed.com/2017/10/a-decade-of-dynamo.html?__s=gf36pf8g1gjugcqh6ppo
jmA deep dive on how we were using our existing databases revealed that they were frequently not used for their relational capabilities. About 70 percent of operations were of the key-value kind, where only a primary key was used and a single row would be returned. About 20 percent would return a set of rows, but still operate on only a single table.
With these requirements in mind, and a willingness to question the status quo, a small group of distributed systems experts came together and designed a horizontally scalable distributed database that would scale out for both reads and writes to meet the long-term needs of our business. This was the genesis of the Amazon Dynamo database.
The success of our early results with the Dynamo database encouraged us to write Amazon's Dynamo whitepaper and share it at the 2007 ACM Symposium on Operating Systems Principles (SOSP conference), so that others in the industry could benefit. The Dynamo paper was well-received and served as a catalyst to create the category of distributed database technologies commonly known today as "NoSQL."
That's not an exaggeration. Nice one Werner et al!]]>dynamo history nosql storage databases distcomp amazon papers acm data-storeshttps://pinboard.in/https://pinboard.in/u:jm/b:34ce49c8b5d2/Reliable Cron across the Planet - ACM Queue2015-03-16T10:46:14+00:00
http://queue.acm.org/detail.cfm?id=2745840
jmdistributed-systems cron acm paxos distributed-cron master-election distcomp reliabilityhttps://pinboard.in/https://pinboard.in/u:jm/b:24283a054f89/The Network is Reliable - ACM Queue2014-07-24T17:05:39+00:00
http://queue.acm.org/detail.cfm?id=2655736
jmec2 aws networking outages partitions jepsen pbailis aphyr acm-queue acm survey opshttps://pinboard.in/https://pinboard.in/u:jm/b:7ab28a2120f9/Weathering the Unexpected - ACM Queue2012-09-28T09:28:29+00:00
http://queue.acm.org/detail.cfm?id=2371516
jmFailures happen, and resilience drills help organizations prepare for them.
Good write-up on Google's DiRT (Disaster Recovery Test) procedures, clearly based on Amazon's Gameday exercises. ;) See also http://queue.acm.org/detail.cfm?id=2371297 for a moderated discussion including Jesse Robbins and John Allspaw]]>game-day tests disaster-recovery dirt exercises history amazon google etsy resilience acmhttps://pinboard.in/https://pinboard.in/u:jm/b:bad3bb23469f/