文章来源于网络收集而来,版权归原创者所有,如有侵权请及时联系!
参考文献
- Mark Cavage: Just No Getting Around It: You’re Building a Distributed System]( http://queue.acm.org/detail.cfm?id=2482856), ACM Queue, volume 11, number 4, pages 80-89, April 2013. doi:10.1145/2466486.2482856
- Jay Kreps: Getting Real About Distributed System Reliability , blog.empathybox.com, March 19, 2012.
- Sydney Padua: The Thrilling Adventures of Lovelace and Babbage: The (Mostly) True Story of the First Computer. Particular Books, April ISBN: 978-0-141-98151-2
- Coda Hale: You Can’t Sacrifice Partition Tolerance , codahale.com, October 7, 2010.
- Jeff Hodges: Notes on Distributed Systems for Young Bloods , somethingsimilar.com, January 14, 2013.
- Antonio Regalado: Who Coined 'Cloud Computing’? , technologyreview.com, October 31, 2011.
- Luiz André Barroso, Jimmy Clidaras, and Urs Hölzle: The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition , Synthesis Lectures on Computer Architecture, volume 8, number 3, Morgan & Claypool Publishers, July 2013. doi:10.2200/S00516ED2V01Y201306CAC024 , ISBN: 978-1-627-05010-4
- David Fiala, Frank Mueller, Christian Engelmann, et al.: Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing , at International Conference for High Performance Computing, Networking, Storage and Analysis (SC12), November 2012.
- Arjun Singh, Joon Ong, Amit Agarwal, et al.: Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network , at Annual Conference of the ACM Special Interest Group on Data Communication (SIGCOMM), August 2015. doi:10.1145/2785956.2787508
- Glenn K. Lockwood: Hadoop's Uncomfortable Fit in HPC , glennklockwood.blogspot.co.uk, May 16, 2014.
- John von Neumann: Probabilistic Logics and the Synthesis of Reliable Organisms from Unreliable Components , in Automata Studies (AM-34), edited by Claude E. Shannon and John McCarthy, Princeton University Press, 1956. ISBN: 978-0-691-07916-5
- Richard W. Hamming: The Art of Doing Science and Engineering. Taylor & Francis, 1997. ISBN: 978-9-056-99500-3
- Claude E. Shannon: A Mathematical Theory of Communication , The Bell System Technical Journal, volume 27, number 3, pages 379–423 and 623–656, July 1948.
- Peter Bailis and Kyle Kingsbury: The Network Is Reliable , ACM Queue, volume 12, number 7, pages 48-55, July 2014. doi:10.1145/2639988.2639988
- Joshua B. Leners, Trinabh Gupta, Marcos K. Aguilera, and Michael Walfish: Taming Uncertainty in Distributed Systems with Help from the Network , at 10th European Conference on Computer Systems (EuroSys), April 2015. doi:10.1145/2741948.2741976
- Phillipa Gill, Navendu Jain, and Nachiappan Nagappan: Understanding Network Failures in Data Centers: Measurement, Analysis, and Implications , at ACM SIGCOMM Conference, August 2011. doi:10.1145/2018436.2018477
- Mark Imbriaco: Downtime Last Saturday , github.com, December 26, 2012.
- Will Oremus: The Global Internet Is Being Attacked by Sharks, Google Confirms , slate.com, August 15, 2014.
- Marc A. Donges: Re: bnx2 cards Intermittantly Going Offline , Message to Linux netdev mailing list, spinics.net, September 13, 2012.
- Kyle Kingsbury: Call Me Maybe: Elasticsearch , aphyr.com, June 15, 2014.
- Salvatore Sanfilippo: A Few Arguments About Redis Sentinel Properties and Fail Scenarios , antirez.com, October 21, 2014.
- Bert Hubert: The Ultimate SO_LINGER Page, or: Why Is My TCP Not Reliable , blog.netherlabs.nl, January 18, 2009.
- Nicolas Liochon: CAP: If All You Have Is a Timeout, Everything Looks Like a Partition , blog.thislongrun.com, May 25, 2015.
- Jerome H. Saltzer, David P. Reed, and David D. Clark: End-To-End Arguments in System Design , ACM Transactions on Computer Systems, volume 2, number 4, pages 277–288, November 1984. doi:10.1145/357401.357402
- Matthew P. Grosvenor, Malte Schwarzkopf, Ionel Gog, et al.: Queues Don’t Matter When You Can JUMP Them! , at 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2015.
- Guohui Wang and T. S. Eugene Ng: The Impact of Virtualization on Network Performance of Amazon EC2 Data Center , at 29th IEEE International Conference on Computer Communications (INFOCOM), March 2010. doi:10.1109/INFCOM.2010.5461931
- Van Jacobson: Congestion Avoidance and Control , at ACM Symposium on Communications Architectures and Protocols (SIGCOMM), August 1988. doi:10.1145/52324.52356
- Brandon Philips: etcd: Distributed Locking and Service Discovery , at Strange Loop, September 2014.
- Steve Newman: A Systematic Look at EC2 I/O , blog.scalyr.com, October 16, 2012.
- Naohiro Hayashibara, Xavier Défago, Rami Yared, and Takuya Katayama: The ϕ Accrual Failure Detector , Japan Advanced Institute of Science and Technology, School of Information Science, Technical Report IS-RR-2004-010, May 2004.
- Jeffrey Wang: Phi Accrual Failure Detector , ternarysearch.blogspot.co.uk, August 11, 2013.
- Srinivasan Keshav: An Engineering Approach to Computer Networking: ATM Networks, the Internet, and the Telephone Network. Addison-Wesley Professional, May 1997. ISBN: 978-0-201-63442-6
- Cisco, Integrated Services Digital Network , docwiki.cisco.com.
- Othmar Kyas: ATM Networks. International Thomson Publishing, 1995. ISBN: 978-1-850-32128-6
- InfiniBand FAQ , Mellanox Technologies, December 22, 2014.
- Jose Renato Santos, Yoshio Turner, and G. (John) Janakiraman: End-to-End Congestion Control for InfiniBand , at 22nd Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), April 2003. Also published by HP Laboratories Palo Alto, Tech Report HPL-2002-359. doi:10.1109/INFCOM.2003.1208949
- Ulrich Windl, David Dalton, Marc Martinec, and Dale R. Worley: The NTP FAQ and HOWTO , ntp.org, November 2006.
- John Graham-Cumming: How and why the leap second affected Cloudflare DNS , blog.cloudflare.com, January 1, 2017.
- David Holmes: Inside the Hotspot VM: Clocks, Timers and Scheduling Events – Part I – Windows , blogs.oracle.com, October 2, 2006.
- Steve Loughran: Time on Multi-Core, Multi-Socket Servers , steveloughran.blogspot.co.uk, September 17, 2015.
- James C. Corbett, Jeffrey Dean, Michael Epstein, et al.: Spanner: Google’s Globally-Distributed Database , at 10th USENIX Symposium on Operating System Design and Implementation (OSDI), October 2012.
- M. Caporaloni and R. Ambrosini: How Closely Can a Personal Computer Clock Track the UTC Timescale Via the Internet? , European Journal of Physics, volume 23, number 4, pages L17–L21, June 2012. doi:10.1088/0143-0807/23/4/103
- Nelson Minar: A Survey of the NTP Network , alumni.media.mit.edu, December 1999.
- Viliam Holub: Synchronizing Clocks in a Cassandra Cluster Pt. 1 – The Problem , blog.logentries.com, March 14, 2014.
- Poul-Henning Kamp: The One-Second War (What Time Will You Die?) , ACM Queue, volume 9, number 4, pages 44–48, April 2011. doi:10.1145/1966989.1967009
- Nelson Minar: Leap Second Crashes Half the Internet , somebits.com, July 3, 2012.
- Christopher Pascoe: Time, Technology and Leaping Seconds , googleblog.blogspot.co.uk, September 15, 2011.
- Mingxue Zhao and Jeff Barr: Look Before You Leap – The Coming Leap Second and AWS , aws.amazon.com, May 18, 2015.
- Darryl Veitch and Kanthaiah Vijayalayan: Network Timing and the 2015 Leap Second , at 17th International Conference on Passive and Active Measurement (PAM), April 2016. doi:10.1007/978-3-319-30505-9_29
- Timekeeping in VMware Virtual Machines , Information Guide, VMware, Inc., December 2011.
- MiFID II / MiFIR: Regulatory Technical and Implementing Standards – Annex I (Draft) , European Securities and Markets Authority, Report ESMA/2015/1464, September 2015.
- Luke Bigum: Solving MiFID II Clock Synchronisation With Minimum Spend (Part 1) , lmax.com, November 27, 2015.
- Kyle Kingsbury: Call Me Maybe: Cassandra , aphyr.com, September 24, 2013.
- John Daily: Clocks Are Bad, or, Welcome to the Wonderful World of Distributed Systems , basho.com, November 12, 2013.
- Kyle Kingsbury: The Trouble with Timestamps , aphyr.com, October 12, 2013.
- Leslie Lamport: Time, Clocks, and the Ordering of Events in a Distributed System , Communications of the ACM, volume 21, number 7, pages 558–565, July 1978. doi:10.1145/359545.359563
- Sandeep Kulkarni, Murat Demirbas, Deepak Madeppa, et al.: Logical Physical Clocks and Consistent Snapshots in Globally Distributed Databases , State University of New York at Buffalo, Computer Science and Engineering Technical Report 2014-04, May 2014.
- Justin Sheehy: There Is No Now: Problems With Simultaneity in Distributed Systems , ACM Queue, volume 13, number 3, pages 36–41, March 2015. doi:10.1145/2733108
- Murat Demirbas: Spanner: Google's Globally-Distributed Database , muratbuffalo.blogspot.co.uk, July 4, 2013.
- Dahlia Malkhi and Jean-Philippe Martin: Spanner's Concurrency Control , ACM SIGACT News, volume 44, number 3, pages 73–77, September 2013. doi:10.1145/2527748.2527767
- Manuel Bravo, Nuno Diegues, Jingna Zeng, et al.: On the Use of Clocks to Enforce Consistency in the Cloud , IEEE Data Engineering Bulletin, volume 38, number 1, pages 18–31, March 2015.
- Spencer Kimball: Living Without Atomic Clocks , cockroachlabs.com, February 17, 2016.
- Cary G. Gray and David R. Cheriton: Leases: An Efficient Fault-Tolerant Mechanism for Distributed File Cache Consistency , at 12th ACM Symposium on Operating Systems Principles (SOSP), December 1989. doi:10.1145/74850.74870
- Todd Lipcon: Avoiding Full GCs in Apache HBase with MemStore-Local Allocation Buffers: Part 1 , blog.cloudera.com, February 24, 2011.
- Martin Thompson: Java Garbage Collection Distilled , mechanical-sympathy.blogspot.co.uk, July 16, 2013.
- Alexey Ragozin: How to Tame Java GC Pauses? Surviving 16GiB Heap and Greater , java.dzone.com, June 28, 2011.
- Christopher Clark, Keir Fraser, Steven Hand, et al.: Live Migration of Virtual Machines , at 2nd USENIX Symposium on Symposium on Networked Systems Design & Implementation (NSDI), May 2005.
- Mike Shaver: fsyncers and Curveballs , shaver.off.net, May 25, 2008.
- Zhenyun Zhuang and Cuong Tran: Eliminating Large JVM GC Pauses Caused by Background IO Traffic , engineering.linkedin.com, February 10, 2016.
- David Terei and Amit Levy: Blade: A Data Center Garbage Collector , arXiv:1504.02578, April 13, 2015.
- Martin Maas, Tim Harris, Krste Asanović, and John Kubiatowicz: Trash Day: Coordinating Garbage Collection in Distributed Systems , at 15th USENIX Workshop on Hot Topics in Operating Systems (HotOS), May 2015.
- Predictable Low Latency , Cinnober Financial Technology AB, cinnober.com, November 24, 2013.
- Martin Fowler: The LMAX Architecture , martinfowler.com, July 12, 2011.
- Flavio P. Junqueira and Benjamin Reed: ZooKeeper: Distributed Process Coordination. O'Reilly Media, 2013. ISBN: 978-1-449-36130-3
- Enis Söztutar: HBase and HDFS: Understanding Filesystem Usage in HBase , at HBaseCon, June 2013.
- Caitie McCaffrey: Clients Are Jerks: AKA How Halo 4 DoSed the Services at Launch & How We Survived , caitiem.com, June 23, 2015.
- Leslie Lamport, Robert Shostak, and Marshall Pease: The Byzantine Generals Problem , ACM Transactions on Programming Languages and Systems (TOPLAS), volume 4, number 3, pages 382–401, July 1982. doi:10.1145/357172.357176
- Jim N. Gray: Notes on Data Base Operating Systems , in Operating Systems: An Advanced Course, Lecture Notes in Computer Science, volume 60, edited by R. Bayer, R. M. Graham, and G. Seegmüller, pages 393–481, Springer-Verlag, 1978. ISBN: 978-3-540-08755-7
- Brian Palmer: How Complicated Was the Byzantine Empire? , slate.com, October 20, 2011.
- Leslie Lamport: My Writings , research.microsoft.com, December 16, 2014. This page can be found by searching the web for the 23-character string obtained by removing the hyphens from the string
allla-mport-spubso-ntheweb
. - John Rushby: Bus Architectures for Safety-Critical Embedded Systems , at 1st International Workshop on Embedded Software (EMSOFT), October 2001.
- Jake Edge: ELC: SpaceX Lessons Learned , lwn.net, March 6, 2013.
- Andrew Miller and Joseph J. LaViola, Jr.: Anonymous Byzantine Consensus from Moderately-Hard Puzzles: A Model for Bitcoin , University of Central Florida, Technical Report CS-TR-14-01, April 2014.
- James Mickens: The Saddest Moment , USENIX ;login: logout, May 2013.
- Evan Gilman: The Discovery of Apache ZooKeeper’s Poison Packet , pagerduty.com, May 7, 2015.
- Jonathan Stone and Craig Partridge: When the CRC and TCP Checksum Disagree , at ACM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM), August 2000. doi:10.1145/347059.347561
- Evan Jones: How Both TCP and Ethernet Checksums Fail , evanjones.ca, October 5, 2015.
- Cynthia Dwork, Nancy Lynch, and Larry Stockmeyer: Consensus in the Presence of Partial Synchrony , Journal of the ACM, volume 35, number 2, pages 288–323, April 1988. doi:10.1145/42282.42283
- Peter Bailis and Ali Ghodsi: Eventual Consistency Today: Limitations, Extensions, and Beyond , ACM Queue, volume 11, number 3, pages 55-63, March 2013. doi:10.1145/2460276.2462076
- Bowen Alpern and Fred B. Schneider: Defining Liveness , Information Processing Letters, volume 21, number 4, pages 181–185, October 1985. doi:10.1016/0020-0190(85)90056-0 90056-0)
- Flavio P. Junqueira: Dude, Where’s My Metadata? , fpj.me, May 28, 2015.
- Scott Sanders: January 28th Incident Report , github.com, February 3, 2016.
- Jay Kreps: A Few Notes on Kafka and Jepsen , blog.empathybox.com, September 25, 2013.
- Thanh Do, Mingzhe Hao, Tanakorn Leesatapornwongsa, et al.: Limplock: Understanding the Impact of Limpware on Scale-out Cloud Systems , at 4th ACM Symposium on Cloud Computing (SoCC), October 2013. doi:10.1145/2523616.2523627
- Frank McSherry, Michael Isard, and Derek G. Murray: Scalability! But at What COST? , at 15th USENIX Workshop on Hot Topics in Operating Systems (HotOS), May 2015.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论