Wednesday, November 11, 2009

Round-up for Amazon seminar

Some of the previous lectures got involved with professional knowledge, but this lecture is mostly related to complex, specialized knowledge and most likely to be the most professional one. The part about Amazon mainly covers how Amazon deals with the huge amount of data.

Amazon runs a world-wide e-commerce platform that serves tens of millions customers at peak times using tens of thousands of servers located in many data centers around the world. There are strict operational requirements on Amazon’s platform in terms of performance, reliability and efficiency (Dynamo: Amazon’s Highly Available Key-value Store).The Traditional Cloud Data (Cloud computing is Internet based development and use of computer technology) Services are traditionally oriented around Relational Database systems. However, Traditional RDBMS clouds are expensive to maintain, license and store large amounts of data. The solution seems to downgrade some of the service guarantees of traditional RDBMS so that Amazon applies Amazon’s Dynamo (Amazon storage system). According to Wikipedia, dynamo (storage system) is a highly available, proprietary key-value storage system. It has properties of both databases and distributed hash tables (DHTs). It is not directly exposed as a web service, but is used to power parts of other Amazon Web Services. The main advantage of Dynamo is that its client applications can tune the values of N, R and W to achieve their desired levels of performance, availability and durability. The system is used to support many of the most critical elements of Amazon's operation, including shopping-cart processing. Dynamo, as an alternative to rigid relational database systems, has been the underlying storage technology for a number of the core services in Amazon’s e-commerce platform. It was able to scale to extreme peak loads efficiently without any downtime during the busy holiday shopping season (Inside Amazon's dynamo, 2007). And it offers a simple Primary-key based data model and stores vast amounts of information on distributed, low-cost virtualized nodes. The motivation of dynamo is to build a distributed storage system which consists of four factors: scale simple, key-value highly and available and Guarantee Service Level Agreements (SLA).

Even after the lecture and the research for the roundup, I am still confused about some parts of the system, but I can find out that Amazon has put great effort to handle the large amount of data. More important, it succeeds and I think it one of the reasons why Amazon becomes one of the largest e-commerce operations in the world. Actually, I have already done some shopping in Amazon in China, like books, accessories and such stuff. I choose Amazon is because of the convenient, fine shopping booklet which consist of different kinds of goods every month from Amazon, not just because it is cheaper or something else. My point is that Amazon has many advantages over other e-commerce operations and we have many reasons to choose it.
References
Dynamo: Amazon’s Highly Available Key-value Store. (n.d. ). Retrieved November 7, 2009, from http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf
Dynamo (storage system). (n.d.). Wikipedia, the free encyclopedia. Retrieved November 7, 2009,
from
http://en.wikipedia.org/wiki/Dynamo_(storage_system)
Inside Amazon's dynamo. (October 03, 2007). Retrieved November 7, 2009, from
http://www.roughtype.com/archives/2007/10/inside_amazons.php


By Zhu Li and Zhang Haoqiang

No comments:

Post a Comment

Followers