Forrester Report: MaxCompute One of the World's Leading Cloud-Based Data Warehouses

Summary: Forrester names Alibaba Cloud MaxCompute as one of the world's leading cloud-based data warehouses in the "Cloud Data Warehouse, Q1 2018" report.

March 19, 2018 – Reference News, a Chinese daily newspaper, reported on its official website that the internationally renowned research firm Forrester had released its "Cloud Data Warehouse, Q1 2018" report. The report comprehensively evaluated the primary functions, regional performance, market segments, typical customers, and other features of big data service providers.

Forrester reports are extremely influential within the cloud industry and are often regarded as guidebooks for the CIOs of major international companies. Based on its evaluation criteria, Forrester selected four companies: AWS, Alibaba Cloud, Google, and Microsoft. Alibaba Cloud was the only Chinese tech company selected.

Evaluation Criteria

Cloud-based big data services have been in high demand in recent years because of their security, elastic scalability, rapid deployment, and low cost. Meanwhile, on-premises big data analytics solutions are gradually becoming obsolete. In its evaluation, Forrester required each supplier to meet the following criteria:

1) Sophisticated big data warehouse products
2) Independent big data warehouse solutions
3) Big data use cases
4) Publicly available products
5) A leading position in regional markets
6) Advanced technology

As the only Chinese product selected, MaxCompute received a detailed analysis in the Forrester report. The following sections trace the journey of MaxCompute, the Alibaba Cloud big data processing service.

Evolution of MaxCompute

In 2009, Alibaba reached the Greenplum ceiling: it was difficult to scale Greenplum beyond one hundred hosts and 1000 TB. Even that maximum capacity was far from enough to support a thriving business like Alibaba Cloud.

In September 2009, Alibaba Cloud launched R&D on its Apsara big data platform. The aim was to build MaxCompute, a self-developed data warehouse for data volumes measured in exabytes. It was a slow but enlightening journey for the team at Alibaba Cloud. Only after eight years did the team successfully deploy a global network of clusters, each with over 10,000 servers.

Global Reach

Last year, Forrester reported that although cloud data warehouse (CDW) vendors provided excellent cloud-based features, many of them exhibited deficiencies in areas such as global deployment, data security, integration, modeling, and governance.

However, MaxCompute has consistently improved its global presence, performance, security, end-to-end development experience, and ecosystem.

MaxCompute is currently deployed in 15 regions worldwide, including Hong Kong, Singapore, Japan, Dubai, US West, US East, Australia, Indonesia, and India. It connects millions of servers to form a supercomputer capable of providing computing power to major Internet markets around the globe in the form of online public services.

Increased Performance and Efficient Development

MaxCompute's exabyte-scale processing capability makes it the global leader in the field. In October 2017, MaxCompute completed the world's first public cloud-based 100 TB BigBench big data benchmark test, achieving more than 7830 QPM (queries per minute).

The next-generation big data language NewSQL, which combines the advantages of declarative and imperative coding, breaks through the technical restrictions of the previous SQL language. This unified programming language supports offline, quasi-real-time, stream, graph, machine learning, and other computing modes, as well as unstructured data processing. This greatly lowers the technical barriers to big data development.

Maximum Security

MaxCompute introduces multi-tenant cloud security isolation technology that overcomes the security limitations of traditional big data platforms. This technology refines security boundaries to the user, process, and CPU core levels. MaxCompute authorizes and audits millions of tenants and the tens of billions of tasks they perform each day to ensure financial-grade data security.

Comprehensive Data Modeling, Governance, and Integration

In response to the demand for big data construction, management, and applications in various industries, MaxCompute provides an all-in-one big data capability toolkit covering smart data construction and management for the entire process from data access to data consumption. This toolkit includes DataWorks, MaxCompute Studio, and other tools that help customers construct fully-integrated, asset- and service-oriented, self-optimizing, closed-loop smart data systems with unified standards, capable of driving innovation.

MaxCompute vs. the World

At only $354.7/QPM, MaxCompute provides its customers with a plethora of features comparable to, if not better than, those of similar products from competitors.

At the 2015 Sort Benchmark competition, Apsara set four new GraySort and MinuteSort world records.

  • FuxiSort sorted 100 TB of data in 377 seconds.
  • It used a shared testing environment, dual gigabit NICs, and mechanical hard disks.

At the 2016 Sort Benchmark competition, Apsara set two new CloudSort world records.

  • NADSort sorted 100 TB of data at a cost of $144 ($1.44/TB).

In 2017, MaxCompute adapted to the TPC benchmark and expanded its data scale to 100 TB. In a global first, MaxCompute completed the BigBench big data benchmark test on a public cloud. Its performance exceeded 8200 QPM, establishing MaxCompute as a computing leader not only in China but worldwide.
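
The sort figures quoted above reduce to simple ratios. The following Python sketch uses only the numbers stated in this article (100 TB, 377 seconds, $144) to recompute the average throughput of the FuxiSort run and the per-terabyte cost of the NADSort run. The function and variable names are illustrative and are not part of any benchmark tooling.

```python
# Figures quoted in the article.
DATA_TB = 100            # data volume sorted in both runs
FUXISORT_SECONDS = 377   # FuxiSort elapsed time (2015)
NADSORT_COST_USD = 144   # NADSort total cost (2016)


def throughput_tb_per_s(data_tb: float, seconds: float) -> float:
    """Average sort throughput in TB per second."""
    return data_tb / seconds


def cost_per_tb(total_cost_usd: float, data_tb: float) -> float:
    """Cost in US dollars per TB sorted."""
    return total_cost_usd / data_tb


fuxi_rate = throughput_tb_per_s(DATA_TB, FUXISORT_SECONDS)
nad_cost = cost_per_tb(NADSORT_COST_USD, DATA_TB)

print(f"FuxiSort: {fuxi_rate:.3f} TB/s ({fuxi_rate * 60:.1f} TB/min)")
print(f"NADSort:  ${nad_cost:.2f}/TB")
```

The second line of output reproduces the $1.44/TB figure given for NADSort, and the first shows that sorting 100 TB in 377 seconds corresponds to roughly a quarter of a terabyte per second.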

Compliance and Certifications

Data privacy and security are becoming increasingly important topics in modern society. To meet the needs of customers, Alibaba Cloud MaxCompute has earned 10 industry and security certifications from respected third-party institutions.

Leadership in Regional Markets

As a leading cloud computing vendor, Alibaba Cloud serves 2.3 million customers in over 200 countries and regions around the world. The company holds a 47.6% share of the public cloud market in China, roughly equal to that of all its competitors combined.

MaxCompute is dedicated to providing massive data storage and large-scale computing to its customers in 15 regions around the world, including Hong Kong, Singapore, Japan, Dubai, Europe, US West, US East, Australia, Indonesia, and India. In doing so, it empowers customers throughout the world with Alibaba Cloud's exceptional computing capabilities.

ofo Use Case

Using MaxCompute, the bike-sharing company ofo has started to build data models and perform clustering to optimize its operations. By studying historical transaction data and user flow information, the company calculates the number of bicycles that need to be deployed in each area, identifies where bikes are taken from, and plans how to recover bikes from low-traffic areas and increase deployment in high-traffic areas.
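
To make the clustering idea concrete, here is a minimal Python sketch. It is not ofo's actual model, which the source does not describe: it assumes trip start positions are given as hypothetical (x, y) coordinates, groups them with plain k-means (using a farthest-first choice of starting centroids so the result is deterministic), and counts trips per cluster as a rough demand signal. All data and names are invented for illustration.

```python
from math import dist


def farthest_first(points, k):
    """Pick k starting centroids: the first point, then repeatedly the
    point farthest from the centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, iterations=10):
    """Plain k-means on 2-D points; returns (centroids, labels)."""
    centroids = farthest_first(points, k)
    labels = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [min(range(k), key=lambda i: dist(p, centroids[i])) for p in points]
        # Update step: move each centroid to the mean of its members.
        for i in range(k):
            members = [p for p, label in zip(points, labels) if label == i]
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return centroids, labels


# Hypothetical trip start positions: six near one hub, three near another.
trips = [(0, 0), (1, 0), (0, 1), (1, 1), (0.5, 0.5), (0.2, 0.8),
         (10, 10), (11, 10), (10, 11)]

centroids, labels = kmeans(trips, k=2)
demand = [labels.count(i) for i in range(2)]  # trips per cluster

for center, count in zip(centroids, demand):
    print(f"hub near ({center[0]:.2f}, {center[1]:.2f}): {count} trips")
```

In a real deployment the per-cluster counts would feed a rebalancing plan, for example by moving bikes from clusters with few trip starts to clusters with many.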

In July 2017, ofo upgraded MaxCompute from 1.0 to 2.0. The new version increased the efficiency of offline operations by over 50% and allowed the company to process 32 million daily transactions with great ease. Overall, the upgrade increased the company's operational efficiency by 76%. At the same time, MaxCompute significantly reduced ofo's big data platform O&M costs. Now, the company only needs one part-time O&M employee. Compared to a self-built physical cluster, MaxCompute offers much lower total costs, while greatly increasing the efficiency of application development.

Beijing Genomics Institute (BGI) Use Case

Genetic technology is gradually moving out of the laboratory and into daily life. However, the resulting explosive growth in data volumes far exceeds the capability of traditional computing. In this context, BGI opted for MaxCompute.

In the Million Genomes Project, it takes traditional computing methods 3–5 days to analyze population structures. MaxCompute can complete the entire analysis within one hour, greatly accelerating data throughput and delivery. Structural analysis of the genetic data of one million people is too complex for traditional computing to handle. Using MaxCompute, BGI achieved a technological breakthrough: it computed the genetic distance between one individual and 100,000 others in a matter of hours, while reducing the cost to under $1,000. BGI continues to explore and build on the advantages provided by MaxCompute.
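
The one-to-many distance computation can be sketched in a few lines of Python. The source does not say which distance measure BGI used, so this example assumes a simple one: each genotype is a list of allele counts (0, 1, or 2) per site, and the distance is the mean absolute difference across sites. The sample names and values are hypothetical.

```python
def genetic_distance(a, b):
    """Mean absolute difference in allele counts across sites (0 = identical).

    Assumed measure for illustration only; not necessarily the one BGI used.
    """
    if len(a) != len(b):
        raise ValueError("genotype vectors must cover the same sites")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def distances_to_cohort(query, cohort):
    """Distance from one individual to every individual in a cohort."""
    return {name: genetic_distance(query, genotype) for name, genotype in cohort.items()}


# Hypothetical genotypes over five sites.
query = [0, 1, 2, 0, 1]
cohort = {
    "sample_a": [0, 1, 2, 0, 1],  # identical to the query
    "sample_b": [0, 1, 1, 0, 1],  # differs by one allele at one site
    "sample_c": [2, 1, 0, 2, 1],  # differs by two alleles at three sites
}

result = distances_to_cohort(query, cohort)
nearest = min(result, key=result.get)

for name, value in result.items():
    print(f"{name}: {value:.1f}")
print("nearest:", nearest)
```

Each comparison is independent of the others, which is why this kind of workload parallelizes well across a large cluster: the cohort can be split into chunks and the distances computed on many machines at once.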

Conclusion

In short, Alibaba Cloud MaxCompute provides multi-tenant big data warehousing and hybrid cloud services based on a public cloud. It is rapidly globalizing its services with special attention to the finance, Internet, retail, and e-commerce fields.

MaxCompute is a sophisticated product backed by over nine years of experience. Its capabilities, advanced technology, and comprehensive big data development solutions have earned it a leading position in the CDW market.
