Tag Archives: Riak CS

Why We Built Riak CS

August 15, 2013

With the launch of Riak CS 1.4, several members of the Basho team have been approached with the question “Why did you build Riak CS?”

When we open sourced Riak CS in March of 2013, the conversation focused on the importance of the community of developers with whom we engage, and participating with this community in a more open fashion.

However, understanding the history of a product can be just as important as understanding the logic behind our go-to-market strategy.

Put simply, Basho is a distributed systems company.

As a company that started with Riak, an open source distributed database, we had an immediate, targeted focus on high availability, fault-tolerance, and linear scalability. These core properties of our database implementation are, in fact, consistent themes to consider when building any distributed system. And as Riak and Riak Enterprise gained traction in the market, several customers began to use their Riak implementation to store larger objects.

With this and other customer feedback in mind, we prototyped Riak CS, which offers all of the benefits of Riak, while also adding the features and functionality required to power large object storage in public or private clouds as well as providing reliable storage for applications and services.

As we built upon this initial prototype, guided by both distributed systems themes and customer input, we added an S3-compatible API to Riak CS. This provided a solution for service providers that wanted to offer S3-compatible storage and for customers that wanted to adopt a hybrid-cloud approach to address data sovereignty or redundancy concerns. We also added OpenStack Object Storage API compatibility with the latest Riak CS 1.4 release. Riak CS can now easily interact with multiple IaaS providers, which helps expand our potential user base for both the open source and enterprise product.

However, regardless of feature decisions – either present or in the future – our commitment to providing robust, resilient distributed storage remains.


"What’s New in Riak CS 1.4" Webcast

August 14, 2013

Next week, we will be hosting the “What’s New in Riak CS 1.4” webcast. Join us on Friday, August 23rd at 11am PT/2pm ET for a free webcast that will discuss the new features and updates announced with the latest release. You can sign up for this 30-minute webcast here.

In addition to looking at the 1.4 updates, this webcast will discuss the basics of Riak CS and Riak CS Enterprise, while also providing some common use cases and user stories.

Register now for the “What’s New in Riak CS 1.4” webcast and learn more about this new release here.


Basho Unveils Riak CS 1.4, Driving Distributed Cloud Storage Innovation for Public and Private Clouds

New Release Adds OpenStack Integration, Simplifies Management, and Boosts Multi-Datacenter Replication Speed

August 13, 2013 – CAMBRIDGE, MA – Basho Technologies, the leader in distributed systems software, announced today the availability of Riak CS 1.4 and Riak CS 1.4 Enterprise. Riak CS 1.4 continues Basho’s commitment to provide cloud storage software that is simple to operate, highly available by design, and compatible with industry cloud standards. Riak CS is used by organizations worldwide to power their public and private clouds.

Riak CS 1.4 introduces formal integration with OpenStack, provides enhanced performance and manageability, includes community requests, and improves performance at scale. Riak CS 1.4 Enterprise significantly boosts the performance of multi-datacenter replication by allowing for concurrent channels, so replication throughput can scale with the available network capacity and cluster size.

“Riak CS is seeing impressive market adoption, especially from service providers looking to increase their portfolio offering with large object storage,” said Greg Collins, president and CEO of Basho Technologies. “This release continues our commitment of providing simple and accessible cloud storage for a broad range of cloud computing platforms and use cases. With the addition of OpenStack integration and significant performance improvements, Riak CS 1.4 also appeals strongly to enterprises building their own object storage or adopting a hybrid-cloud deployment methodology.”

“Object storage is quickly becoming a foundational platform capability for cloud providers and large enterprises to meet the rapidly growing surge in demand to store more data,” said Simon Robinson, vice president of storage research at 451 Research. “Riak CS continues to see greater adoption in public and private clouds. Riak CS’s tighter integration with OpenStack is certain to be another catalyst for Basho. OpenStack users gain a very capable storage alternative to Swift, OpenStack’s object storage platform.”

“Yahoo! JAPAN has been using Riak CS for over a year to power our public cloud storage platform,” said Shingo Saito, cloud product manager at Yahoo! JAPAN. “Riak CS is also used by LOHACO, for its on-line shopping platform, operated by ASKUL Corporation, Yahoo! JAPAN partner, and by some of the largest companies in Japan. We are excited to continue to partner with Basho and look forward to deploying Riak CS 1.4.”

“Redapt is excited to work with Basho to help customers address distributed object storage needs within OpenStack environments,” said David Cantu, co-founder and COO at Redapt. “Redapt’s mission is to enable leading service providers, enterprises, and web centric companies with the ability to achieve the numerous economic and operational benefits of private cloud computing. With the Riak 1.4 announcement, Basho is helping us deliver on that commitment for our customers with proven distributed cloud storage software that is now more finely tuned for integration with OpenStack.”

“Businesses have a range of object storage needs and our partnership with Basho helps us easily address even the most complex scenarios in our public cloud,” said Jared Wray, CTO of Tier 3. “Our global data center footprint enables businesses of all sizes to adopt object storage for a variety of use cases including: cloud-native and cross-device apps, backups and archives, and secure file transfer. The improved performance and simplified operations available with Riak CS 1.4 continue to help our customers simply scale to meet operational demand.”

Major Feature Additions of Riak CS 1.4 include:

  • Built-in integration with OpenStack. Riak CS 1.4 introduces support for OpenStack’s Keystone authentication service and introduces compatibility with OpenStack Object Storage API.
  • Improved performance of large bucket query operations. Secondary indexing pagination, introduced with Riak 1.4, allows for significant performance improvements of large bucket query requests.
  • Simplified operational management. Improvements to the User API allow operators greater flexibility in managing Riak CS user information, while also improving the agility and responsiveness of Riak CS.
  • Decreased bandwidth for object block retrieval. Changes to how Riak CS handles object block retrieval will decrease intracluster bandwidth by 67% and improve download performance.

Riak CS 1.4 Enterprise adds the following:

  • Enhanced multi-site replication performance. Riak CS 1.4 Enterprise allows for concurrent channels of communication between clusters, which greatly enhances the capability for replication by taking advantage of all the network’s available resources.

Riak CS 1.4 is available for Debian, Ubuntu, FreeBSD, Mac, Red Hat Enterprise Linux, Fedora, SmartOS, and Solaris.

To view the latest technical documentation or to download Riak CS, visit docs.basho.com/riakcs/latest/.

To view a feature comparison with OpenStack Swift, visit docs.basho.com/riakcs/latest/references/appendices/comparisons/Riak-Compared-to-Swift/.

To view a feature comparison with EMC Atmos, visit docs.basho.com/riakcs/latest/references/appendices/comparisons/Riak-Compared-to-Atmos/.

To request a trial license of Riak CS Enterprise, sign up for a Riak CS Tech Talk at http://info.basho.com/SignUpRiakTechTalk.html.

About Basho
Basho is a distributed systems company dedicated to making software that is highly available, fault-tolerant, and easy to operate at scale. Basho’s distributed database, Riak, and Basho’s cloud storage software, Riak CS, are used by fast-growing Web businesses and by over 25 percent of the Fortune 50 to power their critical Web, mobile, and social applications and their public and private cloud platforms.

Riak and Riak CS are available open source. Riak Enterprise and Riak CS Enterprise offer enhanced multi-datacenter replication and 24×7 Basho support. For more information, visit basho.com. Basho is headquartered in Cambridge, Massachusetts and has offices in London, San Francisco, Tokyo and Washington DC.

Contact Information:
Alex Gutow
Basho Technologies

Morgan Mathis
Highwire PR
415-963-4174 x37

Riak CS 1.4 is Now Available

August 13, 2013

The release of Riak CS 1.4, Basho’s open source cloud storage software, adds a number of performance improvements as well as OpenStack integration and simpler user management. Riak CS is being used by companies all over the world to build public and private clouds, and as reliable storage to power various applications.

One of the biggest additions with Riak CS 1.4 is the integration with OpenStack, broadening our relationship with the open source community. This integration supports OpenStack’s Keystone authentication service and the OpenStack Object Storage API, which allows OpenStack users the means to integrate Riak CS for object storage in an OpenStack deployment.

The Riak CS Users API provides an interface for user creation and management. This release also improves this API to give operators greater flexibility in managing user information. Additionally, this release benefits from ongoing refactoring and reorganization efforts aimed at improving the agility and responsiveness of Riak CS.

Riak CS 1.4 takes advantage of some changes made in Riak 1.4 to provide performance improvements to Riak CS users. First, Riak CS 1.4 features improved performance of listing the contents of large buckets by taking advantage of secondary index pagination in Riak. Riak CS 1.4 also leverages a new option for object block retrieval, which decreases intracluster bandwidth by 67%. This improves the download performance when handling many concurrent requests. These features can be independently enabled, but are disabled by default to accommodate users not using Riak 1.4 with Riak CS. See the documentation for more details.
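The post does not say where the 67% figure comes from. One reading that is consistent with the number, offered here purely as an assumption, is that each block is stored on three replicas and that a read which used to pull the block from all three now pulls it from one. Under that assumption the arithmetic works out as follows (the function name and the replica counts are illustrative, not taken from the release notes):

```python
def bandwidth_reduction(copies_read_before, copies_read_after):
    """Fraction of intracluster block traffic saved when fewer copies are read."""
    return 1 - copies_read_after / copies_read_before


# Assumption: three replicas read per block before, one after.
reduction = bandwidth_reduction(3, 1)
print(f"{reduction:.0%}")
```

Reading one copy instead of three moves one third of the data, which is a saving of two thirds.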

Riak CS Enterprise is the commercial extension of Riak CS, which adds multi-datacenter replication and 24/7 support. The 1.4 release improves replication performance by increasing storage efficiencies and adding multiple TCP connections between clusters.

In addition to the features and upgrades listed above, many bugs were harmed in the making of this release. For a full list of what is included in Riak CS 1.4, check out our code at Github.com/basho or review the release notes. To learn even more, join our live webcast, “What’s New in Riak CS 1.4” on August 23rd.


Basho and Open Source

July 23, 2013

This week is O’Reilly OSCON, a conference dedicated to all things open source. Basho is a sponsor, and Basho engineer Eric Redmond will be delivering a presentation entitled “Distributed Patterns In Action”.

Basho first open sourced Riak in 2009. It’s a decision that helped us grow our business, and become a leader in newer, agile enterprise environments. Our participation in the open source community benefits our culture, our development process, and our business.

In honor of OSCON, we thought it important to explore the commercial aspects of our open source decision.

The Business of Open Source

Open source is in the DNA of our company, with both Riak and Riak CS available under the Apache 2 license. (It is worth noting that these products are but a few of our open source contributions, which also include Webmachine and Lager.) To turn this great code into a business, we chose to stay true to our roots as a software company, instead of just selling services. The enterprise versions of Riak and Riak CS offer the entirety of our open source software, with the addition of multi-datacenter replication and monitoring capabilities.

The decision to sell licenses to the enterprise, rather than to rely just on services, makes Basho unique. It allows us to engage with our enterprise customers in the transformation of their application architecture. They can be confident in the software’s availability and in Basho’s commitments to support them – as customers. Enterprises need an alternative to traditional database vendors, but one that can still fit — in license structure, operational management, and process integration — into a traditional organization.

Our licensing model for Riak Enterprise and Riak CS Enterprise lets us balance agility with tradition. Our community helps us develop groundbreaking software, while the enterprise license helps corporate IT and Operations sleep at night.

Open source drives adoption (a concept discussed at length in Stephen O’Grady’s book The New Kingmakers). That means Riak is used across many different industries, powering thousands of applications. That commercial validation — our success in production deployments — is accelerated due to the open source availability.

We remain keenly aware, and tremendously appreciative, that our community (from the individuals to the large organizations) guides Riak and Riak CS updates, and has been crucial to the refinement and forward momentum of this software.

Basho’s success is open source’s success. Our strengths reside both in our team and in our community, as their combined efforts improve our technology and its utilization. We are excited to see what other open source showcases are in view at OSCON 2013.

Greg Collins

Tier 3 Object Storage: Powered by Riak CS

June 19, 2013

Today, Tier 3 announced the availability of their global cloud object storage product, powered by Riak CS. You can find the entirety of the release in our News Section entitled “Tier 3 Launches Global Cloud Object Storage.”

In particular, we are keenly interested in the unique geographic footprint that Tier 3 maintains. In conversations with customers, press, and analysts, we frequently hear people discussing “geo-data locality.” This phrase is typically used to express a desire to address regulatory compliance or to improve the end-customer experience through low latency (in the case of mobile applications).

With the Tier 3 release, their geographic footprint — in addition to maximizing availability — leverages the inherent replication present in Riak CS to pre-determine the physical locations of specific data.

For geo-data locality, requests can be load balanced across geographies, with geo-based client requests directed to the appropriate datacenter. For example, US-based requests can be served out of a Tier 3 US-based datacenter, while EU-based requests can be served out of a Tier 3 European datacenter. For situations where not all data needs to be shared across all datacenters (or if certain data, such as user data, must only be stored in a specific geographic region to provide low-latency response and address privacy regulations), Riak CS Enterprise’s multi-datacenter replication can be configured on a per-bucket basis so only shared assets, popular assets, etc. are replicated.


Top Five Questions About Riak CS

May 1, 2013

This post looks at five commonly asked questions about Riak CS – simple, available, open source storage built on top of Riak. For more information, please review our full documentation, or sign up for an intro to Riak CS webcast on Friday, May 10.

What is the relationship between Riak and Riak CS?

Riak CS is built on top of Riak, exposing higher-level storage functions including large object support, an S3-compatible API, multi-tenancy, and per-user storage and access statistics. Riak itself provides the replication, availability, fault-tolerance, and underlying storage functions for the Riak CS implementation. Riak and Riak CS should both be installed on every node in your cluster. While Riak and Riak CS could be run on separate virtual or physical nodes, running them on the same machine minimizes intra-cluster bandwidth usage and is the recommended approach. As with Riak, we advise a minimum 5-node cluster.

When an object is uploaded to Riak CS, it is broken up into smaller chunks, which are then streamed, stored, and replicated in the underlying cluster. A manifest is maintained for each object; it records which blocks comprise the object and is used to retrieve all blocks and present them to the client on read. In addition to running Riak and Riak CS on each node, Stanchion, a request serializer, must be installed on at least one node in the cluster. This ensures that global entities, such as users and buckets, are unique in the system.
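The chunk-and-manifest idea described above can be sketched in a few lines. This is a toy illustration, not Riak CS's implementation: the in-memory dictionary stands in for the underlying cluster, and the block size, block naming, and manifest fields are all assumptions made for the example.

```python
import hashlib

BLOCK_SIZE = 4  # bytes; deliberately tiny so the example is easy to follow (assumption)


def store_object(key, data, block_store, block_size=BLOCK_SIZE):
    """Split data into fixed-size blocks, store each one, and return a manifest."""
    block_ids = []
    for seq, offset in enumerate(range(0, len(data), block_size)):
        block_id = f"{key}:{seq}"  # hypothetical block naming scheme
        block_store[block_id] = data[offset:offset + block_size]
        block_ids.append(block_id)
    return {
        "key": key,
        "content_length": len(data),
        "content_md5": hashlib.md5(data).hexdigest(),
        "blocks": block_ids,
    }


def read_object(manifest, block_store):
    """Follow the manifest to fetch every block in order and reassemble the object."""
    return b"".join(block_store[block_id] for block_id in manifest["blocks"])


blocks = {}  # stands in for the replicated block storage
manifest = store_object("photos/puppy.jpg", b"hello riak cs", blocks)
restored = read_object(manifest, blocks)
```

The manifest is the only thing a reader needs in order to find and order the blocks, which is why it is kept per object.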

What use cases does Riak CS support that Riak doesn’t?

Riak CS has several features that are not provided in the standalone Riak database. One of the most obvious differences is in the size of objects supported. Riak CS exposes large object support, and includes multi-part upload so you can upload objects as a series of parts. This allows single objects to reach into the terabyte range. In Riak, the data model is simply key/value; in Riak CS, the key/value model provides the underlying structure for higher-level storage semantics – users, buckets and objects. The Riak CS interface is an S3-compatible HTTP API, allowing you to use existing S3 libraries and tools. In contrast, Riak exposes an HTTP and protobufs API and offers many language-specific clients. Unlike Riak, Riak CS is multi-tenant, with the concept of “users” and per-user reporting on storage and access. This makes it a fit both for private cloud scenarios with multiple internal users and as a foundation for a public cloud storage offering.

How do multi-tenancy, authentication and reporting work?

Riak CS exposes an interface for user creation, disablement and credential management. Riak CS can be set so that only administrators can create new users. Administrators also have special privileges including being able to retrieve a list of all users in the system and query the user account information of any user. Once issued credentials, users are able to authenticate, create buckets, upload and download files, retrieve account information, obtain new credentials, or disable their account through the API. Riak CS supports the standard S3 authentication scheme, with support for header and query string authorization.
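For readers unfamiliar with header and query string authorization, here is a minimal sketch. It assumes the scheme in question is the S3 "signature version 2" flow, in which an HMAC-SHA1 over a canonical string is sent either in an Authorization header or as query parameters. The credentials and resource path are made up, and the sketch leaves out the x-amz-* header canonicalization that a complete client would need.

```python
import base64
import hashlib
import hmac
from urllib.parse import urlencode


def sign_request(secret_key, verb, resource, date, content_md5="", content_type=""):
    """Return the base64 HMAC-SHA1 signature over the string to sign.

    Simplification: canonicalized x-amz-* headers are omitted (assumed absent).
    """
    string_to_sign = "\n".join([verb, content_md5, content_type, date]) + "\n" + resource
    digest = hmac.new(
        secret_key.encode("utf-8"), string_to_sign.encode("utf-8"), hashlib.sha1
    ).digest()
    return base64.b64encode(digest).decode("ascii")


def authorization_header(access_key, secret_key, verb, resource, date):
    """Header authorization: the signature travels in the Authorization header."""
    return f"AWS {access_key}:{sign_request(secret_key, verb, resource, date)}"


def presigned_query(access_key, secret_key, verb, resource, expires):
    """Query string authorization: an expiry timestamp takes the place of the date."""
    signature = sign_request(secret_key, verb, resource, str(expires))
    return urlencode(
        {"AWSAccessKeyId": access_key, "Expires": expires, "Signature": signature}
    )


# Hypothetical credentials and resource, for illustration only.
header = authorization_header(
    "EXAMPLE_KEY_ID", "example-secret", "GET",
    "/my-bucket/photo.jpg", "Tue, 27 Mar 2007 19:36:42 +0000",
)
query = presigned_query(
    "EXAMPLE_KEY_ID", "example-secret", "GET", "/my-bucket/photo.jpg", 1175139620
)
```

In practice an existing S3 client library or tool performs this signing, so application code rarely needs to build these strings by hand.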

Riak CS exposes storage, usage and network statistics that support use cases like accounting, subscription, billing or multi-group utilization for public or private clouds. Riak CS will report information on how much storage a user is consuming and the network operations related to access. This data is exposed via an HTTP interface and can be queried on the default timespan “now” or as a range from start time through end time. Access statistics are reported as bytes in and bytes out for both object and bucket operations. Reporting of this information can be scheduled for a set interval or manually triggered.

What’s the difference between Riak CS and Riak CS Enterprise?

Riak CS Enterprise provides multi-datacenter replication on top of Riak CS. For multi-datacenter replication in Riak CS, global information for users, bucket information and manifests are streamed in real-time from a primary implementation to a secondary site so global state is maintained across locations. Objects can then be replicated in either full sync or real-time sync mode. The secondary site will replicate the object as in normal operations. Additional datacenters can be added in order to create availability zones or provide additional data redundancy and locality. Riak CS Enterprise can also be configured for bi-directional replication. Riak CS Enterprise also comes with 24/7, enterprise-level support. More information and pricing can be found here, and full technical information is available on our docs portal. Ready to get started? Sign up for a developer trial of Riak CS Enterprise.

What are your plans for integration of Riak CS with open source compute solutions?

Riak CS provides highly available, distributed storage, making it a natural fit for usage alongside compute solutions. We have partnered with Citrix to collaborate on the integration of Apache CloudStack and Riak CS to create a complete cloud software offering that combines compute and storage in an integrated platform. For more information on our partnership with CloudStack, check out this blog post with the latest update. API and authentication support for OpenStack is also in progress.

Ready to get started? You can download Riak CS here, and check out the Riak CS Fast Track for a hands-on getting started guide.

Riak CS Intro Webcast

April 18, 2013

Next Tuesday, April 23, we will be hosting a Riak CS Intro webcast. It will take place at 11am PT/ 2pm ET. You can sign up for it here.

This webcast will cover:

  • A technical overview of Riak CS and its components
  • Riak CS architecture, operations, and new features
  • APIs and interfaces, as well as Riak CS Control
  • What’s available with Riak CS Enterprise
  • Use cases and user stories

We will also be available after the webcast to answer any questions that you might have. You can register for the Riak CS Intro webcast here.


Getting Started with Riak CS Control

April 17, 2013

Riak CS Control is a standalone user management interface for Riak CS, Basho’s cloud storage software. Riak CS Control provides a user interface for filtering, disabling, creating, and managing users.

To help get you started with Riak CS Control, we have put together a short video that walks you through the installation and configuration. It also goes over the basics of how to create and manage users.

For more information on Riak CS, sign up for our Riak CS Intro webcast on April 23rd or visit the Riak CS page.


Getting Started with Riak CS and S3cmd

April 9, 2013

Riak CS (Cloud Storage) is simple, open source storage software built on top of Riak. s3cmd is a command-line tool for uploading, retrieving, and managing data via an Amazon S3 compatible API.

In this short screencast, we cover the process of installing and configuring s3cmd on a Debian- or Ubuntu-based system. Once installed, we’ll use s3cmd to manage buckets and files in Riak CS. You can view the entire screencast below.

Due to the small type used in the screencast, we recommend viewing this video at a high resolution.

You can also learn more about Riak CS here.