Cloudera on AWS
Cloudera
External reviews
46 reviews
External reviews are not included in the AWS star rating for the product.
Big Data made easy with Cloudera
What do you like best about the product?
Cloudera spares you from having to understand the depths of a Hadoop cluster when you are just starting your analytics work. It is very easy to deploy servers with HDFS already installed and connected, which gives you out-of-the-box support for running Hive or Pig queries.
What do you dislike about the product?
There is very little to dislike about Cloudera until it fails for some (usually) unknown reason. Crashes are not common, but they are very annoying when they happen during a very long job. Failures are common in any distributed system, though, so the product needs to give better feedback through its logs and error logs.
What problems is the product solving and how is that benefiting you?
If you want to deploy a relatively small cluster to cut batch processing that was taking days down to hours, or to cut queries over a large data set that were taking hours down to minutes, using a familiar SQL-like language such as Hive is perfect.
Recommendations to others considering the product:
Cloudera and Hortonworks are the products to start with if you don't already know everything that is involved in a Hadoop cluster. Cloudera is more popular and has been around for a long time, but Hortonworks is also a good option. Cloudera certifications are valuable in the industry (although Hortonworks certifications are cheaper). It depends on your focus: if you prefer a better-known and more widely used product for getting started with a Big Data cluster and intend to get certified, try one of the virtual machines that Cloudera offers to start playing with. If you simply want to learn a bit about "Big Data" for yourself, I would give Hortonworks a try, as all of its architecture is also open to read and learn from.