Featured

Other Posts

Getting Started with Amazon Redshift – Sample Chapter

Here is a preview of one of the chapters (available at PacktPub):

Leave a Comment Continue Reading →

What does that checkbox do? (Redshift Encryption)

**EDIT**     I don’t normally go back and edit things in a prior post… but… in this case the additional link to Werner Vogels blog  that came out after I wrote this is worthwhile to the topic at hand.           **EDIT** As I alluded to in the prior post, I have been busy…   I have actually been busy […]

3 Comments Continue Reading →

More blog-posts to come…

I know in my last post, I indicated there would be more blog posts shortly on Amazon Redshift.  Those posts are coming… this is not that post… (sorry).  My bog-posts have taken a short detour, that in the coming weeks will become more clear.  For now, know that I have not forgotten you!… I will […]

2 Comments Continue Reading →

“Big Data” with Amazon Redshift – Intro

If you have been following along on my blog, you have seen the various technical and other ramblings for my recent R&D efforts around “Big Data”.  A little while ago I wrote about giving Amazon a try which I still believe is true… and long those lines, am now giving Amazon Redshift (their new entry […]

7 Comments Continue Reading →

Hadoop – some basic setup

For those of you coming along on this journey, I want to take a quick step back.  Rather than assuming what you do and don’t know a little background to how to make this installation go. This post is not really “interesting findings” but rather a how-to for the install process. As I explained in […]

Leave a Comment Continue Reading →

Hadoop/Hive a few lessons learned

It has been a few days since the last set of posts, and quite honestly did not want to leave it hanging even this long, so I wanted to give a brief update to (at least help) cut-off some of the frustrations I have faced for those of you attempting the same path through the […]

Leave a Comment Continue Reading →

Hadoop +1 (add a node that is…)

This one gets a little finicky depending on you configuration, and how much horsepower you have available to you.  If you started of with my first post, and built a VM … ideally … you made a clone of the host once you had Hadoop running, which will make this “easier”.  You are quickly getting […]

2 Comments Continue Reading →

From Hadoop to Hive

This is the next step after you have completed the initial setup to get Hadoop running, this will walk you through the steps to get Hive running, everything in the prior post is a prerequisite to this setup.  Just as my prior post, this is nothing “ground breaking”, but hopefully will provide a consolidated place […]

2 Comments Continue Reading →

Single Cluster Hadoop – from Zero to Hadoop

Before I get into the installation… what is Hadoop anyway?  Normally I don’t like the “cut-past” approach of blogging, but in this case I make an exception from the Apache Docs: “The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed […]

4 Comments Continue Reading →

Amazon – Give it a try

A little more than a week ago Amazon made some announcements, near and dear to my heart, warehousing.  Only this time, as you would expect from Amazon, it is moving to the cloud… with the announcement of their “Redshift” product.  I have been intrigued by some of the advantages that cloud computing provides, however, from […]

2 Comments Continue Reading →