The sandbox is a virtual machine which comes complete with a very robust and working Hadoop installation and some tutorial material. I’m looking elsewhere for more tutorial material, since this is very much a mere taste of what’s possible and the company seems to be selling their education services via this sandbox.
That said, I can tell you the main thing I’ve learned so far is that I don’t have a need for Hadoop. If you don’t have a server farm that you’re trying to turn into a cloud and a need to crunch a lot of data that you load-balance across your cloud, you don’t need the big data toolset. We’ve got a very, very long way to go before SQL Anywhere is insufficient to meet our needs.
But I’ll admit figuring out how Pig works is kinda fun.