Salesforce.com’s Force and SAP’s A1S

Two announcements this week, all centered around small and mid-market applications built on flexible platforms. Force.com from Salesforce.com is a rebranding of their technology with a focus beyond CRM applications. This article mentions some promising case studies including 45,000 users running on the Salesforce.com platform already. That’s the Japanese post office, so obviously internationalization isn’t an issue. Not everyone is convinced of Salesforce.com’s potential. Joshua Greenbaum thinks that SAP’s A1S, to be announced on Sept 19th, will be the one to beat.

Either way, I’m pretty excited. On-demand, software-as-a-service business software platforms with straightforward development tools have incredible potential. Highly-tailored applications are the way to go, but you don’t want to be caught in the low-level details of databases and web service implementation.

TeraData Performance at KiloData Prices

What if you could turn on a massively parallel business intelligence database cluster with a few lines of code? What if you could leverage in-house and outsourced resources for computation and storage as needed? What if you could expand your analysis, data mining and text-search effort one node at a time, transparently, instantly?

There’s been a flurry of discussion around Hadoop and the Hbase project to bring Google’s BigTable feature to Hadoop.

Now Amazon wants to talk about how to use Hadoop with EC2 and S3, their computing and storage clusters.

Can I search large volumes of data on the cheap? Yes, but my algorithms must fit within the MapReduce framework.

Does someone have a MapReduce-enabled data query language? Well, there’s Pig from Yahoo. Sawzall from Google. Here is a discussion comparing those two from Greg Linden. Abacus from the Hadoop project. Apparently Microsoft has DryadLINQ.

We are on the exponential curve as it swoops upward dramatically. From the power and flexibility of opensource, anyone can use Google secret sauce on Amazon’s computers for 18 cents per gigabyte and 10 cents per computing hour.

Interactive Information Visualization

Enrico Bertini at Visuale asks how important is interactivity in information visualization? As a proponent of QlikView, Spotfire, Tableau and others, I think it’s extremely important. Interactivity is the future, it’s “make or break.”

I’ve been implementing speed-of-thought interactive BI tools for 6 years and I don’t want to do it any other way. When I watched my first seasoned executive lose restraint and laugh uncontrollably as he got instant answers to his hardest questions, I knew this was the only way to go. When my end-user training sessions end late because everyone is so excited about what they can do, it’s clear that people NEED interactivity.

Response to the Tableau 3.0 Webinar

I finally got around to watching the Tableau 3.0 webinar. I agree with their very excited presenter that Tableau 3.0 is a leap forward. The support of ad-hoc grouping of dimension elements is excellent as is the enhanced support of ad-hoc sets. The annotations look good and act sensibly. Generally, the new features are focused on ease of use, better statistical analysis, and report clarity. All good things. Here are 3.0 examples.

Annotations should be required in every BI tool. The ability to mark reference lines and data points on graphs and tables is critical to clear communication. Placing an annotation on a point in space does not require a data point to exist there, another nice feature. The smart BI vendors are focusing on collaboration and communication among users.

“Groups” stole their name from the “groups” of 2.x which are now the “sets” of 3.0 and can be used like so: similar dimensions such as coffee and tea, which may need to be represented in the database as separate product lines, can now be combined on the fly within Tableau by an end user under the simple heading “drinks”. This would make it easy to answer a question about food vs drink sales without the need to export to Excel and spend more time adding up the drink categories. In short, “groups” bring dimension values together and “sets” allow for separating special values from the rest of a dimensions values–and both can be done by the end user. Pretty nice.

I think the strongest competitor for visualization is Spotfire. However, Tableau’s use of live database interaction will become an advantage as data warehouse implementations shift to high-performance in-memory read-optimized databases. Was that over-hyphenated? Spotfire’s initial data loads are inflexible and I wouldn’t recommend it if you need to update a large dataset frequently.

Unlike QlikView, all of Tableau’s data needs to be in a single database. With good design, this is not a performance issue. The problem is that the extra expense of hardware and software to store a separate data warehouse and run ETL processing may push Tableau’s final price tag far above QlikView, which can easy pull from multiple sources and uses its own high-speed database.

Beautiful Music with reactable

From the Björk shows comes the reactable. The demo is wonderful. Via Boing Boing.

snag-0003.png

A Periodic Table of Visualization Methods

What and outstanding site! Found on Kevin Kelly’s Cool Tools blog.

snag-0001.png

Analytical Sniglets

The team at Juice Analytics asked its readers for the “Lingua Analytica” and here is the evolving dictionary. Some of my favorites are abracadata, frankensheet and sociomath.
My additions:

  • Bitpicking (v): Carefully choosing data to create the most favorable impression.
  • Sheetcake (n): A spreadsheet with excessive formatting and color.
  • Spreadache (n): Exceeding the Excel 65,000-row limit.