Talend and FuseSource Fight for the Title of “Geek of Geeks” – Who has the Most Arcane Content?

I promised that I would share some real-world examples of how high-tech vendors’ marketing copy scores in my Arcanicity Index. So first up is some text from the respective web sites of Talend and FuseSource – two ESB vendors that I’m looking at in detail to test project REPAMATron.

As a reminder the Arcanicity Index looks to rate how much technical, or domain knowledge a reader would need to posses in order to understand a given piece of text. At the moment it works from the density of the abbreviations and acronyms found in the text. Future version may also look for technical concepts as well. The equation and MUCH more information on the Arcanicity Index can be found here or a slimmed down version is here.

The headline figures are as follows:

Element Talend Web Copy Figure FuseSource Web Copy Figure
Average Grade Index 18.24 13.68
Arcanity Index 21.02 72.76

The average Grade Index here is taken from the average of Flesh-Kincaid, Gunning-Fog, Coleman-Liau, Smog and the Automated Reading Index all of which aim to calculate the number of years of study someone would need in order to be able to comprehend the text from a structural point of view. What this tells us is that on average the structure of the text on the FuseSource site is much (nearly 5 school grades) easier to read.

However, if we look at my Arcanicity Index score (which is still a working draft and not finalised yet!) for the respective texts it shows that the FuseSource copy features a far higher degree of arcane concepts when compared to the Talend copy. The Arcanicity Index has a notional maximum of 100 although it is possible to score higher than 100. FuseSource scores 72.76 on that scale which makes the concepts contained within very complex.

What all this means is that the FuseSource web copy is likely to exclude non-expert readers and suggests that FuseSource is aiming at a VERY technically savvy audience. Talend’s web copy, whilst still containing very complex concepts is written in a narrative way with long hand explanations of concepts instead of using abbreviations and acronyms. Who has it right? Well I guess time and their relative success will tell.

I guess this goes to show that there is a significant difference between grade reading age and the degree of arcane, technical concepts within a text. This is why I decided to introduce the Arcanicity Index in the first place so I’m pleased it’s adding an extra layer of insight.

This just leaves me to announce that the current leader in the Arcanicity Index “Geek of Geeks” award for the most acronym-dense marketing copy is FuseSource with a score of 72.76.

Danny Goodall

For those masochists, insomniacs or natural language processing lovers out there; the statistics relating to the two vendors’ texts are shown below – in GREAT detail.

 

 

Text Structure Statistics

Here we see the statistics for the sentence and word structure. It’s clear from this table below that Talend uses longer and more complicated words when describing it’s ESB product offer.

Element Talend Web Copy Figure FuseSource Web Copy Figure
Number characters 1554 3007
Number of words 254 618
Number of sentences 12 24
Average chars per word 6.17 4.97
Average syllables per word 2.14 1.66
Average words per sentence 21.17 25.75
Words with 3+ syllables 85 101
% with 3+ syllables 33.46 16.34

 

Traditional Readability Index Scores

The table below shows the scores for Flesch-Kincaid, Gunning-Fog, Coleman-Liau, SMOG and the Automatic Reading Index. As mentioned above these indexes each attempts to predict the number of grade years of education that would be required to comprehend the text. More information can be found here.

Element Talend Web Copy Figure FuseSource Web Copy Figure
Flesch-Kincaid Grade 17.90 14.00
Gunning-Fog 19.80 15.30
Coleman-Liau 20.20 12.80
SMOG 15.30 11.90
Automated Reading Index 18.00 14.40
Average Grade–>> 18.24 13.68

 

Acronym Counts

The table below shows that statistics relating to acronyms. There are two sets of counts for each statistic – one for unique acronyms within the text and one that counts duplicates.

Element Talend Web Copy FuseSource Web Copy
Unique REST EIPS ESB JVM JEE OSGI ESBS WS BPEL JBI SOA JMX CSV JAX TAR POJOS JMS POJO OSGI FTP ZIP XSLT DSL FUSE JCA HAS MB APP MOV SOAPHTTP ADE ESB
Non-Unique REST EIPS ESB ESB ESB ESB ESB ESB JVM JEE OSGI ESBS WS BPEL BPEL JBI JBI JBI JBI JBI SOA JMX JMX CSV JAX TAR POJOS JMS JMS POJO POJO OSGI OSGI OSGI FTP ZIP ZIP XSLT DSL FUSE JCA HAS MB MB MB APP MOV SOAPHTTP ADE ADE ADE ADE ADE ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB ESB
Number of unique 6 26
Number of non-unique 11 62
Per 100 words (unique) 4.33 10.03
Per 100 words (non-unique) 2.36 4.21
Per 100 sentences (unique) 91.67 258.33
Per 100 sentences (non-unique) 50.00 108.33

 

Syllable counts

The table below shows the syllable counts for the words within each of the texts. The first line that begins [(1,93), (2,74)…] shows that there were 93 x 1 syllable words and 74 x 2 syllable words, etc. in Talend’s web copy. Again it is clear that despite the amount of words in the FuseSource sample being nearly double that in the Talend sample, Talend has very nearly as many 3 syllable words as FuseSource and only one less 4 sylaable word. Talend uses longer, more complex words to describe its offer.

Syllables Talend Web Copy FuseSource Web Copy
Syllable frequency [(1, 94), (2, 74), (3, 44), (4, 37), (5, 3), (6, 1)] [(1, 362), (2, 154), (3, 57), (4, 38), (5, 6)]
1 syllable ESB is a for an ESB a of and to a wide of ESB a broad set of EIPs it an of or ESB is and and is with and is for that are for rich the of a full ESB. both Web and REST the to choose the most for their tasks and more in their is on the source and is built on a that a of and it ‘out of the box’ with pre for in JEE or JVM. learn ESB a for an and for a and on that to The Source ESB for the fuse ESB is and by the people who wrote the fuse ESB has a that to use their in their SOA. JBI or or – BPEL XSLT or JMX – may be to a Fuse ESB and Fuse ESB may be to ESBs. key and – JBI JMS JCA JMX BPEL – Fuse ESB is on source app – JBoss and – and are – and are by – Fuse key – reuse – can freely and the code – use the of your choice class – can be in large – help for to – to and in more read the sheet. to Get Fuse ESB on for Fuse ESB 3. of are 81 MB TAR. gz 82 MB ZIP Source 4. 5 MB ZIP for your to why not check out how and for can help you be as as a part of your the is just the as a you can ask and and JBI in the your of the in the use the to builds in for and much join our To run the first the slide then the code These mov which to view. and Fuse ESB 4 In this Ade shows how to build and run Fuse ESB 4. he shows how to prune the so that FUSE ESB loads up the you need for your work. a Fuse ESB 4 In this Ade an of his and shows how to and share your Fuse ESB 4. he shows how to your with the Fuse ESB 4 and how to read your own and the a Web Fuse ESB 4 In this Ade shows how to create a SOAP/HTTP web JAX WS and it the Fuse ESB 4 he shows how to tie this new in with the an Fuse ESB 4 In this Ade shows how to create and the DSL he a file and FTP to read in plain text CSV them to a JMS and then splits the batch line by line for a JBI Fuse ESB 4 In this Ade shows how to take JBI and in Fuse ESB 3 and them in the Fuse ESB 4
syllables Talend lightweight service transport routing address talend supports standard patterns making Talend open standards based widely talend Service looking service without fledged talend Service supports based service giving freedom method talend makes users projects. talend based open Apache Camel project lightweight based supports data formats comes runtime ready OSGi Apache Tomcat servers about Talend’s products Talend complex projects. talend Service open lightweight service talend fully ready routing product based Apache Camel uses Patterns improve Open fully code. allows service any standard OSGi service engine binding deployed deployed other Feature Standards based design OSGi support Apache license license based open license Supports any server WebSphere many others tested training Backed Apache leaders employs used mission projects Access Apache access influence over project roadmaps detail. data ready Started. download Download based Apache Looking downloads earlier versions here. Windows Waiting download training Apache making product member answer Apache questions forums Apache product wiki browser access receive discounts Apache more. today. demo Videos demos yourself review download videos are. files require QuickTime building running screencast session install also only deploying POJO into screencast session gives deploy Java POJOs into logging logging system custom data using files OSGi deploying Service into screencast session service using deploy into also shared POJO deploying Pattern into screencast session deploy patterns. using Apache Camel creates poller files saves queue using data deploying into screencast session deployed deploy
3 syllables modular solution various enterprise provides messaging enterprise challenges. enterprise excellent solution regardless moreover therefore existing legacy components. Factory designed overhead Factory services suitable existing Factory Edition simplifies productive Factory component protocols programming packages deployment including standalone flexible solution Factory flexible Factory supported production Enterprise certified supported pluggable preferred solutions compliant component including engines container components Features Benefits Apache’s WebLogic releases versioned Supported consulting subscriptions Benefit Straightforward existing middleware components modify Flexible Enterprise critical deployment unix/mac complete. FuseSource subscription consulting successful possible FuseSource downloading beginning. document Subversion installing modules overview integrate properties service. runtime. component Enterprise enterprise persistent processing. component solutions container.
4 syllables integration requirements. combination enablement mediation variety integration integration topology infrastructure developers enablement additional development developers environment. Integration Community integration integration Integration architecture variety integration languages. configured environments application integration distributed integration architecture enablement. Integration mediation Integration developer Enterprise. productized architecture Productized integrated available ServiceMix contributers integration Embeddable redistribute development environment comprehensive evaluation development Interested ServiceMix available ServiceMix enterprise. community ServiceMix community ServiceMix community conferences community presentation installation development environment developed previously. Integration integration individual developed
5 syllables architecture. capability productivity
6 syllables interoperable

 

Talend ESB Web Copy

Here is a sample of text taken from the ESB product page from Talend’s web site.

Talend ESB is a lightweight, modular solution for various enterprise integration requirements. An ESB provides a combination of service enablement, messaging transport, routing, and mediation to address a wide variety of enterprise integration challenges.  Talend ESB supports a broad set of standard enterprise integration patterns (EIPs) making it an excellent solution regardless of topology or architecture.  Moreover, Talend ESB is open and standards-based and therefore is widely interoperable with existing and legacy infrastructure components.

Talend Service Factory is designed for developers that are looking for rich service enablement capability without the additional overhead of a full-fledged ESB.  Talend Service Factory supports both Web services and REST based service development giving developers the freedom to choose the method most suitable for their existing environment.

Talend Integration Factory Community Edition simplifies integration tasks and makes users more productive in their integration projects. Talend Integration Factory is based on the open source Apache Camel project and is built on a lightweight, component-based architecture that supports a variety of integration protocols, data formats and programming languages.  It comes “out-of-the-box” with pre-configured runtime packages ready for deployment in environments including OSGi, Apache Tomcat, JEE servers or standalone JVM.

Learn about Talend’s application integration products:
Talend ESB, a flexible solution for complex distributed integration projects.
Talend Service Factory, an open, lightweight and flexible architecture  for service enablement.
Talend Integration Factory, a fully supported, production-ready mediation and routing product based on Apache Camel that uses Enterprise Integration Patterns to improve developer productivity

 

FuseSource Web Copy

Here is a similar sample taken from the product page for the FuseSource Fuse ESB.

The Open Source ESB for the Enterprise.Fuse ESB is certified, productized and fully supported by the people who wrote the code. Fuse ESB has a pluggable architecture that allows organizations to use their preferred service solutions in their SOA. Any standard JBI or OSGi-compliant service engine or binding component – including BPEL, XSLT or JMX engines – may be deployed to a Fuse ESB container, and Fuse ESB components may be deployed to other ESBs.

Key Features and Benefits
Feature

Standards-based design – JBI, OSGi, JMS, JCA, JMX, BPEL support Apache license – Fuse ESB license is based on Apache’s open source license
Supports any app server – WebSphere, WebLogic, JBoss and many others
Productized – integrated, tested, and releases are versioned
Supported – training, consulting, and subscriptions are available
Backed by Apache leaders – Fuse employs key ServiceMix contributers

Benefit

Straightforward integration – reuse existing middleware components
Embeddable – can freely modify and redistribute the code
Flexible – use the development environment of your choice
Enterprise-class – can be used in large, mission-critical projects
Collaboration – comprehensive help for evaluation, development, deployment
Access to Apache – access to and influence over project roadmaps
Interested in more detail? Read the data sheet.

Ready to Get Started?

Download

Download Fuse ESB (based on Apache ServiceMix)Looking for Fuse ESB 3? Downloads of earlier versions are available here.
Unix/Mac (81 MB, TAR.GZ)
Windows (82 MB, ZIP)
Source (4.5 MB, ZIP)
Waiting for your download to complete? Why not check out how FuseSource subscription, training and consulting for Apache ServiceMix can help you be as successful as possible making FuseSource a part of your enterprise?

Downloading the product is just the beginning!As a community member, you can:
ask and answer Apache ServiceMix and JBI questions in the community forums
document your investigation of the Apache ServiceMix product in the wiki
use the community repository browser to access builds in Subversion
receive discounts for Apache conferencesand much more…Join our community today!

Demo Videos
To run the demos yourself, first review the slide presentation then download the code:
These videos are .mov files which require QuickTime to view.
Building, installing and running Fuse ESB 4 (screencast): In this session, Ade shows how to build, install and run Fuse ESB 4. He also shows how to “prune” the installation so that FUSE ESB only loads up the modules you need for your work.
Deploying a POJO into Fuse ESB 4 (screencast): In this session, Ade gives an overview of his development environment, and shows how to deploy and share your Java POJOs into Fuse ESB 4. He shows how to integrate your logging with the Fuse ESB 4 logging system, and, how to read your own custom configuration data using properties files and the OSGi configuration service.
Deploying a Web Service into Fuse ESB 4 (screencast): In this session, Ade shows how to create a SOAP/HTTP web service using JAX-WS and deploy it into the Fuse ESB 4 runtime. He also shows how to tie this new component in with the shared POJO developed previously.
Deploying an Enterprise Integration Pattern into Fuse ESB 4 (screencast): In this session, Ade shows how to create and deploy enterprise integration patterns. Using the Apache Camel DSL, he creates a file- and FTP-poller to read in plain-text CSV files, saves them to a persistent queue using JMS, and then splits the batch data line-by-line for individual processing.
Deploying a JBI component into Fuse ESB 4 (screencast): In this session, Ade shows how to take JBI solutions developed and deployed in Fuse ESB 3 and deploy them in the Fuse ESB 4 container.

And that, was all she wrote.

Posted in All Blog Categories, communication, Competition and Competitive Intelligence, natural language processing, readability, REPAMATron, Vendor-Related Posts and tagged , , , , , , , , , , , , , , , , , , , .

Leave a Reply

Your email address will not be published. Required fields are marked *