Thursday, 30 June 2011

The battle between "Open Science" and "Open Innovation"

"Open innovation" is a term that describes the sourcing of new methods, ideas, solutions etc. from outside the organisation.

I hate "open innovation"! I don't hate the process of "open innovation", I just hate the term. Because there's nothing "open" about it. The final "innovation" is just as closed as if it had it been invented in-house. 

The poster child of open innovation is, of course, InnoCentive. Clever name, brilliant business model: "Seekers" invite "solvers" to provide solutions to their problems for a cash reward. InnoCentive represents open-something for sure, but if not innovation, then what? Open questions? Not quite. Open quandaries? Better. Open befuddlement? Too far! I therefore humbly suggest;
InnoCentive; crowdsourced solutions to open quandaries.

However, so called "open innovation" extends way beyond crowdsourcing of the Innocentive mould. A great number of acquisitions and licensing deals, particularly in the pharmaceutical industry, can be seen in this light. Although such transactions open very little to the public domain it is clear that innovation has technically come from outside the purchaser/licensee, hence "open innovation" still fits.

More on "open innovation": Pharma are increasingly looking to bypass the biotech middleman by partnering directly with academia. This represents the funding of public research by multinational corporations in exchange for first dibs on any intellectual property that may emerge. It could be argued that such initiatives pit "open innovation" in mortal combat with "open science". "Open science", remember, asserts that public research belongs public domain, for free, and for the good of all.

Semantic posturing aside, innovation is the key to progress, no matter how it is couched. A nice recent example from Henry Chesborough (who coined open innovation) on how it can help pharma. So let's call a spade a spade; I hate the term "open innovation" because I can twist it to be in conflict with "open science" which is a movement that I truly value. I would rather "open innovation" revert to plain old "contract research", perhaps reserving "open quandary" to describe the crowdsourcing of same. 

Thursday, 16 June 2011

The brilliant Genome Analysis Crowdsourcing repository

In the days following the deadly German E. coli outbreak various 'rapid response' sequencing, assembly and annotation efforts washed across my radar (mainly via twitter). In isolation each of these efforts represents little more than a shop-front for their respective creator's (albeit impressive) capabilities. There was always the nagging feeling that a coordinated effort would have been more credible, and ultimately more useful.

Having perused a couple of the available data sets to see which file formats were being distributed I was hoping to find a blog post that summarised them all. That's when I found the E.coli O104:H4 Genome Analysis Crowdsourcing repository at GitHub. This goes way beyond being a simple blog. It represents a living repository linking all of the data generation efforts to-date. If that in itself were not enough, there is also a day-by-day listing of analysis reports (mainly blog posts).

I now contend that "Genome Analysis Crowdsourcing", by pooling various independent data and analyses makes these as credible and useful, if not more so, than any coordinated project could possibly have been. The quantity and variety of data in the public domain, all generated within 2 weeks, linked from a central location, is staggering!

Thursday, 2 June 2011

Open communities - build or reuse?

I have drafted this blog a few times and I'm bored with the narrative. I'm therefore going to spit out the conclusion right at the start: if you want to engage a developer community for your project go to them, don't ask them to come to you (they almost certainly won't)...

As funding for open databases like NCBIs OMIM is cut, there tend to be fairly rational calls for the database curation to be opened up to the community (eg Manuel Corpas' recent blog post). The typical method is the addition to the project of a wiki interface that accepts community annotation. I've been at the birth of a few such projects. No names named, and here's why; I've also sadly attended their inevitable deaths from neglect when no bugger ever used the darned things. Whilst notable successes exist (EcoliWiki, SNPedia, the Polymath Project) building an open community from scratch is hard, very hard, and most projects are doomed to failure. I, for one, limit myself to participating in two or three projects at any one time, and need a very compelling reason to start contributing to a new one.

So, given that collaborative development does produce valuable products and individuals can be motivated to contribute, how do we go about finding our contributors? The solution is actually pretty obvious; don't build an open community from scratch - use an existing one! The shining example of this approach is the Rfam adoption of Wikipedia itself as the source of community‐derived annotation, with advantages described in this NAR paper, including;
  • Access to a large existing community of curators,
  • Access to well maintained, user-friendly curation tools,
  • Entries subjected to automated QC tools (bots),
  • Leading to improved database content (around 2500 contributions/year),
  • Plus the side effect of improved discoverability of the resource via Wikipedia itself. 

It will be interesting to see other annotation projects cotton on to this idea; Pfam already has, but it's from the same Bateman stable as Rfam, so might not count (I've already been chided for mixing the two over at this Tree of Life blog post on a similar subject). Away from annotation, for active and inclusive bioinformatics-specific open communities you have the OBF leading the way, and also Debian Med (now blogging here) who are leveraging the wider Debian Linux community for the benefit of the life sciences. Whether there will be open science projects that successfully leverage Twitter and other social media communities remains to be seen.

So; what's the point of all this? Oh yes - if you're serious about engaging a developer community for your project go to them, don't ask them to come to you. Got that?