Reuters CEO sees “semantic web” in its future

Tags: , , , ,
Categories: electronic culture
Hits for this post:677
Tiny URL: http://r-echos.net/lk/12011
Sunday, February 10th, 2008 at 11:04 pm
Bookmark on del.icio.us | Twitter This Stumble This

Reuters CEO sees “semantic web” in its future: ”

By Tim O’Reilly

Money:Tech logo

At Money:Tech yesterday, I did an on-stage interview with Devin Wenig, the charismatic CEO-to-be of Reuters (following the still-not completed merger with Thomson). Devin highlighted what he considers two big trends hitting financial (and other professional) data:

  1. The impact of consumer media on professional media. As young people who grew up on the web hit the trading floor, they aren’t going to be satisfied with text. Reuters needs to combine text, video, photos, internet and mobile, into a rich, interactive information flow. However, he doesn’t see direct competition from consumer media (including Google), arguing that professionals need richer, more curated information sources.

  2. The end of benefits from decreasing the time it takes for news to hit the market. He describes the quest for zero latency in news, from the telegraph and early stock tickers and the news business that Reuters pioneered through today’s electronic trading systems. (Dale Dougherty wrote about this yesterday, in a story about the history of the Associated Press.) As we reach the end of that trend, with information disseminated instantly to the market via the internet, he increasingly sees Reuters’ job to be making connections, going from news to insight. He sees semantic markup to make it easier to follow paths of meaning through the data as an important part of Reuters’ future.

Devin’s point about the semantic web was thought-provoking. Ultimately, Reuters’ news is the raw material for analysis and application by investors and downstream news organizations. Adding metadata to make that job of analysis easier for those building additional value on top of your product is a really interesting way to view the publishing opportunity. If you don’t think of what you produce as the ‘final product’ but rather as a step in an information pipeline, what do you do differently to add value for downstream consumers? In Reuters’ case, Devin thinks you add hooks to make your information more programmable. This is a really important insight, and one I’m going to be chewing on for some time.

That’s a really good case for the Semantic Web, and one that I hadn’t understood before. It’s not about having end users add semantics for the love of it. That’s just overhead, which is why I’ve always argued against it, preferring the kind of implicit semantics that come from applications that harness user self-interest. But professional publishers definitely have an incentive to add semantics if their ultimate consumer is not just reading what they produce, but processing it in increasingly sophisticated ways.

But even if Devin is right about one role of a publisher being to add value via metadata, I don’t think he should discount the statistical, computer-aided curation that has proven so powerful on the consumer internet. (Curation is that part of the publisher’s job that consists of choosing and arranging the content that is presented to the ultimate consumer — reading the slushpile, if you will, so that others don’t have to, and making sure that the most important material gets its day in the sun.)

Explicit semantic markup has thus far not proven to be anywhere near as powerful as techniques for mining implicit semantics, or the design of applications in which more implicit semantics are created by users simply by ‘living as and where they live.’ (Facebook’s ’social graph’ is the latest example of this kind of implicit semantic application.) Much success on the consumer internet has resulted from innovations in curation. After all, PageRank is a kind of automated curation via collective intelligence, as is Flickr’s interestingness algorithm, user voting on slashdot and digg stories, and even community editing of Wikipedia.

Devin is completely right, though, about consumer media changing expectations for professional media. I see a lot of future upset in enterprise software as well as in media, as consumer phenomena like mashups and social networking change ease-of-use expectations of applications like CRM and business reporting. But it seems to me that mainstream media needs to learn not just about multimedia, but also about new sources of information.

A huge part of the generational change is a change in expectations of transparency, informality, and sources of authority. So when Devin says that Google isn’t terribly useful for professional uses like financial research, I think he misses just how much authority bloggers are getting as reliable news sources, and how people are using tools like iGoogle to pull together targeted RSS data feeds. Raw Google results may be less useful than Reuters-filtered results, but how about community or expert-curated Google results? Just as Reuters’ customers are adding value to the Reuters data stream, they are capable of adding value to the Google data stream. And there are increasingly powerful tools for managing that stream.

What I think does ultimately matter is the ability of professional media to build specialized interfaces and vertical data stores that are suited to their niche, hopefully harnessing data and services from the consumer internet, and mashing them up with specialized, perhaps private, data stores. Put that together with metadata for programmable re-use, and you may really have something.

On the end of timing arbitrage due to zero latency in information distribution, I have to disagree with Devin. There’s still a huge amount of information that never hits the market because people aren’t paying attention to the right things. David Leinweber, who spoke at the conference yesterday, gave a great example, of a huge price move in the stock of a pharmaceutical company as a result of news of successful clinical trials for a cancer vaccine. Every single news story on the subject resulted from the company’s press release, yet there was information available on the web months in advance of the story. Leinwebber’s point: it isn’t about speeding up news distribution, but about getting better access to sources of the news. As he said: ‘You have to get the news before the news people get there.’

Even in areas like company financials, the bread and butter of equity analysis, is there any reason why earnings are still reported only quarterly? There may one day be a company that breaks ranks, and shows its data in real-time (as it is increasingly coming to company executives.) The expectation of radical transparency may one day reach even into this relic of the 19th century.

I also think there’s a huge opportunity to get to data sooner via the sensor revolution. When phones report location (disclosure: O’Reilly AlphaTech Ventures investment), when phones listen to ambient sound, when credit cards report spending patterns (disclosure: OATV investment), when cars report their miles traveled, when we’re increasingly turning every device into a sensor for the global brain, there will be more and more sources of data to be mined.

And yes, we’ll need humans (at least for a while) as the last mile to extract meaning from all that data. But we’ll want those humans to be augmented with tools that notice patterns and exceptions as they happen, not just after the ‘news’ hits.”

(Via O’Reilly Radar.)

Related Posts




One Response to “Reuters CEO sees “semantic web” in its future”

  1. Paul Haley Says:

    Tagging may be more of an obstacle or onus than it is worth. Leinwebber is right. The information can be available on the web before it is identified as relevant or tagged by a channel. There is a huge opportunity for a crawler to find nformation that is relevant to a purpose and channel it in real-time, but if it has to consume rather than generate tags, it won’t work. The approach being taken by companies like Powerset is the way to go. Focusing automatic semantic extraction on a domain, such as finance, is clearly important. The question is, when will the NLP be up to it. Manual tagging is unlikely to flag relevant content back with adequate semantic detail and precision when it is first authored (i.e., before it may be tagged by Reuters or others).

Trackbacks

Leave a Reply

R-Echos

Subscribe in a reader


Since 2004, R-Echos is an experimental online magazine dedicated to republication; topics vary from biology to graphic design, from ecology to business. It agglomerates anything which is about art, computing, science. His form is made out of collages of texts, links, images, references, videos and sounds - choosen with care to take part to this very personnal publication.



  • About
  • Articles
  • Beta version
  • Categories
  • Defragmentation
  • Directory
  • Index
  • Links
  • Monthly Archives
  • R-Echos issue 1
  • Somewhere else
  • Tags
  • Visual Index
  • Visualisation


  • Search R-Echos



    * curation / edition / selection is made by Electronest

    On Purpose: Design Concepts

    On Purpose: Design Concepts

    On Purpose: Design Concepts looks at conceptual design practices, the emergence of ‘meta design’, and the question of who or what can define something as design…
    With Åbäke, Droog Design, Daniel Eatock, Electronest, Ann-Sofie Back, Will Holder, Peter Jensen, Onkar Kular & Noam Toran, Metahaven, Alex Rich, Savage, Yuri Suzuki
    September 13 - [...]

    websites and White Cubes

    websites and White Cubes

    Dumb sign, originally uploaded by blackbeltjones.
    Been asked to work on the nominations for designs of the year again at the Design Museum, which is very nice.But it leads me back to this hoary old question – how should interactive work best be shown in a museum or gallery context? Should it be [...]

    R-Echos issue 1 - AMP001

    R-Echos issue 1

    An experiment in the economics of production: how can we shift focus from consumption of a finished product to investment in the processes of design, print & production?

    This is a poster and a text: an analog R-Echos
    Would you be interested in investing in the tangible production of this work?
    1. You can download the digital archive
    and [...]

    What if, VACANT LOT, Hoxton, London

    What if, VACANT LOT, Hoxton, London

    Related PostsBuilding and designing Digitalism’s IdealisticThe best CNC project machines - Hack a Daygreenpix zero-energy massive LED displayDIY Blubber BotBotanicalls Twitter DIYBuild Your Own War Bot - Wired How-To WikiHOW TO - Embroider digital imagesThe Shipyard ReturnsBottoms Up DoorbellThey were flexible in the fifties tooThe Magic Roundabout, SwindonPrintBot [...]

    magazines as objects exhibition

    Colophon events this week

    Colophon events this week

    There are a couple of Colophon-related events in Europe this week. First up, Andrew Losowsky – that’s him above next to a copy of IsNotMagazine – has curated an exhibition of magazines as objects in Milan. CR Blog has an in-depth report with details – it sounds great, lots of magazine-y-ness. Andrew’s [...]



    Collections

    * at the occasion of R-Echos issue 1 we organised some pages into topic oriented piles:

  • Displaying
  • un-Realisation
  • Physical Interface
  • Augmented Reality
  • Publishing
  • Geometry
  • Visualisation


  • R-Echos has its own tiny url system:

    * tiny url are url you can copy/paste into email without the risk of having a long line that surely will get broken and a link unusable.



    R-Echos context

    To get updates via email:

    mailinglist delivered via FeedBurner



    free advertising network