O’Reilly Radar > Database War Stories #6: O’Reilly Research

O’Reilly Radar > Database War Stories #6: O’Reilly Research

In building our Research data mart, which includes data on book sales trends, job postings), blog postings, and other data sources, Roger Magoulas has had to deal with a lot of very messy textual data, transforming it into something with enough structure to put it into a . In this entry, he describes some of the problems, solutions, and the skills that are needed for dealing with unstructured data.




Leave a Reply

R-Echos

Since 2004, R-Echos is an experimental online magazine dedicated to republication; topics vary from biology to graphic design, from ecology to business. It agglomerates anything which is about art, computing, science. His form is made out of collages of texts, links, images, references, videos and sounds - choosen with care to take part to this very personnal publication.

* Electronest

  • About
  • Articles
  • Beta version
  • Categories
  • Defragmentation
  • Index
  • Monthly Archives
  • R-Echos issue 1
  • Somewhere else
  • Tags
  • Visual Index
  • Visualisation
  • Collections

  • Displaying
  • un-Realisation
  • Physical Interface
  • Augmented Reality
  • Publishing
  • Geometry
  • Visualisation
  • Recently republished | Most Read

    Subscribe in a reader

    Enter your email address:

    Delivered by FeedBurner