web2express.org

June 22, 2009

New feature: RSS feeds for twitter digest

Filed under: news, web intelligence — aj @ 9:50 am

Just added RSS feeds on the web2express digest website. There different feeds for the different views of hot topics:

  • Feed for today’s new topics.
  • Feed for today’s top topics.
  • Feed for daily new topics in the last 3 days.
  • Feed for daily new topics in the last 7 days (a week).

They are available for anyone to subscribe and use on their websites.

June 19, 2009

case study: web2x digest picked up news event early

Filed under: news, web intelligence — aj @ 12:14 am

I notice a perfect case study from today’s news:  “Continental airlines incident” was emerging at about 8:29am (PST) on the twitter daily new topic list on this http://web2express.org website. it was probably at the same time as other major news outlets broke the news, maybe even a litter bit earlier.  Surprisingly, this top news did not show up on Twitter.com’s trending topics list, nor on google trends at all for the whole day.

This case study shows the difference between various trending applications. I think the underlying technology is the key. Calais is the semantic analysis core of my real time trending system. It seems Calais does pretty good job as it promises.

more digging:

I just found the flight schedule from NYT.com report:  The flight, Flight 61, took off at 9:54 a.m. in Brussels (3:54 a.m. Eastern time), according to Continental’s Web site. It touched down at 11:47 a.m., earlier than its scheduled noon landing, at Gate C123 at Newark Liberty International Airport.

Something is amazing if you compare the time carefully. The incident showed up as new hot twitter topics at 8:29am PST while the flight was still in the air.  where did the tweets come from so early? did someone tweet in the airplane? or people on the ground in Europe got the information early and tweet? Anyway, news travels on twitter fast, very fast!

June 17, 2009

semantic technology conference 2009

Filed under: events, news, semantic search engine, web intelligence — aj @ 8:59 pm

I have attended the semantic technology conference again this year – the largest gathering for semantic tech companies. Several things stand out:

Google gave a session on semantics – amazing change of altitude from the past. They support mircoformat and RDFa in documents and use metadata to make search results more relevant.  Great turning point.

Several companies have opened their NLP/semantic analysis core through api service, and they are free at least for developers. This is great news for the developer community because more tools are avaliable for creating new semantic web applications. In addition to OpenCalais and Zemanta, which I have tried before, the new semantic apis come from Ontos, AdaptiveBlue, Dapper, and Expert Systems.

Tom Gruber announced virtual personal assitant from his new company Siri. It works on mobil phone and provides a speech input interace for intelligent search. Cool product!

I moderated a panel discussion on web intelligence. Four companies on the panel (Siri, Zemanta, Expert Systems, Overtone) have different tricks to bring some levels of intelligence to the web or customers.  There were lots of questions from the audience. My opening remarks tried to convey the following messages:

1. Web intelligence is becoming the hallmark of the web as social networking connects people closer. The impact is showing up in many application ares:

  • Smarter Social Networking
    • connect to people you like and sources you trust
    • assisted by consumer intelligent agents
    • Twine, Siri
  • Semantic Search answering your questions
    • Google, Microsoft Bing, Wolfram Alpha
  • Semantic Publishing
    • Reuters, Freebase, Zemanta
  • Web-scale Market Intelligence
    • understand customer needs in real time
    • having intelligent dialog with customers  (branding)
    • Overtone, Attensity, ScoutLabs
  • Semantic Online Advertising
    • targeted by content semantics
    • Google, Peer39, TextWise

2. Plenty of open/free tools and data are available for developers to start creating web apps with intelligence.

Lots of free/open tools available to play with:

  • semantic analysis API: Open Calais, Zemanta
  • NLP: OpenNLP, Stanford NLP tools
  • knowledge management: Protege, Jena
  • search: Lucene, Solr
  • social networking: Orkut,


Lots of data available:

  • Twitter stream
  • News feeds, blog feeds, …
  • Wikipedia, Freebase, DBPedia, …

3. Latest trend:  real time data + intelligence. Semantic Web SIG will focus on twitter stream and applications on the next event on July 1.  TechCrunch will devote a full-day conference for this topic if real time data stream on July 10.

Overrall, the semantic technolgy is getting more steam now and we’ll see what killer apps will emerge in the next year or so.

aj chen

Powered by WordPress