Since last week I have been implementing query expansion that restricts the expansion terms to the most similar segments of the top documents retrieved during the baseline run. I completed the Java implementation, which reads in the baseline retrieval results and the original TREC-formatted query and outputs an expanded TREC-formatted query. The expansion terms are chosen from the sections of those documents that are most similar (with respect to term overlap) to the query sentences.
The expanded query increased MAP (mean average precision) from 0.56 to 0.59 on the FIRE test collection.
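The segment selection described above can be sketched roughly as follows. This is only a minimal illustration, not my actual implementation: the class name `SegmentExpansionSketch`, the method names, the regex tokenizer and the `topSegments` parameter are all assumptions made for the example, and TREC parsing, stopword removal and stemming are left out. Overlap here is simply the number of distinct terms a segment shares with the query.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: rank document segments by term overlap with the query
// and take expansion terms only from the best-matching segments.
class SegmentExpansionSketch {

    // Lowercase and split on non-alphanumeric characters.
    // A stand-in for proper tokenization, stopword removal and stemming.
    static Set<String> terms(String text) {
        Set<String> out = new LinkedHashSet<>();
        for (String t : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    // Number of distinct terms shared by the query and a segment.
    static int overlap(Set<String> queryTerms, String segment) {
        Set<String> shared = terms(segment);
        shared.retainAll(queryTerms);
        return shared.size();
    }

    // Return the terms of the topSegments highest-overlap segments
    // that do not already occur in the query.
    static List<String> expansionTerms(String query, List<String> segments, int topSegments) {
        Set<String> queryTerms = terms(query);
        List<String> ranked = new ArrayList<>(segments);
        // Sort by descending overlap; the sort is stable, so ties keep input order.
        ranked.sort((a, b) -> Integer.compare(overlap(queryTerms, b), overlap(queryTerms, a)));

        Set<String> expansion = new LinkedHashSet<>();
        int limit = Math.min(topSegments, ranked.size());
        for (String segment : ranked.subList(0, limit)) {
            for (String t : terms(segment)) {
                if (!queryTerms.contains(t)) {
                    expansion.add(t);
                }
            }
        }
        return new ArrayList<>(expansion);
    }

    public static void main(String[] args) {
        // Toy segments standing in for sections of the top baseline documents.
        List<String> segments = Arrays.asList(
                "The weather in Dublin is rainy",
                "Query expansion improves retrieval effectiveness",
                "Retrieval models rank documents");
        System.out.println(expansionTerms("query expansion retrieval", segments, 1));
    }
}
```

In this toy run only the second segment is kept, so the query would be expanded with the terms of that segment that it does not already contain. A real version would presumably weight the candidate terms instead of adding all of them.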
Monday, May 24, 2010
My Research activities
I joined the CNGL (Centre for Next Generation Localisation) research group at Dublin City University in January 2010 as a PhD student. I work in the Digital Content Management track, which focuses on Information Retrieval and Adaptive Hypermedia systems.
I decided to start this blog to be a bit more methodical in my research activities. Here you will mainly find the tasks I have been working on lately, and I will try to provide links to any useful material I come across during my explorations.