We have released the source code of the document retrieval component. As mentioned in [1], the retrieval model is an extension of the query likelihood (QL) model that assigns different weights to the original query and to the question and answer text.
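To illustrate the idea of weighting the two text sources, here is a minimal sketch of a weighted query likelihood score. It is not the code in `QL.py`: the Dirichlet smoothing, the linear interpolation with `alpha`, the floor for unseen terms, and all function and parameter names are assumptions made for illustration, and the exact formulation in [1] may differ.

```python
import math
from collections import Counter


def ql_log_score(terms, doc_tf, doc_len, coll_prob, mu=1000.0):
    """Dirichlet-smoothed log query likelihood of `terms` given one document.

    Assumption: Dirichlet smoothing with parameter `mu`; the released
    code may use a different smoothing method.
    """
    score = 0.0
    for term in terms:
        # Small floor for terms missing from the collection statistics
        # (an assumption, to keep log() defined).
        p_coll = coll_prob.get(term, 1e-9)
        p_term = (doc_tf.get(term, 0) + mu * p_coll) / (doc_len + mu)
        score += math.log(p_term)
    return score


def weighted_ql_score(query_terms, qa_terms, doc_terms, coll_prob,
                      alpha=0.5, mu=1000.0):
    """Interpolate the QL score of the original query with the QL score
    of the question and answer text. `alpha` is a hypothetical weight."""
    doc_tf = Counter(doc_terms)
    doc_len = len(doc_terms)
    query_part = ql_log_score(query_terms, doc_tf, doc_len, coll_prob, mu)
    qa_part = ql_log_score(qa_terms, doc_tf, doc_len, coll_prob, mu)
    return alpha * query_part + (1.0 - alpha) * qa_part
```

With `alpha=1.0` the score reduces to plain query likelihood on the original query; lower values shift weight toward the question and answer text.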
The code is available in the `./src/` directory. The main retrieval model is in the `QL.py` file, which uses the Cython function `ql_score` to calculate the relevance score faster. Therefore, before running the code, you must compile and build the Cython package `ql_score` on your device.
**Build Cython package:** To build the Cython package on your device, please go to `./src/ql_score/` and run the following command: `python setup.py build_ext --inplace`
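After building, it can help to confirm that the compiled extension is importable before running the retrieval code. The helper below is a small sketch for that check; the function name is hypothetical, and the module name you pass (for example `ql_score`) depends on how your Python path is set up relative to `./src/`, which is an assumption here.

```python
import importlib


def extension_is_importable(module_name):
    """Return True if `module_name` can be imported, False otherwise.

    Hypothetical helper: pass the import path of the compiled Cython
    module as it appears from your working directory.
    """
    try:
        importlib.import_module(module_name)
        return True
    except ImportError:
        return False
```

If the check returns `False` for the Cython module, re-run the build command above from `./src/ql_score/`.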
Furthermore, our code uses the custom index that we have built, as well as a ClueWeb term statistics file. Both are available [here](http://ciir.cs.umass.edu/downloads/qulac/). The file `run_QL.ipynb` provides a Jupyter Notebook example of how to execute a query.
**NOTE:** The index is built using the Krovetz stemmer. Therefore, in order to replicate the results of [1], you need to install the Krovetz stemmer on your machine and use it in your own code. Once you have installed the stemmer, please modify the `./src/utils/utils.py` file to use it as the default stemmer. To do so, follow the `TODO` comments in the file.
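As a rough sketch of what a pluggable stemmer can look like in your own code, the example below tokenizes text and applies a stemming function. It does not reproduce `./src/utils/utils.py`: the function names, the tokenization regex, and the use of the third-party `krovetzstemmer` Python package are all assumptions, and the `TODO` comments in the repository remain the authoritative guide.

```python
import re


def load_stemmer():
    """Return a word -> stem callable.

    Assumption: the third-party `krovetzstemmer` package provides the
    Krovetz stemmer. If it is not installed, fall back to an identity
    function, which will NOT match the Krovetz-stemmed index.
    """
    try:
        from krovetzstemmer import Stemmer
        return Stemmer().stem
    except ImportError:
        return lambda word: word


def tokenize_and_stem(text, stem=None):
    """Lowercase `text`, split it into alphanumeric tokens, and stem each one."""
    if stem is None:
        stem = load_stemmer()
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [stem(token) for token in tokens]
```

Because query terms must match the stemmed terms stored in the index, the identity fallback is only useful for checking that the pipeline runs, not for replicating the reported results.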