Based on the positive feedback we have received at NeuCLIR Report Generation Pilot last year, we provide a search endpoint using PLAID-X again for participants if you do not want to stand up a search engine by yourself. PLAID-X is a very effective MLIR model, which is developed along with teh NeuCLIR track.
Please register at TREC to get the URL to the endpoint.
You can use the following HTTP POST request to retrieve documents based on your query.
curl -X POST https://get.url.from.trec/query -H "Content-Type: application/json" -d '{
"service": "plaidx-ragtime",
"query": "your query",
"limit": 10
}'
If you are working with the NeuCLIR data for development, you can also search among the NeuCLIR collection by specifying plaidx-neuclir
in the service
field.
You can also specify the language of the documents (only for NeuCLIR) by passing the language code to the subset
field.
For example,
curl -X POST https://get.url.from.trec/query -H "Content-Type: application/json" -d '{
"service": "plaidx-neuclir",
"query": "your query",
"limit": 100,
"subset": "fas"
}'
You can retrieve the document content also from the search endpoint by using the API /content
.
For example,
curl -X POST https://get.url.from.trec/content -H "Content-Type: application/json" -d '{
"collection": "neuclir",
"id": "01f01b3d-89c7-4f97-9d14-bd2dcdff190b"
}'
You can put either neuclir
or ragtime
to the collection
field.
Please send an email to us if you have any question at ragtime-organizers@googlegroups.com or at the TREC Slack.