wiki:FCS-Agenda

Version 11 (modified by vronk, 13 years ago) (diff)

added search-result and search context

FCS Agenda

This is to collect ToDos? and Tasks regarding Federated Content Search.

1. Pin down the basic search-result format

The most urgent questions are:

  1. Do we agree on <ccs:Metadata>-element?
  2. What sensible Resource(Fragment)-reference can we provide in Resource@pid ResourceFragment@pid
  3. Do we agree on DataView[@type='kwic'] format (<kw>-element)?
  4. Can we have some kind of default string representation for every record, using title, name or anything else that helps to identify the resource for the user? (I.e. PID is not very useful for this)

2. Specification of DataViews

In the next step we need a definition of the actual format of the individual DataView Types. This has to be harmonized with the work on Viewers. A separate page Viewable elaborates on this relationship.

Interested teams should take over individual types and lead working out a proposal. Tentative assignements:

text, annotated text, written corpus, book
Thomas@UniTueb; Matej@ICLTT
list, matrix
Thomas@UniTueb
geolocation
Marc@Meertens

Implement Search on Indexes

After the basic connectivity is established and simple queries can be submitted, the work shall be started on allowing to query specific indices. This involves solving the announcement of indices via explain/scan and correctly parsing the non-trivial query:

  index relation searchTerm

Search context

Elaborate on ways of selecting a search context (Repositories/Collections/Resources? to content-search in) either extensionally by providing a list of identifiers or intensionally by MD-query. See more on SearchContext and CDMDC.

Use cases

We need concrete scenarios to be able to check if what we do is any useful.

Further test Pazpar2

Assigned: Matej, ICLTT

There is a test-instance of PazPar2 running, that now can already connect to FCS-services. Currently main issue is Pazpar2 not being able to pass through XML-data inside the customizable fields.