Search

Document

Each database in DBRepo has its own document, where the document id (the identifier of a database in OpenSearch) is identical to the database id (the identifier of a database in the Metadata Database).

Since the repository automatically collects statistical properties (min, max, mean, median, std.dev) in both the Metadata Database and the Search Database, a special search can be performed when at least two columns are annotated with the same semantic concept (e.g. temperature) and their units of measurement can be converted into one another.

Figure 3: Two tables with compatible semantic concepts (temperature) and units of measurement (left in degrees Celsius, right in degrees Fahrenheit)
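The compatibility condition described above can be illustrated with a minimal sketch. It is not DBRepo's implementation: the `Column` class, the `UNIT_FAMILY` lookup table and the `compatible` function are hypothetical names introduced here, and the unit families are hard-coded only for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical lookup: units in the same family are assumed to be convertible.
UNIT_FAMILY = {
    "degree Celsius": "temperature",
    "degree Fahrenheit": "temperature",
    "kelvin": "temperature",
    "metre": "length",
}


@dataclass(frozen=True)
class Column:
    name: str
    concept: str  # annotated semantic concept, e.g. "temperature"
    unit: str     # annotated unit of measurement


def compatible(a: Column, b: Column) -> bool:
    """Return True when both columns share a semantic concept and
    their units belong to the same (known) unit family."""
    family_a: Optional[str] = UNIT_FAMILY.get(a.unit)
    family_b: Optional[str] = UNIT_FAMILY.get(b.unit)
    return a.concept == b.concept and family_a is not None and family_a == family_b


left = Column("temp", "temperature", "degree Celsius")
right = Column("temp", "temperature", "degree Fahrenheit")
print(compatible(left, right))
```

Columns whose unit is unknown to the lookup are treated as incompatible, which keeps the sketch conservative.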

In short, the search service uses the omlib package to transform statistical properties that are not in the target unit of measurement.

For example, a user wants to find datasets that contain "temperature measurements between 0 - 10 °C". For the dataset on the right, whose values are stored in °F, the search service transforms the query from °C to °F so that it asks for "temperature measurements between 32 - 50 °F" instead.
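The arithmetic behind this example can be sketched as follows. The sketch hard-codes the Celsius-to-Fahrenheit formula rather than calling omlib, and it assumes that a dataset matches when the query range overlaps the column's stored min/max; the function names `celsius_to_fahrenheit`, `transform_range` and `overlaps` are hypothetical and do not reflect the search service's actual API.

```python
from typing import Tuple


def celsius_to_fahrenheit(celsius: float) -> float:
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return celsius * 9.0 / 5.0 + 32.0


def transform_range(low_c: float, high_c: float) -> Tuple[float, float]:
    """Rewrite a query range given in °C into the equivalent range in °F."""
    return celsius_to_fahrenheit(low_c), celsius_to_fahrenheit(high_c)


def overlaps(query: Tuple[float, float], col_min: float, col_max: float) -> bool:
    """Assumed matching rule: the query range intersects [col_min, col_max]."""
    low, high = query
    return col_min <= high and col_max >= low


# The user's query "0 - 10 °C", rewritten for a column stored in °F.
query_f = transform_range(0.0, 10.0)
print(query_f)  # (32.0, 50.0)

# A column in °F with min 41 and max 68 would match the rewritten query.
print(overlaps(query_f, col_min=41.0, col_max=68.0))  # True
```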

Figure 4: Unit independent search query transformation