PatentSemTech aims to establish a long-term collaboration and a two-way communication channel between the IP industry and academia from relevant fields such as natural-language processing (NLP), text and data mining (TDM) and semantic technologies (ST) in order to explore and transfer new knowledge, methods and technologies for the benefit of industrial applications as well as support research in applied sciences for the IP and neighbouring domains.
PatentSemTech'23 workshop will be held as a full-day onsite event in conjunction with SIGIR 2023 .
Time zone: Anywhere on Earth (AoE)
|SIGIR PatentSemTech2023 workshop||July 27, 2023|
From the definition of a search task perspective, users of patent information systems are highly specialised information professionals, who cooperate with research and/or legal departments in their institutions / companies. The search in this area is generally business critical. There are high requirements on the correctness and completeness of the data to search through, on the efficiency of the search interface, and on the trustworthiness of the provider, on the quality of the search results. For general language documents (like news articles, or Wikipedia articles) there is a variety of tools and methods to process and prepare them for a specific task. It is a most challenging undertaking to adapt or re-design such tools to address the requirements of working with patent and legal documents.
Patent are a type of scientific text which is complex and difficult to analyse compared to the common language. Without being complete, some reasons are:
Working with patent data, besides its challenging aspects, does bring a richness of facets to be exploited with text-mining and semantic methods: