Developing a methodology of structuring and layering technological information in patent documents through natural language processing

Taeyeoun Roh, Yujin Jeong, Byungun Yoon

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Since patents contain various types of objective technological information, they are used to identify the characteristics of technology fields. Text mining in patent analysis is employed in various fields such as trend analysis and technology classification, and knowledge flow among technologies. However, since keyword-based text mining has the limitation whereby, when screening useful keywords, it frequently omits meaningful keywords, analyzers therefore need to repeat the careful scrutiny of the derived keywords to clarify the meaning of keywords. In this research, we structure meaningful keyword sets related to technological information from patent documents; then we layer the keywords, depending on the level of information. This research involves two steps. First, the characteristics of technological information are analyzed by reviewing the patent law and investigating the description of patent documents. Second, the technological information is structured by considering the information types, and the keywords in each type are layered through natural language processing. Consequently, the structured and layered keyword set does not omit useful keywords and the analyzer can easily understand the meaning of each keyword.

Original languageEnglish
Article number2117
JournalSustainability (Switzerland)
Volume9
Issue number11
DOIs
StatePublished - 17 Nov 2017

Keywords

  • NLP
  • Patent analysis
  • Technological information
  • Text mining
  • Text structure

Fingerprint

Dive into the research topics of 'Developing a methodology of structuring and layering technological information in patent documents through natural language processing'. Together they form a unique fingerprint.

Cite this