| Literature DB >> 35455212 |
Wenguang Lin1, Xiaodong Liu1, Renbin Xiao2.
Abstract
Patent data contain plenty of valuable information. Recently, the lack of innovative ideas has resulted in some enterprises encountering bottlenecks in product research and development (R&D). Some enterprises point out that they do not have enough comprehension of product components. To improve efficiency of product R&D, this paper introduces natural-language processing (NLP) technology, which includes part-of-speech (POS) tagging and subject-action-object (SAO) classification. Our strategy first extracts patent keywords from products, then applies a complex network to obtain core components based on structural holes and centrality of eigenvector algorism. Finally, we use the example of US shower patents to verify the effectiveness and feasibility of the methodology. As a result, this paper examines the acquisition of core components and how they can help enterprises and designers clarify their R&D ideas and design priorities.Entities:
Keywords: big data; core components; feature vectors; patent text; structural hole
Year: 2022 PMID: 35455212 PMCID: PMC9026476 DOI: 10.3390/e24040549
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Comparison of patent analysis methods.
| Method | Mode of Action | Advantage | Disadvantage | Literature |
|---|---|---|---|---|
| Complex network | The keywords in the patent text are regarded as nodes, the associations between keywords are regarded as edges, and the complex patent network is constructed for analysis | Strong visualization, which is conducive to clarifying the relationship between keywords and facilitating network analysis | Insufficient dynamic visualization | Iwan, Thorsten and Katja |
| Vector | Performs word vector training on the domain corpus to construct technical efficacy topics | Suitable for large databases, high degree of automation | Lack of judgment on the semantic connection of keywords | Park, Chun and Jeong |
| TRIZ | Through the analysis and extraction of patent knowledge, it is introduced into TRIZ tool to provide a large number of heuristic principles, effects, structures, etc. for solving product innovation problems in specific fields. | Not only makes up for the limitations of TRIZ and the ambiguity and broadness of the obtained solutions, but also makes up for the microscopic nature of knowledge acquired through patents. | Relies on the designer’s subjective experience and domain knowledge | Li et al. |
| keyword map | Transforms technical information in patents into a map of technology-directed functionality | The content is detailed and helpful for understanding technological trends | Difficult to find and organize information. | Kim M, Park Y and Yoon J |
Comparison of core component identification methods.
| Method | Mode of Action | Advantage | Disadvantage | Literature |
|---|---|---|---|---|
| Network algorithm | Map product components to networks, and identify core components through network measurement algorithms. | Easy to measure position and use of nodes in network, strong visualization. | Insufficient dynamic visualization. | Yin et al. |
| Machine learning | The model is trained through sample data, and the trained model is used to analyze and predict data. | High degree of automation, faster training speed. | Influenced by algorithm accuracy and quality. | Zheng et al. |
| QFD | Obtains the components that contribute most to requirements through requirement analysis. | Strong purpose, high excavation accuracy. | Influenced by subjective experience. | Klahn et al. |
| TF-IDF | Takes the frequency of keywords appearing in the text and the frequency of keywords appearing in all texts as the criteria for judging the importance of keywords. | The algorithm is simple and easy to implement. | Semantic order and context in the text are ignored. | Dorji et al. |
Figure 1Example of core component.
Figure 2Framework of core component acquisition.
Parts of speech.
| Tag | Explanation | Tag | Explanation |
|---|---|---|---|
| aux | auxiliary (be) | Root | root node |
| conj | conjunct | Subj | subject |
| nsubj | nominal subject | Obj | object |
| nsubjpass | passive nominal subject | Dobj | direct object |
| SYM | symbol | Amod | adjectival modifier |
| num | numeric modifier | Attr | attributive |
| comp | complement | punct | punctuation |
| … | … | … | … |
Types of SAO structure.
| No | SAO Structure | Example |
|---|---|---|
| 1 |
| The switching mechanism is disposed on fixing holder” (US10464077): |
| 2 |
| the forward position and the backward position are arranged at two sides of the initial position (US10449559): |
| 3 |
| Preferably, the rotation valve comprises a sleeve, a valve core unit, a rotation support and a switch knob (US8720799): |
| 4 |
| The present invention has arisen to mitigate and/or obviate the afore-described disadvantages (US20150354186A1): |
| 5 |
| The temperature display is disposed on the curved pendant and connected with the temperature sensor (US20180202135A1): |
Function diagram of series components.
| No | Component | Functionality | Component |
|---|---|---|---|
| 1 | motor | drive | liquid-ctrl box |
| 2 | liquid-ctrl box | hoist | pump |
| 3 | liquid-ctrl box | pass-through | pipeline |
| 4 | motor | drive | pump |
| 5 | pump | drive | hydraulic oil |
| 6 | filter | filtration | hydraulic oil |
| 7 | pipeline | conduction | hydraulic oil |
| 8 | hydraulic-oil | drive | piston |
| 9 | cylinder | storage | hydraulic oil |
| 10 | cylinder | guide | piston |
| … | … | … | … |
Figure 3Semaphore tree model.
Figure 4Semantic network model.
Figure 5Example of structural hole network.
Figure 6Example of eigenvector network.
US shower patent search results.
| Retrieval of Patents | Result |
|---|---|
| Title or Abstract:(showerhead * OR shower head * OR sprayer *) AND CPC:(B05B1/18) AND Time:(from 1 January 1914 to 1 December 2019) | 1733 |
Patent text SAO acquisition results.
| No | Patent Number | S | A | O |
|---|---|---|---|---|
| 1 | US20150273490A1 | |||
| 2 | US20150273490A1 | Embodiments | include | use |
| 3 | US20150273490A1 | Embodiments | include | showerhead |
| 4 | US20150273490A1 | Embodiments | include | invention |
| 5 | US20150273490A1 | Embodiments | include | chamber |
| 6 | US20150273490A1 | Embodiments | include | insulator |
| … | … | … | … | |
| 1048569 | US10207280 | |||
| 1048570 | US20150273490A1 | body | is | chamber |
| 1048571 | US20150273490A1 | body | is | outlets |
| 1048572 | US20150273490A1 | body | is | joint |
Figure 7Comparison of three algorithms.
Component metrics.
| Id | Weighted Degree |
|---|---|
| valve | 6881 |
| hole | 7950 |
| nozzle | 3615 |
| plate | 5036 |
| handle | 2513 |
| … | … |
| conduit | 1085 |
| cartridge | 1073 |
| controller | 1051 |
| sensor | 846 |
| shaft | 2205 |
Figure 8Complex network of SAO keyword combinations for shower patents.
Calculation results of CRO and CRI.
| Rank by CI | Component | CRO | CPI | CI |
|---|---|---|---|---|
| 1 | valve | 0.702236961 | 1 | 0.85111848 |
| 2 | hole | 0.587594404 | 0.967555 | 0.777574702 |
| 3 | outlet | 0.558008416 | 0.964982 | 0.761495208 |
| 4 | plate | 0.571470938 | 0.866782 | 0.719126469 |
| 5 | inlet | 0.527199305 | 0.855714 | 0.691456652 |
| 6 | pipe | 0.537497082 | 0.824818 | 0.681157541 |
| 7 | axis | 0.521848944 | 0.81001 | 0.665929472 |
| 8 | nozzle | 0.56987848 | 0.738293 | 0.65408574 |
| 9 | handle | 0.562902512 | 0.713606 | 0.638254256 |
| 10 | connector | 0.537076218 | 0.71216 | 0.624618109 |
| 11 | channel | 0.520952 | 0.71189 | 0.61642122 |
| 12 | showerhead | 0.558708 | 0.664609 | 0.611658319 |
| 13 | casing | 0.516216 | 0.664802 | 0.590509075 |
| 14 | cavity | 0.510695 | 0.656919 | 0.583807209 |
| 15 | tube | 0.508913 | 0.652626 | 0.58076929 |
| 16 | hose | 0.526853 | 0.625187 | 0.576020133 |
| 17 | groove | 0.502273 | 0.622823 | 0.562548247 |
| 18 | ring | 0.516556 | 0.600572 | 0.558564007 |
| 19 | cover | 0.525866 | 0.565686 | 0.545776232 |
| 20 | arm | 0.516927 | 0.555637 | 0.536281896 |
| … | … | … | … |
Figure 9Shower product structure.