Complex Queries in PATENTSCOPE Search System

 
C
o
m
p
l
e
x
 
q
u
e
r
i
e
s
 
i
n
 
t
h
e
 
P
A
T
E
N
T
S
C
O
P
E
s
e
a
r
c
h
 
s
y
s
t
e
m
Online
October  
2019
Sandrine Ammann
Marketing & Communications Officer
 
 
Click this button if you can hear my
voice and see my screen
 
 
 
 
Questions/concerns
p
a
t
e
n
t
s
c
o
p
e
@
w
i
p
o
.
i
n
t
 
Complexity
 “Complexity” of queries depends on:
 
Q: How many years of experience do you have in
patent searching?
A
B
Less than 5 years
More than 5 years
 
Simple Search
 
Agenda
Interfaces
Search fields
Syntax
Examples
Combination of interfaces
Q&A
 
2 Interfaces
 
 
 
 
Field Combination - pros
Predefined fields
Immediate results on the same page
 
Field combination - cons
 
Interface: Advanced search
 
 
EN_ALLTXT; PA; DP  = fields
AND = operator
 
Fields: where to search
    
Source: http://spicewallpaper.blogspot.ch/2012/08/green-fields-with-blue-sky.html
 
 
 
Examples
FP = front page
ALL = all fields
ALL_NAMES = all names
IC = IPC
DP = publication date
CTR = country either WO or country from nat collection
NPCC= national phase entry
AN = origin of PCT
 
Date search
Simple:
DP:01.02.2000
DP:20000201
DP:02.2000
DP:200002
DP:2000
 
Example: IPC
IC = International Classification
IC :A
IC :A47
IC :A47L
IC :A47L1
IC:A47L11
IC:A47L11/03
 
D
0
6
F
 
1
/
0
6
 
w
i
l
l
 
i
n
c
l
u
d
e
 
b
y
 
d
e
f
a
u
l
t
 
 
 
 
D
0
6
F
 
1
/
0
8
 
 
 
1
/
1
0
 
 
 
1
/
1
6
To exclude subgroup: IC_EX
ICI = International Classification Inventive
ICN = International Classification Non-inventive
ICI_EX    ICN_EX  = no subgroup
 
Example: grant
 
Fields rules
Basic fields: elements of a patent document
Derived fields
2 letter code = individual field
EN_TI  
 
FR_AB 
 
 ES_DE_S
Convention: language specified by 2 letters
  
          if not specified all languages
S = stemmed
:
 
t
o
 
s
e
p
a
r
a
t
e
 
t
e
r
m
 
w
i
t
h
o
u
t
 
a
n
y
 
s
p
a
c
e
 
Fields: golden rules
EN_ALL = default field       field indicator not required
Field name followed by : ":" or "/"
The field is only valid for the term that it directly
precedes, so the query:
    
 
 
EN_TI:("wind turbine" AND electric) solar
"
w
i
n
d
 
t
u
r
b
i
n
e
"
 
A
N
D
 
e
l
e
c
t
r
i
c
 
i
n
 
t
h
e
 
t
i
t
l
e
 
f
i
e
l
d
"
s
o
l
a
r
"
 
i
n
 
t
h
e
 
d
e
f
a
u
l
t
 
f
i
e
l
d
 
(
E
N
_
A
L
L
 
)
.
 
 
Grouping/nesting
Solar OR (wind AND turbine)
(solar OR wind) AND turbine
EN_TI: electric car
 electric will be searched in English title but car in all fields
EN_TI: (electric car)
Both electric and car will be searched in the English title
 
Range search
Range:
DP:[01.01.2000 TO 01.01.2001]
 
Can also be used to search non-date fields
IN: {Smith to Terence}
 
Boolean operators
AND
OR
NOT
ANDNOT
 
ANDNOT - NOT
Use ANDNOT when searching A excluding B
Ex: bicycle ANDNOT boat
Use NOT when searching all documents except A
 Ex:NOT(car AND bicycle AND boat)
 
Proximity operator NEAR
Finds words that are next to each other
NEAR3         3 = the max nb of word gaps between 2
search terms
 
Proximity search: BEFORE
the order of terms is significant.
t
r
u
n
k
 
B
E
F
O
R
E
 
c
u
t
t
i
n
g
 
An example
 
 
 
 
Keywords
Stemming
Wildcard
Truncation
Fuzzy
 
Stemming
 
Stemming
Stem = stemming
Process that removes common endings from words.
 
critical
 
critically
 
criticism
 
each word is reduced to ‘critic’
    
 
criticisms
     
 
critics
 
 
Stemming
no dictionary includes the necessary technical terms to
express patent concepts
Porter Stemming Algorithm finds words that contain
common roots
Save time and effort
 
 
 
Search without stemming
 
 
Same search with stemming
 
Wildcards/truncation : ?   *
* stands for 0 or more characters
? stands single character
  
te?t = test or text
  
electric* = electrical; electricity
  
behavi*r = behaviour or behavior
  
micro?p* = microspeaker, microsporidial
 
   
 
 
Use of wildcards
Spelling uncertainty (plural, tenses, foreign words):
tyre vs. tire       t*re
University vs Universität      Universit* Stuttgart
Multiple spelling variants are known:
color vs. colour      col*
Preferred option over stemming:
      electric vs. electricity 
 
   electri*
 
Wildcard vs stemming
Logic results:
n
avy
, 
navies
 or 
naval
 if 
nav* =
 
navigating
, 
navigation
,
electricity
 or 
electric
 if 
elect*
 = 
electoral
 
Fuzzy searches
     
Use of the tilde: ~
Examples:
 roam~      
  
foam / roams
Roam~
0.8
      Useful to find misstpyed, misspelt or mis-OCRed words
 
^ caret = weighting factor
Same result but ranking will be different
touch^3 AND polarize
 
 
 
 
Example: national phase entry
All applications that entered national phase in China in
2012
 
Instant help
Validates search query
Suggests terms
Provides list of:
IPC codes
countries
 
Help
 
 
 
 
 
Result combination
Combine search with chemical structure search
Combine search with CLIR
 
For queries with compounds
 
 
 
 
 
Combining with CLIR
 
 
 
 
 
Most common errors
(….)   "…"
" 
 not 
Field name
No space
Wildcard at the 
beginning
 of a word
 
Next webinar: November 19 or 21
Chemical structure search in
PATENTSCOPE
To register: 
http://www.wipo.int/patentscope/en/webinar/
 
wipo.int/patentscope/en/webinar 
 
Global Brand Database: webinar
Overvire of the Global Brand Database
October 23 at 5:30pm CET
https://attendee.gotowebinar.com/register/62095384934
36706818
 
Global Design Database: webinar
Overview of the Global Design Database
October 25 at 5:30pm CET
https://attendee.gotowebinar.com/register/79025898538
4599810
 
 
 
 
 
p
a
t
e
n
t
s
c
o
p
e
@
w
i
p
o
.
i
n
t
Slide Note
Embed
Share

Discover the intricacies of performing advanced searches in the PATENTSCOPE online system as explained in a presentation by Sandrine Ammann, Marketing & Communications Officer. The presentation delves into the complexity and nuances of conducting patent searches, offering insights on the factors affecting query complexity and showcasing tools available to enhance search capabilities.

  • PATENTSCOPE
  • Search System
  • Complex Queries
  • Patent Searching
  • Sandrine Ammann

Uploaded on Sep 12, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Complex queries in the PATENTSCOPE search system Online October 2019 Sandrine Ammann Marketing & Communications Officer

  2. Click this button if you can hear my voice and see my screen

  3. Questions/concerns patentscope@wipo.int

  4. Complexity Complexity of queries depends on:

  5. Q: How many years of experience do you have in patent searching? A Less than 5 years B More than 5 years B

  6. Simple Search

  7. Agenda Interfaces Search fields Syntax Examples Combination of interfaces Q&A

  8. 2 Interfaces

  9. Field Combination - pros Predefined fields Immediate results on the same page

  10. Field combination - cons

  11. Interface: Advanced search

  12. EN_ALLTXT; PA; DP = fields AND = operator

  13. Fields: where to search Source: http://spicewallpaper.blogspot.ch/2012/08/green-fields-with-blue-sky.html

  14. Examples FP = front page ALL = all fields ALL_NAMES = all names IC = IPC DP = publication date CTR = country either WO or country from nat collection NPCC= national phase entry AN = origin of PCT

  15. Date search Simple: DP:01.02.2000 DP:20000201 DP:02.2000 DP:200002 DP:2000

  16. Example: IPC IC = International Classification IC :A IC :A47 IC :A47L IC :A47L1 IC:A47L11 IC:A47L11/03

  17. D06F 1/06 will include by default D06F 1/08 1/10 1/16 To exclude subgroup: IC_EX ICI = International Classification Inventive ICN = International Classification Non-inventive ICI_EX ICN_EX = no subgroup

  18. Example: grant

  19. Fields rules Basic fields: elements of a patent document Derived fields 2 letter code = individual field EN_TI FR_AB Convention: language specified by 2 letters if not specified all languages S = stemmed ES_DE_S : to separate term without any space

  20. Fields: golden rules EN_ALL = default field field indicator not required Field name followed by : ":" or "/" The field is only valid for the term that it directly precedes, so the query: EN_TI:("wind turbine" AND electric) solar "wind turbine" AND electric in the title field "solar" in the default field (EN_ALL ).

  21. Grouping/nesting Solar OR (wind AND turbine) (solar OR wind) AND turbine EN_TI: electric car electric will be searched in English title but car in all fields EN_TI: (electric car) Both electric and car will be searched in the English title

  22. Range search Range: DP:[01.01.2000 TO 01.01.2001] Can also be used to search non-date fields IN: {Smith to Terence}

  23. Boolean operators AND OR NOT ANDNOT

  24. ANDNOT - NOT Use ANDNOT when searching A excluding B Ex: bicycle ANDNOT boat Use NOT when searching all documents except A Ex:NOT(car AND bicycle AND boat)

  25. Proximity operator NEAR Finds words that are next to each other NEAR3 3 = the max nb of word gaps between 2 search terms

  26. Proximity search: BEFORE the order of terms is significant. trunk BEFORE cutting

  27. An example

  28. Keywords Stemming Wildcard Truncation Fuzzy

  29. Stemming

  30. Stemming Stem = stemming Process that removes common endings from words. critical critically criticism criticisms critics each word is reduced to critic

  31. Stemming no dictionary includes the necessary technical terms to express patent concepts Porter Stemming Algorithm finds words that contain common roots Save time and effort

  32. Search without stemming

  33. Same search with stemming

  34. Wildcards/truncation : ? * * stands for 0 or more characters ? stands single character te?t = test or text electric* = electrical; electricity behavi*r = behaviour or behavior micro?p* = microspeaker, microsporidial

  35. Use of wildcards Spelling uncertainty (plural, tenses, foreign words): tyre vs. tire t*re University vs Universit t Universit* Stuttgart Multiple spelling variants are known: color vs. colour col* Preferred option over stemming: electric vs. electricity electri*

  36. Wildcard vs stemming Logic results: navy, navies or naval if nav* = navigating, navigation, electricity or electric if elect* = electoral

  37. Fuzzy searches Use of the tilde: ~ Examples: roam~ foam / roams Roam~0.8 Useful to find misstpyed, misspelt or mis-OCRed words

  38. ^ caret = weighting factor Same result but ranking will be different touch^3 AND polarize

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#