Skip Navigation

 
 

Unified Medical Language System® (UMLS®) Basics

Page 5 of 6
   
Overview  
Metathesaurus  
Semantic Network  
Lexical Tools  
UMLS Tools  
User Support  
     
Previous (active) Previous Next (active)
     
Glossary  
FAQ  
bullet11 Contact Us  
bullet12 Home  
bullet13 Feedback  
     
UMLS logo
 
 
 
 
arrow
 
 
 
 

The Norm Program

The lexical program, Norm, generates the normalized strings for terms included in the SPECIALIST Lexicon. The normalization process involves stripping possessives, replacing punctuation with spaces, removing stop words such as "No Other Specification" or NOS, lower-casing each word, breaking a string into its constituent words, and sorting the words in alphabetic order.

Below is an example of the normalization process for the term “Hodgkin's diseases, NOS.”

  

 

Hodgkin’s diseases, NOS

Remove genitive

Hodgkin diseases, NOS

Remove stop words

Hodgkin diseases,

Lowercase

hodgkin diseases,

Strip punctuation

hodgkin diseases

Uninflect

hodgkin disease

Sort words

disease hodgkin

The Norm program is used in systems to:

  • Find similar terms
  • Map terms to UMLS concepts
  • Find lexical variants for a term

National Library of Medicine | National Institutes of Health | Department of Health and Human Services
Freedom of Information Act | Copyright | Privacy Policy

Last Updated: October 20, 2008
First published: October 20, 2008
    Click Next to continue.