Slide 11 of 19
Notes:
Well, earlier I spoke about having a scalable methodology for preparing
large vocabularies. This is our methodology:
Each source is converted into what is a “normal” or canonical form. This inversion process requires careful consideration of how the source represents its meanings, and attempts to make them explicit.
Then, based on this representation, each source is added to the Metathesaurus. Terms from different sources which are lexically similar or appear fro other indications to be semantically identical are merged together .
After this merging, the results are reviewed by editors, who may add a modicum of additional information. This human review is highly leveraged by computational assistance, as is the quality assurance which takes place after the editing.