No changes were made to the release format of CHV or the processing for the 2012AA Metathesaurus. Updates to the CHV content primarily consisted of deletion and/or correction of misspelled terms.
In the previous version, CHV term values with potential spelling errors were identified by setting MRCONSO.SUPPRESS="E". There are 20 atoms remaining with SUPPRESS="E" however these will be reviewed in future versions to determine if they should be changed to "N".
|CHV_concepts_terms_flatfile_20110204.tsv||Tab-separated data file
TTY= PT is assigned where CHV_preferred_name="yes"
TTY=SY is assigned where CHV_preferred_name="no"
||Combination of frequency, context and CUI scores. Also uses whether or not the term is a top word. (real number)
||A slight modification to Combo_score that ignores top word criterion. The top word list is a list of easy words from the Dale-Chall list. (real number)
||Context based estimate of the difficulty of the term. (real number)
||Estimate of the difficulty of the concept (CUI) derived from determining how closely related the concept is to known examples of easy and difficult concepts. (real number)
|DISPARAGED||A value of "yes" in the CHV data indicates a misspelling or other abnormality. For this version, disparaged terms were not processed, so all cases of ATN="DISPARAGED" have ATV="no"||Disparaged field (yes/no flag)|
||Estimate of thedifficulty of a term, i.e. how likely it is that an average reader will be familiar with or understand a given term. Based on the frequency in several large text corpora. A higher score indicates that a term is more familiar (less difficult). (real number)