Unified Medical Language System® (UMLS®)
2011AB HUGO Source Information
General Notes/Comments:
The Human Genome Organisation (HUGO), established in 1989, is an international organization whose primary ethos is to promote and sustain collaboration in the field of human genetics.
Data is supplied in a single, tab-delimited, text file.
Summary of Changes
)(1) The 'Aliases' and 'Name Aliases' fields were renamed 'Synonyms' and 'Name Synonyms', respectively.
(2) Values in the 'Enzyme IDs' field no longer have the 'EC' prefix.
Summary of Source-Provided Files
)Documentation and Reference
- www.genenames.org
- www.hugo-international.org
| File Name | Description |
|---|---|
| HUGO_2011_AllData_pl.txt | Complete source data in a text file with tab delimited fields. |
Source-Provided File Details
)| Field Number | Field Name | Description | Representation |
|---|---|---|---|
| 1 | HGNC ID | A unique ID provided by the HGNC | MRCONSO.CODE; MRCONSO.SCUI |
| 2 | Approved Symbol | Official gene symbol | MRCONSO.STR; MRSAT.ATN=GENESYMBOL |
| 3 | Approved Name | Official gene name | MRCONSO.STR |
| 4 | Status | Indicates gene classification (e.g., withdrawn, approved) | Not processed |
| 5 | Locus Type | Specifies type of locus (e.g., pseudogene, protocadherin) | MRSAT.ATN=LOCUS_TYPE |
| 6 | Locus Group | Locus Group | MRSAT.ATN=LOCUS_GROUP |
| 7 | Previous Symbols | Symbols previously approved | MRSAT.ATN=PREV_SYMBOL |
| 8 | Previous Names | Gene names previously approved | MRSAT.ATN=PREV_NAME |
| 9 | Synonyms | Other symbols used to refer to the gene | MRCONSO.STR |
| 10 | Name Synonyms | Other names used to refer to the gene | MRCONSO.STR |
| 11 | Chromosome | Indicates the location of the gene on the chromosome | MRSAT.ATN=CHROMOSOME |
| 12 | Date Approved | Date gene symbol and name were approved | MRSAT.ATN=DATE_CREATED |
| 13 | Date Modified | Date entry was modified | MRSAT.ATN=DATE_LAST_MODIFIED |
| 14 | Date Symbol Changed | Date gene symbol was last changed | MRSAT.ATN=DATE_SYMBOL_CHANGED |
| 15 | Date Name Changed | Date the gene name was last changed | MRSAT.ATN=DATE_NAME_CHANGED |
| 16 | Accession Numbers | Accession numbers | MRSAT.ATN=ACCESSION_NO |
| 17 | Enzyme IDs | Enzyme Commission (EC) number | MRSAT.ATN=EZ |
| 18 | Entrez Gene ID | Entrez Gene ID | MRSAT.ATN=ENTREZGENE_ID |
| 19 | Ensembl Gene ID | Ensembl Gene ID | MRSAT.ATN=ENSEMBLGENE_ID |
| 20 | Mouse Genome Database ID | Mouse Genome Database ID | MRSAT.ATN=MGD_ID |
| 21 | Specialist Database Links | Contains HTML links to specialist databases | MRSAT.ATN=DB_XR |
| 22 | Specialist Database IDs | Contains the ID within each specialist database | MRSAT.ATN=DB_XR_ID |
| 23 | Pubmed IDs | Identifier that links to published articles | MRSAT.ATN=PMID |
| 24 | RefSeqIDs | RefSeqIDs | MRSAT.ATN=REFSEQ_ID |
| 25 | Gene Family Name | Name of the family the gene has been assigned to | MRSAT.ATN=GENE_FAM |
| 26 | Record Type | valid values: Parent, Standard | MRSAT.ATN=RECORD_TYPE |
| 27 | Primary IDs | valid values: blank, hypen | MRSAT.ATN=PRIMARY_ID |
| 28 | Secondary IDs | numeric value | MRSAT.ATN=SECONDARY_ID |
| 29 | CCDS IDs | Consensus CDS (CCDS) project ID | MRSAT.ATN=CCDS_ID |
| 30 | VEGA IDs | VEGA ID | MRSAT.ATN=VEGA_ID |
| 31 | Locus Specific Databases | Contains links to databases pertinent to the gene | MRSAT.ATN=LOCUS_SPECIFIC_DB_XR |
| 32 | GDB ID (mapped data) | GDB ID (mapped data) | MRSAT.ATN=MAPPED_GDB_ID |
| 33 | Entrez Gene ID (mapped data) | Entrez Gene ID (mapped data supplied by NCBI) | MRSAT.ATN=MAPPED_ENTREZGENE_ID |
| 34 | OMIM ID (mapped data) | OMIM ID (mapped data supplied by NCBI) | MRSAT.ATN=OMIM_NUMBER |
| 35 | RefSeq (mapped data) | RefSeq (mapped data supplied by NCBI) | MRSAT.ATN=MAPPED_REFSEQ_ID |
| 36 | UniProt ID (mapped data) | UniProt ID (mapped data supplied by UniProt) | MRSAT.ATN=SWP |
| 37 | Ensembl Gene ID (mapped data) | Ensembl Gene ID (mapped data supplied by Ensembl) | MRSAT.ATN=MAPPED_ENSEMBLGENE_ID |
| 38 | UCSC (mapped data) | UCSC (mapped data supplied by UCSC) | MRSAT.ATN=UCSC_ID |
| 39 | Mouse Genome Database ID (mapped data) | Mouse Genome Database ID (mapped data supplied by MGI) | MRSAT.ATN=MAPPED_MGD_ID |
| 40 | Rat Genome Database ID (mapped data) | Rat Genome Database ID (mapped data supplied by RGD) | MRSAT.ATN=MAPPED_RGD_ID |
(1) The following fields can contain multiple values in a comma delimited list:
Accession NumbersEnzyme ID
Gene Family Name
Name Synonyms
Previous Names
Previous Symbols
Pubmed ID
RefSeq IDs
Specialist Database ID
Specialist Database Links
Synonyms
(2) HUGO's mapped data is derived from external sources and is not subject to their checking and curation procedures.
