New Guidelines 2020

A closer look at our more recent HGNC guideline updates

The last blog post featured our Summer Newsletter, and we discussed some of the issues brought up in our Nature Genetics comment article on our guidelines including mention of our solution for the “Excel auto-changing date symbols” problem. A summary of our current guidelines can be found on our website.

We hope you have had the time to read our paper and thought you might find it interesting to read a little more about some of the updates we have made to our guidelines, with some extra examples that we didn’t have room to mention!

Naming may be based on recognized structural domains and motifs encoded by the gene

If the function of a gene is unknown, but its encoded protein contains a particular domain and/or motif, this can give an insight into its likely function. Gene symbols assigned based on domains are often retained with extra information added to the gene name once the function of its protein product has been determined.

For example:

Naming may be based on homologous genes within the human genome

A shared root symbol can be used to reflect paralogy, with each member receiving the next number or letter in the series. Genes of unknown function in an established family are given the next symbol in the series but with a different gene-name format. If there is no functional information known about any family members then all genes in that family may be given a placeholder FAM# root. This may be updated at a later date, or retained if it has become established in the literature and researchers are keen to do this.

Naming may be based on homologous genes from another species

Some human genes are named based on homologous characterized genes from another species. In the past, the name of the species that we based this on would have also been included in the human gene name in brackets, but many users told us this caused confusion. Furthermore, now that we name genes in other vertebrate species, we can appreciate that sometimes including reference to a specific species in a gene name can be confusing. We have now removed these references from our gene names.

When naming is based on a 1:1 orthology relationship between human and another species we use the same, or at least an equivalent, symbol as used in the species in question.

For example:

     

A unique number or letter suffix is added if there is more than one human homolog:

The single yeast ortholog of these genes is called SEC22 in the Saccharomyces genome database (SGD).

Some gene nomenclature needed to be updated to make it more appropriate and/or informative for use in human. For example:

Finally, some gene names ported over from homologs in other species needed to be changed because as well as being inappropriate in human they could also be perceived as offensive or pejorative.

For example:

Naming may be based only on the presence of an open reading frame

If there is an open reading frame but nothing is known about the gene product, it can be assigned a C#orf# placeholder symbol. The numbering is designated by chromosome, the letters “orf” standing for “open reading frame” and an iterative number. KIAA# is another placeholder symbol that has been approved for genes identified by the Kazusa cDNA sequencing project.

You can read more about placeholder symols in this previous blog post.

We no longer name…

HGNC historically named human loci associated with clinical phenotypes. This role has been transferred to OMIM, and as our existing phenotype records were not complete or being regularly updated we decided to withdraw them. Hence all previous “phenotype only” HGNC records now have the status of ‘entry withdrawn’. However, some phenotype based gene symbols have been highly published and so have been retained, with updates to their associated gene names to reflect the normal function of the proteins they encode.

You can read our previous blog posts to find out why we are now striving to stabilize gene symbols and about why nevertheless, we sometimes do make changes to gene symbols when there are compelling reasons to do so.

Comments or questions about our guidelines?

If you’ve enjoyed reading our comment article on our guidelines but have further specific questions about gene naming, you can contact us via email at hgnc@genenames.org.