Inversion#
Inversion: a sequence change where, compared to a reference sequence, more than one nucleotide replacing the original sequence is the reverse complement of the original sequence.
Syntax#
| Syntax | sequence_identifier ":" coordinate_type "." range "inv" |
|---|---|
| Examples |
|
| Explanation of Symbols | |
| |
Notes#
- by definition, the region inverted (
positions_inverted) contains more than one nucleotide. The descriptiong.234invis therefore not allowed; a one-nucleotide inversion should be described as a substitution. - for all descriptions, the most 3' position possible of the reference sequence is arbitrarily assigned to have been changed (3'rule).
- the 3'rule applies to ALL descriptions (genome, gene, transcript, and protein) of a given variant.
- inverted duplications are described as an insertion (
g.234_235ins123_234inv), not asg.123_456dupinv(see Q&A). - two variants separated by one or more nucleotides should be described individually and not as a "delins".
exception: two variants separated by one nucleotide, together affecting one amino acid, should be described as a "delins". - inversions are not used on protein level. Depending on the (predicted) consequences of an inversion on protein level, changes are usually described as either a delins or a frameshift.
Examples#
-
NC_000023.10:g.32361330_32361333inv
inversion of nucleotidesg.32361330tog.32361333, changing..CATCAGCCT..to..CACTGACCT... -
NM_004006.2:c.5657_5660inv
inversion of nucleotidesc.5657toc.5660(coding DNA reference sequence), changing..AGGCTGATG..to..AGGTCAGTG... -
NM_004006.2:c.4145_4160inv
inversion of the 16 nucleotides from positionc.4145toc.4160. -
NC_000023.10:g.111754331_111966764inv
a large inversion (212,434 nucleotides) from positiong.111754331tog.111966764. -
NM_004006.2:c.849_850ins850_900inv
a copy of nucleotidesc.850toc.900is inserted, in inverted orientation, 5' of the original sequence, between nucleotidesc.849andc.850. -
NM_004006.2:c.900_901ins850_900inv
a copy of nucleotidesc.850toc.900is inserted, in inverted orientation, 3' of the original sequence, between nucleotidesc.900andc.901. -
LRG_199t1:c.940_941ins[885_940inv;A;851_883inv]
an inverted copy of nucleotidesc.851toc.940, with aG>Asubstitution of nucleotidec.884, is inserted directly 3' of the original sequence. -
NM_004006.2:c.940_941ins[903_940inv;851_885inv]
an inverted copy of nucleotidesc.851toc.940, with a deletion from nucleotidesc.886toc.902, is inserted directly 3' of the original sequence.
Discussion#
Is the change AAGC to TTCG an inversion?
No, an inversion would change AAGC to GCTT, its reverse-complement.
TTCG is only the complement of AAGC.
Is the change AAGC to CGAA an inversion?
No, an inversion would change AAGC to GCTT, its reverse-complement.
CGAA is only the reverse of AAGC.
Is it not better to describe the variant g.234_235ins123_234inv as g.123_234dupinv?
The descriptions of duplications is regularly debated; "Why not remove the variant type "duplication" and describe all dups as "insertion", it will make the HGVS rules in total simpler".
While we can not do this for historic reasons (duplications are in use since the beginning), we will restrict the use of "dup" as much as possible.
Regarding a "dupinv", one could argue that an "inverted copy" is not "a copy inserted directly 3' of the original copy" and thus by definition this variant is not a duplication but an "insertion".
Therefore, the recommendation is to describe inverted duplication using the format g.122_123ins123_234inv or g.234_235ins123_234inv depending on whether the inverted copy is 5' or 3' of the original copy (reference sequence).