Splicing#
Splicing: a sequence change where, compared to a reference sequence, the normal RNA splicing pattern is altered.
Syntax#
Variants affecting RNA splicing result in either a deletion or insertion on the RNA level and should be described as such. Recommendations on representing adjoined transcripts formed by gene fusions are discussed in the Notes and Examples below.
Notes#
- all variants should be described on the DNA level; descriptions on the RNA and/or protein level may be given in addition.
- a
,(comma) is used to separate different transcripts/proteins derived from one allele;r.[123a>u,122_154del]. - HGVS recommends following the HGNC guidelines and the VICC Gene Fusion Specification nomenclature to describe products of gene fusions.
- The HGNC recommendations include using a
GENESYMBOL1::GENESYMBOL2syntax for gene-level fusion descriptions, and aGENESYMBOL1-GENESYMBOL2syntax for read-through transcripts. - The VICC nomenclature extends the HGNC recommendations to include a terminology, information model, and nomenclature for gene-level and exon-level representation, with components for disambiguating regulatory fusions from chimeric transcript fusions.
- HGVS also recommends the use of adjoined transcripts (see examples) for precise and unambiguous characterization of chimeric transcripts at the sequence level.
- The HGNC recommendations include using a
Examples#
-
one variant, several transcripts
NC_000023.11(NM_004006.2):r.[897u>g,832_960del]
two different transcripts,r.897u>gandr.832_960del, derive from one variant (c.897T>Gon the DNA level).
-
splice acceptor site
-
NC_000023.11(NM_004006.2):r.650_831del
as a consequence of a variant destroying a splice acceptor site, the sequence from nucleotider.650tor.831(exon 8) is deleted from the transcript. -
NC_000023.11(NM_004006.2):r.650_712del
as a consequence of a variant destroying a splice acceptor site, a new acceptor site in exon 8 (positions712/713) is activated and the sequence from nucleotider.650tor.712is deleted from the transcript. -
NC_000023.11(NM_004006.2):r.649_650ins[650-52_650-2;c]
as a consequence of a variant destroying a splice acceptor site (c.650-1G>C), a new acceptor site in intron 7 is activated and the intron 7 sequence from positions650-52to650-1is inserted in the transcript.
NOTE: nucleotide650-1changed fromgtoc.
-
-
splice donor site (
c.831+2T>A)-
NC_000023.11(NM_004006.2):r.650_831del
as a consequence of a variant destroying the exon 8 splice donor site, the sequence from nucleotider.650tor.831(exon 8) is deleted from the transcript. -
NC_000023.11(NM_004006.2):r.778_831del
as a consequence of a variant destroying the exon 8 splice donor site, a new donor site in exon 8 (positions777/778) is activated and the sequence from nucleotider.778tor.831is deleted from the transcript. -
NC_000023.11(NM_004006.2):r.831_832ins[ga;831+3_831+60]
as a consequence of a variant destroying the exon 8 splice donor site, a new donor site in intron 8 (positions831+60/831+61) is activated and the intron 8 sequence from positions831+1to831+60is inserted in the transcript.
NOTE: nucleotide831+2changed fromutoa.
-
-
intron variant
-
NC_000023.11(NM_004006.2):r.649_650ins650-50_650-1
as a consequence of an intron 7 variant (c.650-52_650-51del), a new stronger exon 8 splice acceptor site is created (positions650-51/650-50) and the intron 7 sequence from positions650-50to650-1is inserted in the transcript. -
NC_000023.11(NM_004006.2):r.831_832ins831+1_831+67
as a consequence of an intron 8 variant (c.831+71C>A), a new stronger exon 8 splice donor site is created (positions831+67/831+68) and the intron 8 sequence from positions831+1to831+67is inserted in the transcript. -
NC_000023.11(NM_004006.2):r.649_650ins650-1400_650-1268
as a consequence of an intron 7 variant (c.650-1401T>G), a new exon is created and its sequence (positions650-1400to650-1268) is inserted in the transcript.
-
-
adjoined transcript (based on SVD-WG007)
NM_002354.2:r.-358_555::NM_000251.2:r.212_*279
describes an adjoined transcript from anEPCAM::MSH2gene fusion, where nucleotidesr.-358tor.555(EPCAM gene, reference transcriptNM_002354.2) are spliced to nucleotidesr.212tor.*279(MSH2 gene, reference transcriptNM_000251.2).
-
uncertain (RNA not analysed)
-
NC_000023.11(NM_004006.2):r.(76a>c)
RNA was not analysed, but a substitution of theanucleotide atr.76by acis predicted. -
NC_000023.11(NM_004006.2):r.?
an effect on the RNA level is expected, but it is not possible to give a reliable prediction of the consequences (RNA not analysed). -
NC_000023.11(NM_004006.2):r.spl
RNA has not been analysed, but it is very likely that splicing is affected.
-
Discussion#
A variant changes the +1 intron sequence (GT to AT). Although I did not analyse RNA, I am quite sure that normal splicing is affected. How can I best indicate this?
HGVS recommends to use the format r.spl to indicate that RNA was not analysed, but splicing is most probably affected.
In general, the format is used for variants changing the +1, +2, -2 and -1 position of an intron, i.e. affecting the GT splice donor and AG splice acceptor site (excluding GT to GC and GC to GT variants).
r.(spl?) is frequently used to indicate that normal splicing might be affected as a consequence of variants in the first or last nucleotide of an exon, the +3 to +5 intron position (splice donor site), and variants generating a new AG di-nucleotide close to the normal splice acceptor site (AG).
See Uncertain.
How can I best describe the predicted consequences on the protein level of a variant that most probably affects splicing?
The best format seems to use p.?, meaning "I do not know what to expect on the protein level".