Construct: ORF ccsbBroadEn_13239
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF014440.1_s300c1, BRDN0000398429
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- ERVV-1 (147664)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 147664 | ERVV-1 | endogenous retrovirus group... | NM_152473.2 | 68.5% | 68.5% | 1_450del |
2 | human | 100271846 | ERVV-2 | endogenous retrovirus group... | NM_001191055.2 | 59.6% | 57.1% | (many diffs) |

Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.
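
The match percentages in this table are defined as matches divided by aligned length, gaps included. A minimal sketch of that calculation follows. The function name `global_match_percent` and the `-` gap character are illustrative assumptions, not part of the record, and the sketch takes two already-aligned strings rather than performing the alignment itself.

```python
def global_match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gaps included.

    Both inputs must already be globally aligned (equal length, with
    `gap` marking inserted or deleted positions). Hypothetical helper.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A position counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Toy alignment: 4 matching positions out of 6 aligned positions.
print(global_match_percent("ACGT--", "ACGTAC"))
```

As a rough consistency check, and assuming the `1_450del` entry is the only difference from NM_152473.2, 981 matched bases over 981 + 450 aligned positions gives about 68.55%, which is close to the 68.5% listed for ERVV-1. How the source rounds the value is not stated.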
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1050
- ORF length:
- 981
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
61 TTGGCACCAT GAGCTTCTCC CCAGCAGGCT GCCACCCTAA CTTGACTCAC TGGTGTCCAG
121 CTAAACAAAT GAACGATTAT CGAGACAAGT CACCCCAAAA CCGCTGTGCA GCTTGGGAAG
181 GAAAAGAGCT AATCACATGG AGGGTTCTAT ATTTGCTTCC CAAGGCACAC ACTGTCCCCA
241 CATGGCCAAA ATCTACTGTT CCCCTGGGAG GGCCTCTATC CCCTGCATGC AATCAAACTA
301 TTCCAGCAGG GTGGAAATCG CAGTTACACA AGTGGTTCGA CAGCCACATC CCCCGGTGGG
361 CCTGTACCCC TCCTGGCTAT GTATTTTTAT GTGGGCCACA AAAAAATAAA CTGCCCTTTG
421 ATGGAAGTCC TAAGATAACT TATTCAACCC CCCCTGTGGC AAACCTCTAC ACTTGCATTA
481 ATAACATCCA ACATACGGGA GAATGTGCTG TGGGACTTTT GGGACCACGG GGGATAGGTG
541 TGACCATTTA TAACACCACC CAACCCAGAC AGAAAAGAGC TCTGGGTCTA ATACTGGCAG
601 GGATGGGTGC GGCCATAGGA ATGATCGCCC CATGGGGAGG GTTCACTTAT CATGATGTCA
661 CCCTCAGAAA TCTCTCCAGA CAAATAGACA ACATAGCTAA GAGTACCAGA GATAGCATCT
721 CTAAACTCAA GGCCTCCATA GATTCTCTAG CAAATGTAGT CATGAACAAC AGATTGGCCT
781 TAGATTACCT CTTAGCAGAG CAGGGTGGAG TCTGTGCAGT GATCAGTAAA TCCTGTTGCA
841 TTTATGTCAA TAACAGTGGG GCGATAGAGG AGGATATAAA AAAGATCTAT GATGAGGTTA
901 CGTGGCTCCA TAACTTTGGA AAAGGTGATT CAGCAGGGTC CATTTGGGAG GCTGTGAAGT
961 CTGCCCTCCC CTCCCTCACA TGGTTTGTCC CTTTACTGGG ACCAGCTGCA CTTAATAGCC
1021 TGCTTTCTCC TCTTTGGCCC TTGTCTCTAT TGCCAACTTT CTTGTACAAA GTtggcatta
1081 taagaaagca ttgcttatca atttgttgca acgaac
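
The ORF coordinates and the uppercase convention above can be checked programmatically. The sketch below strips the position numbers from the sequence listing, slices out the ORF, and reports where the uppercase (empirically verified) bases lie. It assumes the ORF start is 1-based and the ORF end is one past the last ORF base, which is the reading under which end minus start equals the listed length of 981. All function names are hypothetical, and only the first 120 bases of the record are embedded to keep the example short.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Drop position numbers and whitespace, preserving letter case."""
    return "".join(re.findall(r"[A-Za-z]+", text))


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF, assuming a 1-based start and an exclusive end
    (an assumption chosen so that end - start equals the ORF length)."""
    return seq[start - 1:end - 1]


def verified_span(seq: str):
    """Return the 1-based inclusive (first, last) positions of uppercase
    bases, or None if the sequence has no uppercase bases."""
    positions = [i for i, base in enumerate(seq, start=1) if base.isupper()]
    return (positions[0], positions[-1]) if positions else None


# First 120 bases of the record, copied from the listing above.
EXCERPT = (
    "1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG "
    "61 TTGGCACCAT GAGCTTCTCC CCAGCAGGCT GCCACCCTAA CTTGACTCAC TGGTGTCCAG"
)

seq = parse_numbered_sequence(EXCERPT)
print(len(seq))                   # 120
print(extract_orf(seq, 69, 72))   # ATG, the first codon at the listed ORF start
print(verified_span(seq))         # (49, 120) within this excerpt
```

Applied to the full 1116-base listing with start 69 and end 1050, the same slice would yield 981 bases, or 327 codons, under the stated coordinate assumption.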