Construct: ORF TRCN0000472957

Construct Description:

Construct Type:
ORF
Other Identifiers:
ORF017128.1_s317c1
Derived from:
ccsbBroadEn_10022
DNA Barcode:
AACTGGTAGATAGAATCCGCCCGC
Epitope Tag:
V5
Notes:
No stop codon in insert

Originally Annotated References:

Gene:
ARSI (340075)

Vector Information:

Vector Backbone:
pLX_317
Pol II Cassette 1:
SV40-PuroR
Pol II Cassette 2:
EF1a-TRCN0000472957
Selection Marker:
PuroR
Visible Reporter:
n/a
Epitope Tag:
V5

Current transcripts matched by this ORF:

Taxon Gene Symbol Description Transcript Nuc. Match %[?] Prot. Match %[?] Match Diffs[?]
1 human 340075 ARSI arylsulfatase family member I NM_001012301.4 99.9% 100% 636T>C
2 mouse 545260 Arsi arylsulfatase i NM_001038499.1 87.5% 95.2% (many diffs)
3 mouse 545260 Arsi arylsulfatase i XM_011246974.2 73.3% 78.8% (many diffs)
Download CSV

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start:
69
ORF end:
1776
ORF length:
1707
Sequence:
1tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61ttggcaccat gcacaccctc actggcttct ccctggtcag cctgctcagc ttcggctacc
121tgtcctggga ctgggccaag ccgagcttcg tggccgacgg gcccggggag gctggcgagc
181agccctcggc cgctccgccc cagcctcccc acatcatctt catcctcacg gacgaccaag
241gctaccacga cgtgggctac catggttcag atatcgagac ccctacgctg gacaggctgg
301cggccaaggg ggtcaagttg gagaattatt acatccagcc catctgcacg ccttcgcgga
361gccagctcct cactggcagg taccagatcc acacaggact ccagcattcc atcatccgcc
421cacagcagcc caactgcctg cccctggacc aggtgacact gccacagaag ctgcaggagg
481caggttattc cacccatatg gtgggcaagt ggcacctggg cttctaccgg aaggagtgtc
541tgcccacccg tcggggcttc gacaccttcc tgggctcgct cacgggcaat gtggactatt
601acacctatga caactgtgat ggcccaggcg tgtgcggctt cgacctgcac gagggtgaga
661atgtggcctg ggggctcagc ggccagtact ccactatgct ttacgcccag cgcgccagcc
721atatcctggc cagccacagc cctcagcgtc ccctcttcct ctatgtggcc ttccaggcag
781tacacacacc cctgcagtcc cctcgtgagt acctgtaccg ctaccgcacc atgggcaatg
841tggcccggcg gaagtacgcg gccatggtga cctgcatgga tgaggctgtg cgcaacatca
901cctgggccct caagcgctac ggtttctaca acaacagtgt catcatcttc tccagtgaca
961atggtggcca gactttctcg gggggcagca actggccgct ccgaggacgc aagggcactt
1021attgggaagg tggcgtgcgg ggcctaggct ttgtccacag tcccctgctc aagcgaaagc
1081aacggacaag ccgggcactg atgcacatca ctgactggta cccgaccctg gtgggtctgg
1141caggtggtac cacctcagca gccgatgggc tagatggcta cgacgtgtgg ccggccatca
1201gcgagggccg ggcctcacca cgcacggaga tcctgcacaa cattgaccca ctctacaacc
1261atgcccagca tggctccctg gagggcggct ttggcatctg gaacaccgcc gtgcaggctg
1321ccaTCCGCGT GGGTGAGTGG AAGCTGCTGA CAGGAGACCC CGGCTATGGC GATTGGATCC
1381CACCGCAGAC ACTGGCCACC TTCCCGGGTA GCTGGTGGAA CCTGGAACGA ATGGCCAGTG
1441TCCGCCAGGC CGTGTGGCTC TTCAACATCA GTGCTGACCC TTATGAACGG GAGGACCTGG
1501CTGGCCAGCG GCCTGATGTG GTCCGCACCC TGCTGGCTCG CCTGGCCGAA TATAACCGCA
1561CAGCCATCCC GGTACGCTAC CCAGCTGAGA ACCCCCGGGC TCATCCTGAC TTTAATGGGG
1621GTGCTTGGGG GCCCTGGGCC AGTGATGAGG AAGAGGAGGA AGAGGAAGGG AGGGCTCGAA
1681GCTTCTCCCG GGGTCGTCGC AAGAAAAAAT GCAAGATTTG CAAGCTTCGA TCCTTTTTCC
1741GTAAACTCAA CACCAGGCTA ATGTCCCAAC GGATCTTGCC AACTTTCTTG TACAAAGTGG
1801TTGATATCGG TAAGCCTATC CCTAACCCTC TCCTCGGTCT CGATTCTACG TAGTAATGAA
1861CTAGTCCGTA ACTTGAAAGT ATTTCGATTT CTTGGCTTTA TATATCTTGT GGAAAGGACG
1921AAACTGGTAG ATAGAATCCG CCCGCACGCG TTAAGTCgac aatcaacctc tggattacaa
1981aatttgtgaa agatt

Download FASTA (ORF) (Full)