Construct: ORF TRCN0000472957
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF017128.1_s317c1
- Derived from:
- ccsbBroadEn_10022
- DNA Barcode:
- AACTGGTAGATAGAATCCGCCCGC
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- ARSI (340075)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000472957
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as (total nt. matches) / (aligned length, incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as (total aa. matches) / (aligned length, incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 340075 | ARSI | arylsulfatase family member I | NM_001012301.4 | 99.9% | 100% | 636T>C |
2 | mouse | 545260 | Arsi | arylsulfatase i | NM_001038499.1 | 87.5% | 95.2% | (many diffs) |
3 | mouse | 545260 | Arsi | arylsulfatase i | XM_011246974.2 | 73.3% | 78.8% | (many diffs) |
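The match percentages in this table are defined as total matches divided by aligned length, gaps included. A minimal Python sketch of that ratio follows. The function name `match_percent` and both example alignments are hypothetical, not taken from the record. The second example assumes an ungapped alignment spanning exactly the 1707-nt ORF with one substitution, which may not be how the portal aligns the ORF to NM_001012301.4.

```python
def match_percent(aligned_a, aligned_b, gap="-"):
    """Percent identity over a gapped global alignment.

    Counts columns where both rows carry the same non-gap character
    (case-insensitive) and divides by the full alignment length,
    gap columns included.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned rows must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment (hypothetical): 6 columns = 4 matches, 1 mismatch, 1 gap.
toy = match_percent("ACGTTA", "ACCT-A")

# Hypothetical stand-in for a single substitution in a 1707-column,
# ungapped alignment: 1706 matching columns out of 1707.
reference = "T" * 1707
query = reference[:635] + "C" + reference[636:]
single_sub = match_percent(reference, query)

print(round(toy, 2), round(single_sub, 1))
```

Under those assumptions the second ratio is 1706/1707, which rounds to the 99.9% reported for the human transcript.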
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1776
- ORF length:
- 1707
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61 ttggcaccat gcacaccctc actggcttct ccctggtcag cctgctcagc ttcggctacc
121 tgtcctggga ctgggccaag ccgagcttcg tggccgacgg gcccggggag gctggcgagc
181 agccctcggc cgctccgccc cagcctcccc acatcatctt catcctcacg gacgaccaag
241 gctaccacga cgtgggctac catggttcag atatcgagac ccctacgctg gacaggctgg
301 cggccaaggg ggtcaagttg gagaattatt acatccagcc catctgcacg ccttcgcgga
361 gccagctcct cactggcagg taccagatcc acacaggact ccagcattcc atcatccgcc
421 cacagcagcc caactgcctg cccctggacc aggtgacact gccacagaag ctgcaggagg
481 caggttattc cacccatatg gtgggcaagt ggcacctggg cttctaccgg aaggagtgtc
541 tgcccacccg tcggggcttc gacaccttcc tgggctcgct cacgggcaat gtggactatt
601 acacctatga caactgtgat ggcccaggcg tgtgcggctt cgacctgcac gagggtgaga
661 atgtggcctg ggggctcagc ggccagtact ccactatgct ttacgcccag cgcgccagcc
721 atatcctggc cagccacagc cctcagcgtc ccctcttcct ctatgtggcc ttccaggcag
781 tacacacacc cctgcagtcc cctcgtgagt acctgtaccg ctaccgcacc atgggcaatg
841 tggcccggcg gaagtacgcg gccatggtga cctgcatgga tgaggctgtg cgcaacatca
901 cctgggccct caagcgctac ggtttctaca acaacagtgt catcatcttc tccagtgaca
961 atggtggcca gactttctcg gggggcagca actggccgct ccgaggacgc aagggcactt
1021 attgggaagg tggcgtgcgg ggcctaggct ttgtccacag tcccctgctc aagcgaaagc
1081 aacggacaag ccgggcactg atgcacatca ctgactggta cccgaccctg gtgggtctgg
1141 caggtggtac cacctcagca gccgatgggc tagatggcta cgacgtgtgg ccggccatca
1201 gcgagggccg ggcctcacca cgcacggaga tcctgcacaa cattgaccca ctctacaacc
1261 atgcccagca tggctccctg gagggcggct ttggcatctg gaacaccgcc gtgcaggctg
1321 ccaTCCGCGT GGGTGAGTGG AAGCTGCTGA CAGGAGACCC CGGCTATGGC GATTGGATCC
1381 CACCGCAGAC ACTGGCCACC TTCCCGGGTA GCTGGTGGAA CCTGGAACGA ATGGCCAGTG
1441 TCCGCCAGGC CGTGTGGCTC TTCAACATCA GTGCTGACCC TTATGAACGG GAGGACCTGG
1501 CTGGCCAGCG GCCTGATGTG GTCCGCACCC TGCTGGCTCG CCTGGCCGAA TATAACCGCA
1561 CAGCCATCCC GGTACGCTAC CCAGCTGAGA ACCCCCGGGC TCATCCTGAC TTTAATGGGG
1621 GTGCTTGGGG GCCCTGGGCC AGTGATGAGG AAGAGGAGGA AGAGGAAGGG AGGGCTCGAA
1681 GCTTCTCCCG GGGTCGTCGC AAGAAAAAAT GCAAGATTTG CAAGCTTCGA TCCTTTTTCC
1741 GTAAACTCAA CACCAGGCTA ATGTCCCAAC GGATCTTGCC AACTTTCTTG TACAAAGTGG
1801 TTGATATCGG TAAGCCTATC CCTAACCCTC TCCTCGGTCT CGATTCTACG TAGTAATGAA
1861 CTAGTCCGTA ACTTGAAAGT ATTTCGATTT CTTGGCTTTA TATATCTTGT GGAAAGGACG
1921 AAACTGGTAG ATAGAATCCG CCCGCACGCG TTAAGTCgac aatcaacctc tggattacaa
1981 aatttgtgaa agatt
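The coordinates and case convention in this section can be checked against the sequence text with a short script. The Python sketch below is illustrative only: the helper names (`clean_sequence`, `uppercase_runs`, `find_1based`) are hypothetical, and it works on two excerpts copied from the record (the rows starting at positions 1 and 61, and the row starting at 1921) rather than the full insert. It assumes the position numbers are 1-based, as the ORF start of 69 suggests.

```python
import re


def clean_sequence(numbered_text):
    """Drop position numbers and whitespace; keep the case of each base."""
    return re.sub(r"[\d\s]+", "", numbered_text)


def uppercase_runs(seq, offset=1):
    """Return inclusive (start, end) spans of uppercase bases.

    `offset` is the position of the first base of `seq` in the full record.
    """
    return [
        (m.start() + offset, m.end() + offset - 1)
        for m in re.finditer(r"[A-Z]+", seq)
    ]


def find_1based(seq, motif, offset=1):
    """Position of the first case-insensitive hit of `motif`, or None."""
    i = seq.upper().find(motif.upper())
    return None if i < 0 else i + offset


# Excerpt: rows starting at positions 1 and 61 of the record.
head = clean_sequence(
    "1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag "
    "61 ttggcaccat gcacaccctc actggcttct ccctggtcag cctgctcagc ttcggctacc"
)
# Excerpt: row starting at position 1921 of the record.
tail = clean_sequence(
    "1921 AAACTGGTAG ATAGAATCCG CCCGCACGCG TTAAGTCgac aatcaacctc tggattacaa"
)

orf_start = 69                                   # "ORF start" from the record
start_codon = head[orf_start - 1 : orf_start + 2]
barcode_pos = find_1based(tail, "AACTGGTAGATAGAATCCGCCCGC", offset=1921)
verified = uppercase_runs(tail, offset=1921)

print(start_codon, barcode_pos, verified)
```

On these excerpts the codon at position 69 is `atg`, the DNA barcode listed in the construct description starts at position 1922, and the uppercase (empirically verified) run in the last excerpt covers positions 1921 to 1957.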