Construct: ORF TRCN0000474559
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF016079.1_s317c1
- Derived from:
- ccsbBroadEn_05075
- DNA Barcode:
- GCCTATTCCCTATCATTGCCGGCG
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- ARSK (153642)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000474559
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 153642 | ARSK | arylsulfatase family member K | NM_198150.3 | 100% | 100% | |
2 | human | 153642 | ARSK | arylsulfatase family member K | XM_005271904.4 | 70.3% | 70.3% | 0_1ins477 |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1677
- ORF length:
- 1608
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcaccat gctactgctg tgggtgtcgg tggtcgcagc cttggcgctg gcggtactgg 121 cccccggagc aggggagcag aggcggagag cagccaaagc gcccaatgtg gtgctggtcg 181 tgagcgactc cttcgatgga aggttaacat ttcatccagg aagtcaggta gtgaaacttc 241 cttttatcaa ctttatgaag acacgtggga cttcctttct gaatgcctac acaaactctc 301 caatttgttg cccatcacgc gcagcaatgt ggagtggcct cttcactcac ttaacagaat 361 cttggaataa ttttaagggt ctagatccaa attatacaac atggatggat gtcatggaga 421 ggcatggcta ccgaacacag aaatttggga aactggacta tacttcagga catcactcca 481 ttagtaatcg tgtggaagcg tggacaagag atgttgcttt cttactcaga caagaaggca 541 ggcccatggt taatcttatc cgtaacagga ctaaagtcag agtgatggaa agggattggc 601 agaatacaga caaagcagta aactggttaa gaaaggaagc aattaattac actgaaccat 661 ttgttattta cttgggatta aatttaccac acccttaccc ttcaccatct tctggagaaa 721 attttggatc ttcaacattt cacacatctc tttattggct tgaaaaagtg tctcatgatg 781 ccatcaaaat cccaaagtgg tcacctttgt cagaaatgca ccctgtagat tattactctt 841 cttatacaaa aaactgcact ggaagattta caaaaaaaga aattaagaat attagagcat 901 tttattatgc tatgtgtgct gagacagatg ccatgcttgg tgaaattatt ttggcccttc 961 atcaattaga tcttcttcag aaaactattg tcatatactc ctcagaccat ggagagctgg 1021 ccatggaaca tcgacagttt tataaaatga gcatgtacga ggctagtgca catgttccgc 1081 ttttgatgat gggaccagga attaaagccg gcctacaagt atcaaatgtg gtttctcttg 1141 tggatattta ccctaccatg cttgatattg ctggaattcc tctgcctcag aacctgagtg 1201 gatactcttt gttgccgtta tcatcagaaa catttaagaa tgaacataaa gtcaaaaacc 1261 tgcatccacc ctggattctg agtgaattcc atggatgtaa tgtgaatgcc tccacctaca 1321 tgcTTCGAAC TAACCACTGG AAATATATAG CCTATTCGGA TGGTGCATCA ATATTGCCTC 1381 AACTCTTTGA TCTTTCCTCG GATCCAGATG AATTAACAAA TGTTGCTGTA AAATTTCCAG 1441 AAATTACTTA TTCTTTGGAT CAGAAGCTTC ATTCCATTAT AAACTACCCT AAAGTTTCTG 1501 CTTCTGTCCA CCAGTATAAT AAAGAGCAGT TTATCAAGTG GAAACAAAGT ATAGGACAGA 1561 ATTATTCAAA CGTTATAGCA AATCTTAGGT GGCACCAAGA CTGGCAGAAG GAACCAAGGA 1621 AGTATGAAAA TGCAATTGAT CAGTGGCTTA AAACCCATAT GAATCCAAGA GCAGTTTTGC 1681 CAACTTTCTT GTACAAAGTG GTTGATATCG GTAAGCCTAT CCCTAACCCT CTCCTCGGTC 1741 TCGATTCTAC GTAGTAATGA ACTTTATATA TCTTGTGGAA AGGACGAGCC TATTCCCTAT 1801 CATTGCCGGC GACGCGTTAA GTCgacaatc aacctctgga ttacaaaatt tgtgaaagat 1861 t