Construct: ORF TRCN0000472672
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF006325.1_s317c1
- Derived from:
- ccsbBroadEn_09732
- DNA Barcode:
- AACCAGACCGTATATGCGCCGCGA
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SLC5A8 (160728)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000472672
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 160728 | SLC5A8 | solute carrier family 5 mem... | NM_145913.5 | 99.9% | 100% | 1662C>T |
2 | human | 160728 | SLC5A8 | solute carrier family 5 mem... | XM_017018910.2 | 72.5% | 72.2% | (many diffs) |
3 | human | 160728 | SLC5A8 | solute carrier family 5 mem... | XR_944503.3 | 59.6% | (many diffs) | |
4 | mouse | 216225 | Slc5a8 | solute carrier family 5 (io... | NM_145423.2 | 86% | 86.4% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1899
- ORF length:
- 1830
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcaccat ggacacgcca cggggcatcg gcaccttcgt ggtgtgggac tacgtggtgt 121 tcgcgggcat gctggtcatc tcggccgcca tcggcatcta ctacgccttc gctgggggcg 181 gccagcagac ctccaaggac ttcctgatgg gcggccgcag aatgaccgca gtgcccgtgg 241 cgctgtccct caccgctagc ttcatgtcag ccgtcactgt cctgggcacc ccctccgagg 301 tctaccgttt tggggccatt tttagcatct ttgccttcac ctacttcttt gtggtggtca 361 tcagcgcgga ggtcttcctc ccggtgttct acaaactggg aattaccagc acctacgagt 421 atttagaact tcgatttaac aaatgtgttc gtctctgtgg aacagtcctc ttcattgttc 481 aaacaattct gtatactgga attgttattt atgcccctgc cctggctttg aatcaagtca 541 caggatttga tctgtggggc gcggtagtgg caacgggggt ggtctgcaca ttctactgca 601 cactgggtgg tcttaaagca gttatctgga cagatgtttt tcaagttggg atcatggtgg 661 ctggatttgc atccgtgatt atacaggctg tggtgatgca aggtggaatc agcactattt 721 taaatgatgc ctatgatggt ggaagattaa atttctggaa ttttaatcct aaccctttgc 781 aaagacacac cttctggaca attattatag gagggacctt cacatggacc agcatctacg 841 gtgtcaacca atcccaggtg cagagatata tttcttgtaa aagcagattc caggcaaaac 901 tgtctctcta catcaatctt gtgggactct gggcaatcct cacatgctca gtgttttgtg 961 ggctcgccct atattccagg taccatgact gtgatccttg gacagccaag aaagtgtctg 1021 caccagacca gctcatgcct tatttggtac tggacattct gcaagattat ccaggacttc 1081 ctggactttt tgtggcctgt gcttacagtg ggacattaag cacagtgtcc tccagtatta 1141 atgccttagc agcagtaact gtggaagatc taatcaaacc ttacttcaga tcgctctcag 1201 aaaggtctct gtcttggatt tcccaaggaa tgagtgtggt gtatggagcc ctgtgtattg 1261 gaatggctgc gctggcgtca cttatgggag ctttgttgca ggcagcactc agcgtatttg 1321 gtatggttgg tggaccactt atgggcctgt tcgctttggg cattttggtt ccctttgcca 1381 actcaattgg agcacttgtt ggtctgatgg ctggatttgc catttctcta tgggttggaa 1441 ttggagctca aatatatcct ccacttcctg agagaacatt gccattgcac cttgatatcc 1501 aaggctgtaa cagcacctac aatgagacaa atttgatgaC AACCACAGAA ATGCCATTTA 1561 CTACTAGTGT TTTTCAAATA TACAATGTTC AAAGGACTCC ACTGATGGAT AACTGGTATT 1621 CTTTATCATA TCTGTACTTC AGCACTGTTG GAACTTTGGT AACATTATTA GTGGGGATAC 1681 TTGTCAGTTT ATCAACAGGA GGAAGAAAAC AGAACTTAGA CCCCAGATAT ATACTAACCA 1741 AAGAGGACTT TTTATCCAAT TTTGATATTT TTAAGAAAAA GAAGCATGTT TTGAGCTATA 1801 AATCACATCC AGTGGAAGAT GGTGGAACTG ATAATCCTGC TTTCAACCAC ATTGAATTGA 1861 ACTCAGATCA GAGTGGCAAG AGCAATGGGA CTCGTTTGTT GCCAACTTTC TTGTACAAAG 1921 TGGTTGATAT CGGTAAGCCT ATCCCTAACC CTCTCCTCGG TCTCGATTCT ACGTAGTAAT 1981 GAACTAGTCC GTAACTTGAA AGTATTTCGA TTTCTTGGCT TTATATATCT TGTGGAAAGG 2041 ACGAAACCAG ACCGTATATG CGCCGCGAAC GCGTTAAGTC gacaatcaac ctctggatta 2101 caaaatttgt gaaagatt