Construct: ORF TRCN0000481495
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF005060.1_s317c1
- Derived from:
- ccsbBroadEn_00397
- DNA Barcode:
- AAGAGTCACTTCTTTTTCTATATA
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- CTSK (1513)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000481495
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 1513 | CTSK | cathepsin K | NM_000396.4 | 99.8% | 100% | 831A>G |
2 | mouse | 13038 | Ctsk | cathepsin K | NM_007802.4 | 87.2% | 86% | (many diffs) |
3 | mouse | 13038 | Ctsk | cathepsin K | XM_006500974.3 | 87.2% | 86% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1053
- ORF length:
- 987
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatgtg ggggctcaag gttctgctgc tacctgtggt gagctttgct ctgtaccctg 121 aggagatact ggacacccac tgggagctat ggaagaagac ccacaggaag caatataaca 181 acaaggtgga tgaaatctct cggcgtttaa tttgggaaaa aaacctgaag tatatttcca 241 tccataacct tgaggcttct cttggtgtcc atacatatga actggctatg aaccacctgg 301 gggacatgac cagtgaagag gtggttcaga agatgactgg actcaaagta cccctgtctc 361 attcccgcag taatgacacc ctttatatcc cagaatggga aggtagagcc ccagactctg 421 tcgactatcg aaagaaagga tatgttactc ctgtcaaaaa tcagggtcag tgtggttcct 481 gttgggcttt tagctctgtg ggtgccctgg agggccaact caagaagaaa actggcaaac 541 tcttaaatct gagtccccag aacctagtgg attgtgtgtc tgagaatgat ggctgtggag 601 ggggctacat gaccaatgCC TTCCAATATG TGCAGAAGAA CCGGGGTATT GACTCTGAAG 661 ATGCCTACCC ATATGTGGGA CAGGAAGAGA GTTGTATGTA CAACCCAACA GGCAAGGCAG 721 CTAAATGCAG AGGGTACAGA GAGATCCCCG AGGGGAATGA GAAAGCCCTG AAGAGGGCAG 781 TGGCCCGAGT GGGACCTGTC TCTGTGGCCA TTGATGCAAG CCTGACCTCC TTCCAGTTTT 841 ACAGCAAAGG TGTGTATTAT GATGAAAGCT GCAATAGCGA TAATCTGAAC CATGCGGTTT 901 TGGCAGTGGG ATATGGAATC CAGAAGGGAA ACAAGCACTG GATAATTAAA AACAGCTGGG 961 GAGAAAACTG GGGAAACAAA GGATATATCC TCATGGCTCG AAATAAGAAC AACGCCTGTG 1021 GCATTGCCAA CCTGGCCAGC TTCCCCAAGA TGTGCCCAAC TTTCTTGTAC AAAGTGGTTG 1081 ATATCGGTAA GCCTATCCCT AACCCTCTCC TCGGTCTCGA TTCTACGTAG TAATGAACTA 1141 GTCCGTAACT TGAAAGTATT TCGATTTCTT GGCTTTATAT ATCTTGTGGA AAGGACGAAA 1201 GAGTCACTTC TTTTTCTATA TAACGCGTTA AGTCgacaat caacctctgg attacaaaat 1261 ttgtgaaaga tt