Construct: ORF ccsbBroadEn_00397
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF005060.1_s300c1, BRDN0000385845
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- CTSK (1513)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 1513 | CTSK | cathepsin K | NM_000396.4 | 99.8% | 100% | 831A>G |
2 | mouse | 13038 | Ctsk | cathepsin K | NM_007802.4 | 87.2% | 86% | (many diffs) |
3 | mouse | 13038 | Ctsk | cathepsin K | XM_006500974.3 | 87.2% | 86% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1053
- ORF length:
- 987
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGTG GGGGCTCAAG GTTCTGCTGC TACCTGTGGT GAGCTTTGCT CTGTACCCTG 121 AGGAGATACT GGACACCCAC TGGGAGCTAT GGAAGAAGAC CCACAGGAAG CAATATAACA 181 ACAAGGTGGA TGAAATCTCT CGGCGTTTAA TTTGGGAAAA AAACCTGAAG TATATTTCCA 241 TCCATAACCT TGAGGCTTCT CTTGGTGTCC ATACATATGA ACTGGCTATG AACCACCTGG 301 GGGACATGAC CAGTGAAGAG GTGGTTCAGA AGATGACTGG ACTCAAAGTA CCCCTGTCTC 361 ATTCCCGCAG TAATGACACC CTTTATATCC CAGAATGGGA AGGTAGAGCC CCAGACTCTG 421 TCGACTATCG AAAGAAAGGA TATGTTACTC CTGTCAAAAA TCAGGGTCAG TGTGGTTCCT 481 GTTGGGCTTT TAGCTCTGTG GGTGCCCTGG AGGGCCAACT CAAGAAGAAA ACTGGCAAAC 541 TCTTAAATCT GAGTCCCCAG AACCTAGTGG ATTGTGTGTC TGAGAATGAT GGCTGTGGAG 601 GGGGCTACAT GACCAATGCC TTCCAATATG TGCAGAAGAA CCGGGGTATT GACTCTGAAG 661 ATGCCTACCC ATATGTGGGA CAGGAAGAGA GTTGTATGTA CAACCCAACA GGCAAGGCAG 721 CTAAATGCAG AGGGTACAGA GAGATCCCCG AGGGGAATGA GAAAGCCCTG AAGAGGGCAG 781 TGGCCCGAGT GGGACCTGTC TCTGTGGCCA TTGATGCAAG CCTGACCTCC TTCCAGTTTT 841 ACAGCAAAGG TGTGTATTAT GATGAAAGCT GCAATAGCGA TAATCTGAAC CATGCGGTTT 901 TGGCAGTGGG ATATGGAATC CAGAAGGGAA ACAAGCACTG GATAATTAAA AACAGCTGGG 961 GAGAAAACTG GGGAAACAAA GGATATATCC TCATGGCTCG AAATAAGAAC AACGCCTGTG 1021 GCATTGCCAA CCTGGCCAGC TTCCCCAAGA TGTGCCCAAC TTTCTTGTAC AAAGTtggca 1081 ttataagaaa gcattgctta tcaatttgtt gcaacgaac