Construct: ORF ccsbBroadEn_07330

Construct Description:

Construct Type: ORF
Other Identifiers: ORF015039.1_s300c1, BRDN0000392784
DNA Barcode: None
Epitope Tag: None
Notes: No stop codon in insert

Originally Annotated References:

Gene: SGCE (8910)

Vector Information:

Vector Backbone: pDONR223
Pol II Cassette 1: n/a
Pol II Cassette 2: n/a
Selection Marker: n/a
Visible Reporter: n/a
Epitope Tag: n/a

Current transcripts matched by this ORF:

# Taxon Gene Symbol Description Transcript Nuc. Match % Prot. Match % Match Diffs
1 human 8910 SGCE sarcoglycan epsilon NM_003919.3 99.8% 99.5% 146A>G;1196C>A
2 human 8910 SGCE sarcoglycan epsilon NM_001346717.2 97.7% 97.4% 146A>G;1037_1038ins27;1169C>A
3 human 8910 SGCE sarcoglycan epsilon NM_001099401.2 94.4% 94.1% 146A>G;1196C>A;1254_1328del
4 human 8910 SGCE sarcoglycan epsilon XM_011516665.3 92.9% 92.2% (many diffs)
5 human 8910 SGCE sarcoglycan epsilon NM_001099400.2 92.8% 91.7% (many diffs)
6 human 8910 SGCE sarcoglycan epsilon NM_001346713.2 92.2% 91.9% 109_216del;254A>G;1304C>A
7 human 8910 SGCE sarcoglycan epsilon NM_001346719.2 91.8% 91.3% (many diffs)
8 human 8910 SGCE sarcoglycan epsilon XM_011516666.3 91% 90.3% (many diffs)
9 human 8910 SGCE sarcoglycan epsilon NM_001301139.2 90.5% 90.3% 108_109ins123;1073C>A
10 human 8910 SGCE sarcoglycan epsilon NM_001346715.2 90.3% 90% (many diffs)
11 human 8910 SGCE sarcoglycan epsilon NM_001362807.2 89.7% 89.2% (many diffs)
12 human 8910 SGCE sarcoglycan epsilon NM_001362809.2 88.4% 88.3% 108_109ins123;914_915ins27;1046C>A
13 human 8910 SGCE sarcoglycan epsilon XM_011516663.2 86.2% 85.6% (many diffs)
14 human 8910 SGCE sarcoglycan epsilon XM_011516667.2 85.4% 84.5% (many diffs)
15 human 8910 SGCE sarcoglycan epsilon XM_017012763.1 85.4% 84.5% (many diffs)
16 human 8910 SGCE sarcoglycan epsilon XM_011516664.2 84.5% 83.8% (many diffs)
17 human 8910 SGCE sarcoglycan epsilon XM_024446985.1 84.2% 83.7% (many diffs)
18 human 8910 SGCE sarcoglycan epsilon XM_024446986.1 82.3% 81.7% (many diffs)
19 human 8910 SGCE sarcoglycan epsilon NM_001346720.2 79% 78.9% 0_1ins273;923C>A
20 human 8910 SGCE sarcoglycan epsilon NM_001362808.2 77% 76.8% 0_1ins273;764_765ins27;896C>A
21 human 8910 SGCE sarcoglycan epsilon XM_011516669.3 73.5% 73% (many diffs)
22 human 8910 SGCE sarcoglycan epsilon XM_017012767.1 71.6% 71% (many diffs)
23 mouse 20392 Sgce sarcoglycan, epsilon NM_011360.3 89.2% 94.5% (many diffs)
24 mouse 20392 Sgce sarcoglycan, epsilon NM_001130190.1 87.1% 92.1% (many diffs)
25 mouse 20392 Sgce sarcoglycan, epsilon NM_001130191.1 85.2% 90.3% (many diffs)
26 mouse 20392 Sgce sarcoglycan, epsilon XM_006505023.1 84.7% 88.9% (many diffs)
27 mouse 20392 Sgce sarcoglycan, epsilon NM_001130189.1 82.9% 87.1% (many diffs)
28 mouse 20392 Sgce sarcoglycan, epsilon NM_001130188.1 82.4% 87.3% (many diffs)
29 mouse 20392 Sgce sarcoglycan, epsilon XM_006505022.3 80.6% 85.6% (many diffs)
30 mouse 20392 Sgce sarcoglycan, epsilon XM_006505021.3 80.5% 85.2% (many diffs)
31 mouse 20392 Sgce sarcoglycan, epsilon XM_006505019.3 78.6% 82.4% (many diffs)
32 mouse 20392 Sgce sarcoglycan, epsilon XM_006505020.3 76.9% 80.8% (many diffs)
33 mouse 20392 Sgce sarcoglycan, epsilon XM_006505024.3 67% 71.3% (many diffs)
34 mouse 20392 Sgce sarcoglycan, epsilon XM_011241051.2 42.5% 42.9% (many diffs)
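
The Match Diffs column can be split into individual differences for downstream filtering. The following is a minimal sketch. It assumes the tokens follow an HGVS-like shape inferred from the rows above (`146A>G` as a substitution, `1037_1038ins27` as an insertion of 27 bases between two positions, `1254_1328del` as a deleted range); the record itself does not define the notation. The names `Diff` and `parse_match_diffs` are hypothetical.

```python
import re
from typing import List, NamedTuple, Optional


class Diff(NamedTuple):
    kind: str    # "sub", "ins", or "del"
    start: int   # first position named in the token
    end: int     # last position named in the token
    detail: str  # "A>G" for sub, inserted length for ins, "" for del


# Token shapes are assumptions based on the rows in the table above.
_SUB = re.compile(r"^(\d+)([ACGT])>([ACGT])$")
_INS = re.compile(r"^(\d+)_(\d+)ins(\d+)$")
_DEL = re.compile(r"^(\d+)(?:_(\d+))?del$")


def parse_match_diffs(field: str) -> Optional[List[Diff]]:
    """Parse one Match Diffs cell; return None for the '(many diffs)' placeholder."""
    field = field.strip()
    if field == "(many diffs)":
        return None
    diffs = []
    for token in field.split(";"):
        token = token.strip()
        m = _SUB.match(token)
        if m:
            pos = int(m.group(1))
            diffs.append(Diff("sub", pos, pos, m.group(2) + ">" + m.group(3)))
            continue
        m = _INS.match(token)
        if m:
            diffs.append(Diff("ins", int(m.group(1)), int(m.group(2)), m.group(3)))
            continue
        m = _DEL.match(token)
        if m:
            start = int(m.group(1))
            end = int(m.group(2)) if m.group(2) else start
            diffs.append(Diff("del", start, end, ""))
            continue
        # Fail loudly on shapes this sketch does not know about.
        raise ValueError("unrecognized diff token: %r" % token)
    return diffs
```

For example, `parse_match_diffs("146A>G;1196C>A;1254_1328del")` returns two substitutions and one deletion, while rows marked `(many diffs)` return `None` so they can be handled separately.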

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start: 66
ORF end: 1377
ORF length: 1311
Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGCA ATTGCCCCGG TGGTGGGAGC TGGGAGACCC CTGTGCTTGG ACGGGACAGG
121 GTCGGGGGAC ACGCAGGATG AGCCCCGCGA CCACTGGCAC ATTCTTGCTG ACAGTGTACA
181 GTATTTTCTC CAAGGTACAC TCCGATCGGA GTGTATACCC ATCAGCAGGT GTCCTCTTTG
241 TTCATGTTTT GGAAAGAGAA TATTTTAAGG GGGAATTTCC ACCTTACCCA AAACCTGGCG
301 AGATTAGTAA TGATCCCATA ACATTTAATA CAAATTTAAT GGGTTACCCA GACCGACCTG
361 GATGGCTTCG ATATATCCAA AGGACACCAT ATAGTGATGG AGTCCTATAT GGGTCCCCAA
421 CAGCTGAAAA TGTGGGGAAG CCAACAATCA TTGAGATAAC TGCCTACAAC AGGCGCACCT
481 TTGAGACTGC AAGGCATAAT TTGATAATTA ATATAATGTC TGCAGAAGAC TTCCCGTTGC
541 CATATCAAGC AGAATTCTTC ATTAAGAATA TGAATGTAGA AGAAATGTTG GCCAGTGAGG
601 TTCTTGGAGA CTTTCTTGGC GCAGTGAAAA ATGTGTGGCA GCCAGAGCGC CTGAACGCCA
661 TAAACATCAC ATCGGCCCTA GACAGGGGTG GCAGGGTGCC ACTTCCCATT AATGACCTGA
721 AGGAGGGCGT TTATGTCATG GTTGGTGCAG ATGTCCCGTT TTCTTCTTGT TTACGAGAAG
781 TTGAAAATCC ACAGAATCAA TTGAGATGTA GTCAAGAAAT GGAGCCTGTA ATAACATGTG
841 ATAAAAAATT TCGTACTCAA TTTTACATTG ACTGGTGCAA AATTTCATTG GTTGATAAAA
901 CAAAGCAAGT GTCCACCTAT CAGGAAGTGA TTCGTGGAGA GGGGATTTTA CCTGATGGTG
961 GAGAATACAA ACCCCCTTCT GATTCTTTGA AAAGCAGAGA CTATTACACG GATTTCCTAA
1021 TTACACTGGC TGTGCCCTCG GCAGTGGCAC TGGTCCTTTT TCTAATACTT GCTTATATCA
1081 TGTGCTGCCG ACGGGAAGGC GTGGAAAAGA GAAACATGCA AACACCAGAC ATCCAACTGG
1141 TCCATCACAG TGCTATTCAG AAATCTACCA AGGAGCTTCG AGACATGTCC AAGAATAGAG
1201 AGATAGCATG GCCCCTGTCA ACGCTTCCTG TGTTCCACCC TGTGACTGGG GAAATCATAC
1261 ATCCTTTACA CACAGACAAC TATGATAGCA CAAACATGCC ATTGATGCAA ACGCAGCAGA
1321 ACTTGCCACA TCAGACTCAG ATTCCCCAAC AGCAGACTAC AGGTAAATGG TATCCCTGCC
1381 CAACTTTCTT GTACAAAGTt ggcattataa gaaagcattg cttatcaatt tgttgcaacg
1441 aac
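
The ORF coordinates can be checked against the listing with a short script. This is a sketch under one stated assumption: the start is 1-based and the end is exclusive, which is the reading under which 1377 - 66 equals the reported length of 1311 and position 66 falls on the A of the first ATG. The function names `parse_numbered_sequence` and `extract_orf` are hypothetical.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Drop the leading position number and all whitespace from each line, then join."""
    bases = []
    for line in text.splitlines():
        line = re.sub(r"^\s*\d+", "", line)      # leading position number, fused or spaced
        bases.append(re.sub(r"\s+", "", line))   # spaces between 10-base groups
    return "".join(bases)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF, assuming a 1-based start and an exclusive end (end - start == length)."""
    return seq[start - 1 : end - 1]


# First two lines of the listing above, used here only as a small sample.
sample = """\
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGCA ATTGCCCCGG TGGTGGGAGC TGGGAGACCC CTGTGCTTGG ACGGGACAGG
"""

seq = parse_numbered_sequence(sample)
first_codons = extract_orf(seq, 66, 75)   # nine bases starting at the ORF start
```

Given the full listing, `extract_orf(seq, 66, 1377)` yields 1311 bases under this convention. Case is preserved, so the uppercase/lowercase distinction described in the note above survives parsing.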
