Construct: ORF ccsbBroadEn_13141

Construct Description:

Construct Type: ORF
Other Identifiers: ORF018275.1_s300c1, BRDN0000398226
DNA Barcode: None
Epitope Tag: None
Notes: No stop codon in insert

Originally Annotated References:

Gene: SGSM1 (129049)

Vector Information:

Vector Backbone: pDONR223
Pol II Cassette 1: n/a
Pol II Cassette 2: n/a
Selection Marker: n/a
Visible Reporter: n/a
Epitope Tag: n/a

Current transcripts matched by this ORF:

#  Taxon  Gene    Symbol  Description                     Transcript      Nuc. Match %  Prot. Match %  Match Diffs
1  human  129049  SGSM1   small G protein signaling m...  NM_001098497.3  55.5%         55.5%          1_1458del
2  human  129049  SGSM1   small G protein signaling m...  NM_001039948.4  52.8%         52.8%          1_1623del
3  human  129049  SGSM1   small G protein signaling m...  NM_001098498.3  49.9%         49.9%          1_1458del;1767_1768ins183
4  human  129049  SGSM1   small G protein signaling m...  NM_133454.3     47.5%         47.5%          1_1623del;1932_1933ins183
5  mouse  52850   Sgsm1   small G protein signaling m...  NM_001309528.1  64.9%         68.8%          (many diffs)
6  mouse  52850   Sgsm1   small G protein signaling m...  NM_172718.3     47.9%         50.8%          (many diffs)
7  mouse  52850   Sgsm1   small G protein signaling m...  XM_006535122.3  45.6%         48.3%          (many diffs)
8  mouse  52850   Sgsm1   small G protein signaling m...  NM_001254731.1  15.3%         16.2%          (many diffs)
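
The Match Diffs strings for the human rows look like HGVS-style terms. The sketch below assumes that `a_bdel` means transcript bases a through b are absent from the ORF, and that `a_bins N` (written `a_binsN`) means the ORF carries N extra bases between transcript positions a and b. Under that reading, the listed nucleotide match percentages are reproduced by dividing the matched bases by the alignment length and truncating to one decimal. This is an inference from the numbers in this record, not a documented formula; the function names are hypothetical.

```python
import re

ORF_LENGTH = 1821  # "ORF length" as listed in this record


def diff_lengths(match_diffs: str) -> tuple[int, int]:
    """Return (deleted, inserted) base counts parsed from a Match Diffs string."""
    deleted = inserted = 0
    for term in match_diffs.split(";"):
        if m := re.fullmatch(r"(\d+)_(\d+)del", term):
            # Assumed: transcript positions a..b (inclusive) are missing from the ORF.
            deleted += int(m.group(2)) - int(m.group(1)) + 1
        elif m := re.fullmatch(r"\d+_\d+ins(\d+)", term):
            # Assumed: the ORF has N bases that the transcript lacks.
            inserted += int(m.group(1))
        else:
            raise ValueError(f"unrecognized diff term: {term!r}")
    return deleted, inserted


def match_tenths(match_diffs: str, orf_length: int = ORF_LENGTH) -> int:
    """Assumed match percentage, in tenths of a percent, truncated."""
    deleted, inserted = diff_lengths(match_diffs)
    matched = orf_length - inserted   # ORF bases aligned to transcript bases
    aligned = orf_length + deleted    # total alignment columns
    return (1000 * matched) // aligned


for diffs in ("1_1458del", "1_1623del",
              "1_1458del;1767_1768ins183", "1_1623del;1932_1933ins183"):
    tenths = match_tenths(diffs)
    print(f"{diffs}: {tenths // 10}.{tenths % 10}%")
```

With these assumptions the four human rows come out as 55.5%, 52.8%, 49.9% and 47.5%, matching the table. Plain rounding would give 52.9% and 50.0% for rows 2 and 3, which is why the sketch truncates. The mouse rows list "(many diffs)" and cannot be checked this way.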

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start: 69
ORF end: 1890
ORF length: 1821
Sequence:
   1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
  61 TTGGCACCAT GAAGTACCAG ATCCTCTCCA GAGCCTTCTA TGGATGGCTG GCCTACTGCA
 121 GACACCTGTC CACCGTGAGA ACCCACCTAT CAGCCCTGGT CAATCACATG ATCGTGTCTC
 181 CAGACTTGCC CTGCGATGCT GGACAGGGAC TGACAGCCAG GATCTGGGAG CAGTACCTTC
 241 ACGACAGCAC AAGTTACGAG GAGCAGGAGC TGCTGCGCCT CATCTACTAC GGGGGCATCC
 301 AGCCTGAGAT CCGCAAGGCC GTGTGGCCCT TCCTCCTGGG CCACTACCAG TTCGGGATGA
 361 CGGAAACAGA AAGGAAAGAG GTGGACGAGC AGATTCATGC CTGCTATGCA CAGACCATGG
 421 CTGAGTGGCT GGGCTGCGAG GCGATCGTGC GGCAGAGGGA GCGGGAGTCC CATGCGGCCG
 481 CCCTGGCCAA ATGCTCATCC GGGGCCAGCT TGGACAGCCA CCTGCACCGG ATGTTGCACA
 541 GGGACTCAAC CATCAGCAAT GAGTCCTCCC AGAGCTGCAG TTCGGGCCGC CAGAACATCC
 601 GCCTGCACAG CGACTCCAGC AGCAGCACAC AGGTGTTTGA GTCTGTGGAT GAGGTGGAGC
 661 AGGTGGAGGC TGAAGGCAGA TTGGAGGAGA AACAGCCCAA GATCCCCAAT GGGAACCTAG
 721 TGAACGGCAC TTGTTCCCCA GACTCGGGTC ATCCTTCCTC CCATAACTTC TCCTCGGGCC
 781 TCTCAGAGCA CTCAGAGCCC AGTCTGAGCA CAGAAGACAG TGTCTTGGAC GCCCAGCGGA
 841 ACACCCCCAC GGTGCTGCGA CCTAGGGATG GCAGCGTGGA TGACAGGCAG AGCAGCGAGG
 901 CCACCACATC TCAGGATGAG GCTCCCCGGG AGGAGCTGGC CGTGCAGGAC AGCCTGGAGA
 961 GTGACCTCCT GGCCAACGAG AGCATGGACG AGTTCATGTC CATCACGGGC AGCCTGGACA
1021 TGGCCCTGCC TGAAAAGGAC GATGTTGTGA TGGAGGGCTG GAGGAGCAGC GAGACAGAGA
1081 AACATGGCCA GGCGGACAGT GAGGACAACC TCTCGGAGGA GCCTGAGATG GAAAGTCTCT
1141 TCCCTGCCCT GGCTTCTCTG GCTGTGACTA CTTCTGCCAA CGAGGTGTCC CCTGTGTCTT
1201 CCAGCGGCGT CACCTACTCT CCAGAGCTGC TGGATCTGTA CACGGTGAAC CTGCACCGCA
1261 TCGAGAAGGA TGTGCAGAGG TGCGACCGCA ACTACTGGTA CTTCACGCCC GCCAACTTGG
1321 AGAAGCTGCG TAACATCATG TGCAGCTACA TCTGGCAGCA CATTGAGATC GGCTATGTCC
1381 AGGGCATGTG TGATCTTCTG GCTCCACTGC TGGTCATTCT GGATGATGAG GCCCTTGCCT
1441 TCAGCTGCTT CACGGAGCTC ATGAAGAGGA TGAACCAGAA CTTCCCCCAC GGAGGCGCCA
1501 TGGACACGCA CTTTGCAAAC ATGAGATCGT TGATCCAGAT CCTGGACTCA GAGCTGTTTG
1561 AGCTGATGCA TCAGAACGGG GACTATACTC ACTTCTACTT CTGCTACCGC TGGTTCCTGC
1621 TGGATTTCAA GCGAGAACTC GTCTATGATG ACGTCTTCTT GGTCTGGGAG ACCATCTGGG
1681 CAGCCAAACA CGTCTCCTCT GCGCACTACG TCCTGTTCAT TGCGCTGGCT CTGGTGGAAG
1741 TCTACCGTGA CATCATTTTG GAGAACAACA TGGATTTCAC AGACATCATC AAATTCTTTA
1801 ATGAAATGGC TGAGCGACAC AACACCAAGC AAGTCCTGAA GCTGGCGCGG GACCTCGTGT
1861 ACAAGGTGCA GACTCTGATT GAGAACAAGT TGCCAACTTT CTTGTACAAA GTtggcatta
1921 taagaaagca ttgcttatca atttgttgca acgaac
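
The ORF coordinates can be applied to the numbered sequence block with a few lines of Python. The sketch below assumes that "ORF start" is a 1-based position and that "ORF end" is the position just past the last ORF base, which is the reading consistent with the listed length (1890 - 69 = 1821). The function names are hypothetical, and the demo uses only the first two lines of the sequence block.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_numbered_sequence(text: str) -> str:
    """Join a numbered sequence block into one string, keeping letter case."""
    bases = []
    for line in text.splitlines():
        # Drop the leading coordinate, whether or not a space follows it.
        line = re.sub(r"^\s*\d+", "", line)
        bases.append(re.sub(r"\s+", "", line))
    return "".join(bases)


def extract_orf(seq: str, orf_start: int, orf_end: int) -> str:
    """Slice the ORF, assuming a 1-based start and an exclusive end."""
    return seq[orf_start - 1 : orf_end - 1]


def has_in_frame_stop(orf: str) -> bool:
    """Report whether any complete in-frame codon is a stop codon."""
    orf = orf.upper()
    return any(orf[i:i + 3] in STOP_CODONS for i in range(0, len(orf) - 2, 3))


# First two lines of the sequence block from this record.
head = """1gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
61TTGGCACCAT GAAGTACCAG ATCCTCTCCA GAGCCTTCTA TGGATGGCTG GCCTACTGCA"""

seq = parse_numbered_sequence(head)
print(len(seq))       # 120
print(seq[68:71])     # ATG, the start codon at position 69
```

Applied to the full sequence, `extract_orf(seq, 69, 1890)` would return 1821 bases (607 codons) whose final codon is AAG at positions 1887 to 1889, and `has_in_frame_stop` offers a quick check against the "No stop codon in insert" note.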
