Construct: ORF ccsbBroadEn_02699
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF013046.1_s300c1, BRDN0000393737
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- ARSG (22901)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 22901 | ARSG | arylsulfatase G | NM_001267727.1 | 100% | 100% | |
2 | human | 22901 | ARSG | arylsulfatase G | NM_001352899.1 | 100% | 100% | |
3 | human | 22901 | ARSG | arylsulfatase G | NM_001352900.2 | 100% | 100% | |
4 | human | 22901 | ARSG | arylsulfatase G | NM_001352901.2 | 100% | 100% | |
5 | human | 22901 | ARSG | arylsulfatase G | NM_001352902.2 | 100% | 100% | |
6 | human | 22901 | ARSG | arylsulfatase G | NM_014960.5 | 100% | 100% | |
7 | human | 22901 | ARSG | arylsulfatase G | XM_017024365.1 | 100% | 100% | |
8 | human | 22901 | ARSG | arylsulfatase G | NM_001352903.2 | 99.8% | 99.8% | 704_705insCAC |
9 | human | 22901 | ARSG | arylsulfatase G | NM_001352904.1 | 99.8% | 99.8% | 704_705insCAC |
10 | human | 22901 | ARSG | arylsulfatase G | NM_001352905.2 | 99.8% | 99.8% | 704_705insCAC |
11 | human | 22901 | ARSG | arylsulfatase G | NM_001352906.1 | 99.8% | 99.8% | 704_705insCAC |
12 | human | 22901 | ARSG | arylsulfatase G | NM_001352907.2 | 99.8% | 99.8% | 704_705insCAC |
13 | human | 22901 | ARSG | arylsulfatase G | XM_011524536.2 | 95.1% | 95.1% | 1212_1292del |
14 | human | 22901 | ARSG | arylsulfatase G | XM_011524537.1 | 95.1% | 95.1% | 1212_1292del |
15 | human | 22901 | ARSG | arylsulfatase G | XM_017024360.2 | 95.1% | 95.1% | 1212_1292del |
16 | human | 22901 | ARSG | arylsulfatase G | NM_001352910.2 | 85.9% | 83.2% | (many diffs) |
17 | human | 22901 | ARSG | arylsulfatase G | NM_001352909.2 | 82.8% | 80.2% | (many diffs) |
18 | human | 22901 | ARSG | arylsulfatase G | XM_017024368.1 | 78.6% | 71.4% | 1090_1091ins121;1239_1240ins215 |
19 | human | 22901 | ARSG | arylsulfatase G | XM_024450658.1 | 65.3% | 65.3% | 0_1ins492;720_800del |
20 | human | 22901 | ARSG | arylsulfatase G | XM_011524546.2 | 50% | 50% | 0_1ins747;465_545del |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1641
- ORF length:
- 1575
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGG CTGGCTTTTT CTAAAGGTTT TGTTGGCGGG AGTGAGTTTC TCAGGATTTC 121 TTTATCCTCT TGTGGATTTT TGCATCAGTG GGAAAACAAG AGGACAGAAG CCAAACTTTG 181 TGATTATTTT GGCCGATGAC ATGGGGTGGG GTGACCTGGG AGCAAACTGG GCAGAAACAA 241 AGGACACTGC CAACCTTGAT AAGATGGCTT CGGAGGGAAT GAGGTTTGTG GATTTCCATG 301 CAGCTGCCTC CACCTGCTCA CCCTCCCGGG CTTCCTTGCT CACCGGCCGG CTTGGCCTTC 361 GCAATGGAGT CACACGCAAC TTTGCAGTCA CTTCTGTGGG AGGCCTTCCG CTCAACGAGA 421 CCACCTTGGC AGAGGTGCTG CAGCAGGCGG GTTACGTCAC TGGGATAATA GGCAAATGGC 481 ATCTTGGACA CCACGGCTCT TATCACCCCA ACTTCCGTGG TTTTGATTAC TACTTTGGAA 541 TCCCATATAG CCATGATATG GGCTGTACTG ATACTCCAGG CTACAACCAC CCTCCTTGTC 601 CAGCGTGTCC ACAGGGTGAT GGACCATCAA GGAACCTTCA AAGAGACTGT TACACTGACG 661 TGGCCCTCCC TCTTTATGAA AACCTCAACA TTGTGGAGCA GCCGGTGAAC TTGAGCAGCC 721 TTGCCCAGAA GTATGCTGAG AAAGCAACCC AGTTCATCCA GCGTGCAAGC ACCAGCGGGA 781 GGCCCTTCCT GCTCTATGTG GCTCTGGCCC ACATGCACGT GCCCTTACCT GTGACTCAGC 841 TACCAGCAGC GCCACGGGGC AGAAGCCTGT ATGGTGCAGG GCTCTGGGAG ATGGACAGTC 901 TGGTGGGCCA GATCAAGGAC AAAGTTGACC ACACAGTGAA GGAAAACACA TTCCTCTGGT 961 TTACAGGAGA CAATGGCCCG TGGGCTCAGA AGTGTGAGCT AGCGGGCAGT GTGGGTCCCT 1021 TCACTGGATT TTGGCAAACT CGTCAAGGGG GAAGTCCAGC CAAGCAGACG ACCTGGGAAG 1081 GAGGGCACCG GGTCCCAGCA CTGGCTTACT GGCCTGGCAG AGTTCCAGTT AATGTCACCA 1141 GCACTGCCTT GTTAAGCGTG CTGGACATTT TTCCAACTGT GGTAGCCCTG GCCCAGGCCA 1201 GCTTACCTCA AGGACGGCGC TTTGATGGTG TGGACGTCTC CGAGGTGCTC TTTGGCCGGT 1261 CACAGCCTGG GCACAGGGTG CTGTTCCACC CCAACAGCGG GGCAGCTGGA GAGTTTGGAG 1321 CCCTGCAGAC TGTCCGCCTG GAGCGTTACA AGGCCTTCTA CATTACCGGT GGAGCCAGGG 1381 CGTGTGATGG GAGCACGGGG CCTGAGCTGC AGCATAAGTT TCCTCTGATT TTCAACCTGG 1441 AAGACGATAC CGCAGAAGCT GTGCCCCTAG AAAGAGGTGG TGCGGAGTAC CAGGCTGTGC 1501 TGCCCGAGGT CAGAAAGGTT CTTGCAGACG TCCTCCAAGA CATTGCCAAC GACAACATCT 1561 CCAGCGCAGA TTACACTCAG GACCCTTCAG TAACTCCCTG CTGTAATCCC TACCAAATTG 1621 CCTGCCGCTG TCAAGCCGCA TACCCAACTT TCTTGTACAA AGTtggcatt ataagaaagc 1681 attgcttatc aatttgttgc aacgaac