BLASTX 1.4.13-Paracel [2002-12-12]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AY101528.f
         (1401 letters)

Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr 
           1,339,046 sequences; 429,188,541 total letters

Searching...................................................done


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_201555.2|  glycosyl hydrolase family 43; protein id: ...   904   0.0  
dbj|BAB08462.1|  emb|CAB66926.1~gene_id:K9I9.10~strong simil...   879   0.0  
ref|NP_190557.1|  glycosyl hydrolase family 43; protein id: ...   670   0.0  
dbj|BAC06214.1|  P0018C10.15 [Oryza sativa (japonica cultiva...   573   e-162
emb|CAA10760.1|  hypothetical protein [Arabidopsis thaliana]      510   e-143
>ref|NP_201555.2| glycosyl hydrolase family 43; protein id: At5g67540.1, supported by
            cDNA: gi_17933294, supported by cDNA: gi_20856123
            [Arabidopsis
            thaliana]|gi|17933295|gb|AAL48230.1|AF446357_1
            AT5g67540/K9I9_10 [Arabidopsis
            thaliana]|gi|20856124|gb|AAM26649.1| AT5g67540/K9I9_10
            [Arabidopsis thaliana]
          Length = 466

 Score =  904 bits (2335), Expect = 0.0
 Identities = 432/466 (92%), Positives = 432/466 (92%)
 Frame = +1

Query: 1    MSGYSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVV 180
            MSGYSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVV
Sbjct: 1    MSGYSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVV 60

Query: 181  HHLAHPXXXXXXXXXXXXXXMXXXXXXXXXXXXXXXXXXXXLVEEFLDDKSPIRHLFFPG 360
            HHLAHP              M                    LVEEFLDDKSPIRHLFFPG
Sbjct: 61   HHLAHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRHLFFPG 120

Query: 361  IKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPT 540
            IKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPT
Sbjct: 121  IKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPT 180

Query: 541  YHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNE 720
            YHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNE
Sbjct: 181  YHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNE 240

Query: 721  KTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAY 900
            KTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAY
Sbjct: 241  KTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAY 300

Query: 901  LIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAP 1080
            LIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAP
Sbjct: 301  LIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAP 360

Query: 1081 NEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNP 1260
            NEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNP
Sbjct: 361  NEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNP 420

Query: 1261 ADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 1398
            ADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP
Sbjct: 421  ADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 466
>dbj|BAB08462.1| emb|CAB66926.1~gene_id:K9I9.10~strong similarity to unknown protein
            [Arabidopsis thaliana]
          Length = 471

 Score =  879 bits (2270), Expect = 0.0
 Identities = 419/453 (92%), Positives = 419/453 (92%)
 Frame = +1

Query: 40   AGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHLAHPXXXXXXX 219
            AGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHLAHP       
Sbjct: 19   AGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHLAHPIVRELIR 78

Query: 220  XXXXXXXMXXXXXXXXXXXXXXXXXXXXLVEEFLDDKSPIRHLFFPGIKTAAFGPTKDMG 399
                   M                    LVEEFLDDKSPIRHLFFPGIKTAAFGPTKDMG
Sbjct: 79   VEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRHLFFPGIKTAAFGPTKDMG 138

Query: 400  NETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHAHKKGPARVDI 579
            NETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHAHKKGPARVDI
Sbjct: 139  NETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHAHKKGPARVDI 198

Query: 580  IGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 759
            IGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD
Sbjct: 199  IGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEKYVMWMHIDD 258

Query: 760  ANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYSSEVNSVLHI 939
            ANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYSSEVNSVLHI
Sbjct: 259  ANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYSSEVNSVLHI 318

Query: 940  GPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEALAHAAESIMG 1119
            GPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEALAHAAESIMG
Sbjct: 319  GPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEALAHAAESIMG 378

Query: 1120 PWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLRDSRYVWLPL 1299
            PWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLRDSRYVWLPL
Sbjct: 379  PWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLRDSRYVWLPL 438

Query: 1300 VIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 1398
            VIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP
Sbjct: 439  VIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471
>ref|NP_190557.1| glycosyl hydrolase family 43; protein id: At3g49880.1, supported by
            cDNA: gi_17381059, supported by cDNA: gi_20466058
            [Arabidopsis thaliana]|gi|11280711|pir||T46054
            hypothetical protein T16K5.230 - Arabidopsis
            thaliana|gi|6723433|emb|CAB66926.1| putative protein
            [Arabidopsis thaliana]|gi|17381060|gb|AAL36342.1| unknown
            protein [Arabidopsis thaliana]|gi|20466059|gb|AAM20364.1|
            unknown protein [Arabidopsis thaliana]
          Length = 466

 Score =  670 bits (1728), Expect = 0.0
 Identities = 323/450 (71%), Positives = 359/450 (79%), Gaps = 1/450 (0%)
 Frame = +1

Query: 52   RYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHLAHPXXXXXXXXXXX 231
            R S   +V TVVG   + HL  LYSR   ++   +S   L     + HP           
Sbjct: 16   RCSPFGLVSTVVGCVFMIHLTMLYSRS-YSVDLDLSPQLL-----IHHPIVRELERVEEE 69

Query: 232  XXXMXXXXXXXXXXXXXXXXXXXXLVEEFLDDKSPIRHLFFPGIKTAAFGPTKDMGNETS 411
               M                    LVEEFLD+ S IRHLFFP +K+A FGPTK+  N+TS
Sbjct: 70   NIHMPPPRKRSPRAIKRKPKTPTTLVEEFLDENSQIRHLFFPDMKSA-FGPTKEDTNDTS 128

Query: 412  -YYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHAHKKGPARVDIIGV 588
             YYFPG+IW DT+GNPIQAHGGGIL D  S  YYWYGEYKDGPTY +HKKG ARVDIIGV
Sbjct: 129  HYYFPGRIWTDTEGNPIQAHGGGILFDDISKVYYWYGEYKDGPTYLSHKKGAARVDIIGV 188

Query: 589  GCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEKYVMWMHIDDANY 768
            GCYSSKDLWTWKNEG+VL AEET++THDLHKSNVLERPKVIYN  T KYVMWMHIDDANY
Sbjct: 189  GCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKVIYNSDTGKYVMWMHIDDANY 248

Query: 769  TKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYSSEVNSVLHIGPL 948
            TKASVGVAIS++PTGPF+YLYS+ PHGFDSRDMTV+KDDD VAYLIYSSE NSVLHIGPL
Sbjct: 249  TKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDDNVAYLIYSSEDNSVLHIGPL 308

Query: 949  TEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEALAHAAESIMGPWE 1128
            TE+YLDV PVMKR+MVGQHREAPAIFKHQN YYM+TS CTGWAPNEALAHAAESIMGPWE
Sbjct: 309  TENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCTGWAPNEALAHAAESIMGPWE 368

Query: 1129 KLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLRDSRYVWLPLVIG 1308
             LGNPC+GGN +FR TTFFAQST+VIPLPGVPG FIFMADRWNPADLRDSRY+WLPL++G
Sbjct: 369  TLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMADRWNPADLRDSRYLWLPLIVG 428

Query: 1309 GPADQPLEFNFGFPSWSRVSIYWHSKWRLP 1398
            GPAD+PLE++FGFP WSRVS+YWH +WRLP
Sbjct: 429  GPADRPLEYSFGFPMWSRVSVYWHRQWRLP 458
>dbj|BAC06214.1| P0018C10.15 [Oryza sativa (japonica
            cultivar-group)]|gi|22202681|dbj|BAC07339.1| P0471B04.23
            [Oryza sativa (japonica cultivar-group)]
          Length = 465

 Score =  573 bits (1476), Expect = e-162
 Identities = 274/462 (59%), Positives = 337/462 (72%), Gaps = 8/462 (1%)
 Frame = +1

Query: 37   FAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKD---NNIHQQVSS-----DQLQVVHHLA 192
            F  GCR SL  IVW++VG  L+    S+  + D   N+I+ +  S     ++++  H   
Sbjct: 15   FDAGCRSSLSFIVWSLVGVALIVCFFSVVRQADTRQNHIYFRHLSATRELEEIEEEHFRL 74

Query: 193  HPXXXXXXXXXXXXXXMXXXXXXXXXXXXXXXXXXXXLVEEFLDDKSPIRHLFFPGIKTA 372
             P                                   +++++LD+ S +  LFFP  ++A
Sbjct: 75   PPPHKVNPRAVKRRGPRKAPK----------------VIDQYLDESSAVHALFFPDERSA 118

Query: 373  AFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHAH 552
               PTK  GN++ Y++PG++W+DT G+ IQAHGGGIL D  +  YYWYGE KDG TY  H
Sbjct: 119  V-NPTKG-GNDSMYFYPGRVWLDTDGHAIQAHGGGILYDHITAKYYWYGENKDGLTYQTH 176

Query: 553  KKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTEK 732
             K   RVDIIGV CYSSKDLW+W NEGIVL  E TN THDLHKS VLERPKVIYN+ T +
Sbjct: 177  PKSTYRVDIIGVSCYSSKDLWSWTNEGIVLPGEPTNFTHDLHKSKVLERPKVIYNDHTGQ 236

Query: 733  YVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIYS 912
            YVMWMHIDDANYTKASVGVA+SNSPTGPF YLYS RPHGF+SRDMT+FKDDDG AYL YS
Sbjct: 237  YVMWMHIDDANYTKASVGVAVSNSPTGPFTYLYSFRPHGFESRDMTIFKDDDGSAYLFYS 296

Query: 913  SEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEAL 1092
            S  N+ LH+ PLT+DYL++T  M+R+++ +HREAPA+FK Q  YYM+TS C+GWAPN AL
Sbjct: 297  SRDNTELHVSPLTKDYLNITVAMRRILIRRHREAPAVFKLQGTYYMITSGCSGWAPNRAL 356

Query: 1093 AHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADLR 1272
            AHAAESIMGPWE LGNPC+GGN+ FRLTTF +QST+V+PLPG+PG FIFMADRWNP++L+
Sbjct: 357  AHAAESIMGPWETLGNPCVGGNRFFRLTTFLSQSTFVLPLPGLPGTFIFMADRWNPSNLK 416

Query: 1273 DSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 1398
            DSRYVWLPL IGG AD+PL+++FGFP+WSRVSIYWH KWRLP
Sbjct: 417  DSRYVWLPLFIGGLADEPLDYSFGFPAWSRVSIYWHRKWRLP 458
>emb|CAA10760.1| hypothetical protein [Arabidopsis thaliana]
          Length = 239

 Score =  510 bits (1313), Expect = e-143
 Identities = 239/239 (100%), Positives = 239/239 (100%)
 Frame = +1

Query: 682  SNVLERPKVIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSR 861
            SNVLERPKVIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSR
Sbjct: 1    SNVLERPKVIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSR 60

Query: 862  DMTVFKDDDGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNI 1041
            DMTVFKDDDGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNI
Sbjct: 61   DMTVFKDDDGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNI 120

Query: 1042 YYMVTSWCTGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGV 1221
            YYMVTSWCTGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGV
Sbjct: 121  YYMVTSWCTGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGV 180

Query: 1222 PGAFIFMADRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 1398
            PGAFIFMADRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP
Sbjct: 181  PGAFIFMADRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 239
  Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr
    Posted date:  Feb 17, 2003 10:02 AM
  Number of letters in database: 429,188,541
  Number of sequences in database:  1,339,046
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 429,188,541
effective HSP length: 127
effective length of database: 259,129,699
effective search space used: 87844967961
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)







BLAST Search Results