BLASTX 1.5.4-Paracel [2003-06-05]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= PTA.CL.12.cl.1287.Contig1
(2432 letters)
Database: BlastDB/NCBI/blast/db/FASTA/2004_07_04_01_00_0/nr
1,895,298 sequences; 629,712,918 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAO00820.1| Unknown protein [Arabidopsis thaliana] 1071 0.0
ref|XP_408922.1| hypothetical protein AN4785.2 [Aspergillus... 830 0.0
ref|XP_412248.1| hypothetical protein AN8111.2 [Aspergillus... 274 6e-72
ref|XP_385963.1| hypothetical protein FG05787.1 [Gibberella... 250 1e-64
ref|NP_015076.1| DNA-binding transcription factor required ... 243 1e-62
ref|XP_408471.1| hypothetical protein AN4334.2 [Aspergillus... 216 3e-54
ref|XP_409685.1| hypothetical protein AN5548.2 [Aspergillus... 187 1e-45
ref|XP_401329.1| hypothetical protein UM03714.1 [Ustilago m... 169 3e-40
ref|NP_985390.1| AFL160Cp [Eremothecium gossypii]|gi|449842... 169 4e-40
sp|P08657|LAC9_KLULA Lactose regulatory protein LAC9|gi|173... 167 1e-39
>gb|AAO00820.1| Unknown protein [Arabidopsis thaliana]
Length = 549
Score = 1071 bits (2769), Expect = 0.0
Identities = 522/549 (95%), Positives = 522/549 (95%)
Frame = +3
Query: 582 MAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPLKAQLLESAPSGLI 761
MAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPLKAQLLESAPSGLI
Sbjct: 1 MAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPLKAQLLESAPSGLI 60
Query: 762 EQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSS 941
EQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSS
Sbjct: 61 EQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSS 120
Query: 942 DLDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMS 1121
DLDITFYQEARGYLQQASVFETGN SNYAQKRNKPNTGWNYLGLAVRMAMS
Sbjct: 121 DLDITFYQEARGYLQQASVFETGNLTLVQGLLLLSNYAQKRNKPNTGWNYLGLAVRMAMS 180
Query: 1122 LGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHE 1301
LGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHE
Sbjct: 181 LGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHE 240
Query: 1302 DELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQPIEE 1481
DELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQPIEE
Sbjct: 241 DELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQPIEE 300
Query: 1482 WYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRWA 1661
WYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRWA
Sbjct: 301 WYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRWA 360
Query: 1662 PLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQA 1841
PLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQA
Sbjct: 361 PLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQA 420
Query: 1842 GLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLGPPIPPT 2021
GLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLGPPIPPT
Sbjct: 421 GLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLGPPIPPT 480
Query: 2022 QVIPTANPHEAXXXXXXXXXXXXXXXXXFPEQLFSDPGFGGSLFPVEQQMQMNAGGMDFS 2201
QVIPTANPHEA FPEQLFSDPGFGGSLFPVEQQMQMNAGGMDFS
Sbjct: 481 QVIPTANPHEAQQQPMHHIPQQPQNMMPFPEQLFSDPGFGGSLFPVEQQMQMNAGGMDFS 540
Query: 2202 EWVNFPPTE 2228
EWVNFPPTE
Sbjct: 541 EWVNFPPTE 549
>ref|XP_408922.1| hypothetical protein AN4785.2 [Aspergillus nidulans FGSC
A4]|gi|40741165|gb|EAA60355.1| hypothetical protein
AN4785.2 [Aspergillus nidulans FGSC A4]
Length = 666
Score = 830 bits (2144), Expect = 0.0
Identities = 436/716 (60%), Positives = 505/716 (69%), Gaps = 7/716 (0%)
Frame = +3
Query: 99 IKMASTRDSHSFAVCTHGIRGDQI*LHV*CDECRLRKSRCSKERPVCLQCRQLNKECLYS 278
I M STRDSHS+A C+QL+KEC YS
Sbjct: 6 ITMGSTRDSHSYA------------------------------------CKQLDKECKYS 29
Query: 279 PKVSRSPLTRQHLTYVEDRLHSFETALGRLFPGGDLDATLRSLLQN--PEXXXXXXXXXX 452
PK++RSPLTRQHLTYVEDRL +FE+ALGRLFPGGDLDAT+RSLLQ+ P
Sbjct: 30 PKITRSPLTRQHLTYVEDRLQAFESALGRLFPGGDLDATVRSLLQDQDPLSKERSSSKSS 89
Query: 453 XXXXXXXXHXXXXXXXXXXXXXXXXDGFDWAEKEITLGDLTDGMAALSIKPEGAGYFGAS 632
DGFDWAE ITLGDLTDGMAALSIKPEGAGYFGAS
Sbjct: 90 SRHSTPAKTEADRHESAPEALPQQADGFDWAENRITLGDLTDGMAALSIKPEGAGYFGAS 149
Query: 633 SSVVPLRALFDHGFDLNMPVRSAR----SGGVPLKAQLLESAPSGLIEQAFIDAFFLNYH 800
SSVVPLRAL HGFDLN+P S++ S VPLK+QLL APSG+IEQAF+DAFF NYH
Sbjct: 150 SSVVPLRALLKHGFDLNIPSGSSKRVDNSDRVPLKSQLLNIAPSGVIEQAFMDAFFNNYH 209
Query: 801 TSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSSDLDITFYQEARGY 980
SYPFVHE TFRAQF++ RPHG AW ILLNTILALGAWCIGDD+SDLDITFYQEAR
Sbjct: 210 MSYPFVHEATFRAQFHEQLPRPHGPAWQILLNTILALGAWCIGDDNSDLDITFYQEARSR 269
Query: 981 LQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMSLGLHKEFPGWKIS 1160
LQQ SVFE GN SNYAQKRNKPNTGWN+LGLAVRM+MSLGLHKEF GWKIS
Sbjct: 270 LQQMSVFEAGNLTLVQALLFLSNYAQKRNKPNTGWNFLGLAVRMSMSLGLHKEFHGWKIS 329
Query: 1161 LLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHEDELTPTTTALPDE 1340
LLQRE+RRRLWWGV+IFDSGAAKTFGRPILLPE+SVMD K V NIH++ LT TTT +P E
Sbjct: 330 LLQREVRRRLWWGVYIFDSGAAKTFGRPILLPEDSVMDVKHVLNIHDEALTSTTTVVPPE 389
Query: 1341 SLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQPIEEWYNGLPSYLQHPQ 1520
PT+Y+G++AQA+FHILTNSVYQRLIS P+ TPE+TL LQ+P+EEWYN LP Y+++P
Sbjct: 390 VNEPTLYTGMLAQAKFHILTNSVYQRLISGPNPTPEETLSLQKPMEEWYNSLPDYIKNP- 448
Query: 1521 MPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRWAPLSGMPGTPDAGT 1700
P S+ D+ ALVR+RLLWR+WNL ILI+RPILLRWAS+RW P P P
Sbjct: 449 APGSMS------DNFALVRSRLLWRDWNLRILIYRPILLRWASKRWTP--NTPTEP---- 496
Query: 1701 EDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQAGLIPIVFFMTDPS 1880
EDP E ECR+ C +NA+LTISSI+++++NY CTR+GAWYMLYFLFQAGLIPI+ MTDP+
Sbjct: 497 EDPYEAECRMLCFRNAKLTISSITDFVNNYPCTRVGAWYMLYFLFQAGLIPIILLMTDPT 556
Query: 1881 NPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLGPPIPPTQVIPTANPHEAXX 2060
+ EA +W+Q+IE TKALL +PSL N A RC +VI RL P P + P +
Sbjct: 557 SAEAPSWIQEIEATKALLMYPSLSNNNLAGRCLDVIYRLCAPVYPSNATSSASAPSQ--- 613
Query: 2061 XXXXXXXXXXXXXXXFPEQLFSDPGFGGSLFP-VEQQMQMNAGGMDFSEWVNFPPT 2225
F +QL++DP F GSLFP V Q + ++A GMDFSEWVNF PT
Sbjct: 614 -------QPQPIYMPFADQLYNDPTF-GSLFPDVNQDLNVSA-GMDFSEWVNFAPT 660
>ref|XP_412248.1| hypothetical protein AN8111.2 [Aspergillus nidulans FGSC
A4]|gi|40740543|gb|EAA59733.1| hypothetical protein
AN8111.2 [Aspergillus nidulans FGSC A4]
Length = 753
Score = 274 bits (701), Expect = 6e-72
Identities = 185/496 (37%), Positives = 243/496 (48%), Gaps = 6/496 (1%)
Frame = +3
Query: 534 FDWAEKEITLGDLTDGMAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARS-G 710
F+W E+ DGMA+L+ + GY L D D N V
Sbjct: 216 FEWDERAGVDNKFVDGMASLTSRSNEGGY---------LETRDDDAGDSNGQVHEYECRA 266
Query: 711 GVPLKAQLLESAPSGLIEQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHIL 890
+P + + F+DAFF YH SYP VHE TFRAQF + RP G W +L
Sbjct: 267 SIPFVLNTMSQL------EPFVDAFFRLYHCSYPIVHEATFRAQFMEIIPRPPGNTWQVL 320
Query: 891 LNTILALGAWC--IGDDSSDLDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKR 1064
L I ALG + +SD+D+ + A+ L V ETGN SNY QKR
Sbjct: 321 LFVISALGVFTSSTATSTSDVDLALFDAAKERLS-IDVLETGNLVLVQALTLISNYLQKR 379
Query: 1065 NKPNTGWNYLGLAVRMAMSLGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRP 1244
NKPN+G+NY+GLA R+AM +GLHKEFP W SLL E+RRR+W+ ++IFD GA TF RP
Sbjct: 380 NKPNSGYNYMGLARRVAMGIGLHKEFPTWDASLLTLEMRRRVWYCLYIFDIGAMITFSRP 439
Query: 1245 ILLPEESVMDAKQVRNIHEDELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLI 1424
+ P E V D K N H+ ++T TT LP + TIYS L AQA+FH+ T+S+Y ++I
Sbjct: 440 LDFPVEGV-DVKLPMNAHDSDITSTTRHLPPPAYETTIYSHLRAQAQFHLATSSIYSKII 498
Query: 1425 SSPSLTPEDTLKLQQP-IEEWYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNW 1601
S P + + L+L I W L Y P + P AL L WR
Sbjct: 499 SHPLPSATELLQLDDTLIGGWLANLAPYFAEPAV---------QPQKFALAHAILCWRYR 549
Query: 1602 NLTILIHRPILLRWASRRWAPLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYM 1781
N IL++RP L+ R + +G G + GT+ E RCL A + I +
Sbjct: 550 NFRILMYRPFLVGRVMVR-SRGAGTQG-QEVGTDIDLAIE---RCLAAAAEAVELICTFW 604
Query: 1782 DNYI--CTRLGAWYMLYFLFQAGLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGT 1955
+ T + WY LYFLFQA LI ++ DP +P A W I +L +L
Sbjct: 605 LTQVHHRTMMACWYGLYFLFQAILISVICLRNDPQSPLAQGWRSQISQAIEVLESMAL-M 663
Query: 1956 NRFATRCYEVINRLLG 2003
N A RC VI L G
Sbjct: 664 NSSARRCLRVITDLCG 679
Score = 59.3 bits (142), Expect = 4e-07
Identities = 28/63 (44%), Positives = 38/63 (59%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETALGR 365
C ECR RKS+C + PVC C + ++C Y K +R+PLTR HL+ VE+ L + L R
Sbjct: 35 CRECRRRKSKCDRAIPVCRLCSKYKRQCTYE-KPARTPLTRTHLSRVENELARTKALLRR 93
Query: 366 LFP 374
P
Sbjct: 94 FMP 96
>ref|XP_385963.1| hypothetical protein FG05787.1 [Gibberella zeae
PH-1]|gi|42553019|gb|EAA75862.1| hypothetical protein
FG05787.1 [Gibberella zeae PH-1]
Length = 1332
Score = 250 bits (638), Expect = 1e-64
Identities = 176/528 (33%), Positives = 257/528 (48%), Gaps = 35/528 (6%)
Frame = +3
Query: 528 DGFDWAEKEITLGD-------------------LTDGMAALSIKPEGAGYFGASSSVVPL 650
D F+W E+E + G + DGMA+L+I + GY+GA+S L
Sbjct: 769 DDFEWNEQETSWGTYDPASWSITNHVDGNPSQAIMDGMASLTIGDQKRGYYGAASGSALL 828
Query: 651 RALFD--------------HGFDLNMPVRSARSGGVPLKAQLLESAPSGLIEQAFIDAFF 788
R + H + S S +A L A LI DAFF
Sbjct: 829 RQILSARPDGEEVDNDVALHEIESLFQQHSDHSQWFRSQAMLTRVAVENLI-----DAFF 883
Query: 789 LNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSSDLDITFYQE 968
+ YH ++P VHEPTFRAQ+ + W+ L N + ALG++ + + D+ +Q
Sbjct: 884 VFYHPTFPIVHEPTFRAQYAGTLPCANKGHWNTLANILAALGSFASSNVADATDLPIFQA 943
Query: 969 ARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMSLGLHKEFPG 1148
A+ L ++ E GN SNY QKRNKPNTG+NY GLA+R+A+ LGLHK+ G
Sbjct: 944 AQKSLF-SNYLEVGNLTLVQAFSLSSNYMQKRNKPNTGFNYGGLAIRLAIGLGLHKDLEG 1002
Query: 1149 WKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHEDELTPTTTA 1328
+S LQ E RRR+WW + + D GA T+GRP+ P+ V A +NIHE +LT +T
Sbjct: 1003 NSLSPLQSETRRRIWWCLCVLDVGATITYGRPLNWPQAGVETAFP-QNIHEKDLTHDSTH 1061
Query: 1329 LPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQP-IEEWYNGLPSY 1505
P E G T+Y+ + Q+ +H+ T +VY RLI+SP + + + L I W +P Y
Sbjct: 1062 CPPEVDGITMYTYIRVQSAYHLSTMTVYNRLITSPFPSATELITLDDVCIGSWLAQVPYY 1121
Query: 1506 LQHPQMPLSVQPTNPDPDS-LALVRNRLLWRNWNLTILIHRPILLRWASRRWAPLSGMPG 1682
+ T P PDS L WR NL I+++RP L+RWA S
Sbjct: 1122 YR----------TVPPPDSEYGLGMGISEWRYRNLRIVMYRPFLVRWAR------SSAQN 1165
Query: 1683 TPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQAGLIPIVF 1862
T T E RCL A+ ++ I Y + +RL A+Y+LYFLF A LIP+
Sbjct: 1166 TQQNLTSS--ENLAVFRCLDAAKESVMHIQSYWTSRSHSRLAAFYILYFLFHATLIPVHC 1223
Query: 1863 FMTDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLGP 2006
+P + A W I+ + A++ + N +++C ++ +L P
Sbjct: 1224 LRQNPHHALAPDWRSQIQASLAVMGGMA-ELNPNSSKCRDITLKLCWP 1270
Score = 35.0 bits (79), Expect = 8.0
Identities = 21/60 (35%), Positives = 25/60 (41%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETALGR 365
C ECR RK +C PVC C R+ LT +E+RL ET L R
Sbjct: 613 CRECRRRKVKCDGVMPVCTIC-------------------RKQLTLLEERLEKAETVLRR 653
>ref|NP_015076.1| DNA-binding transcription factor required for the activation of the
GAL genes in response to galactose; repressed by Gal80p
and activated by Gal3p; Gal4p [Saccharomyces
cerevisiae]|gi|1169823|sp|P04386|GAL4_YEAST Regulatory
protein GAL4|gi|73246|pir||RGBYG4 regulatory protein GAL4
- yeast (Saccharomyces
cerevisiae)|gi|171558|gb|AAA34626.1| GAL4
protein|gi|1061241|emb|CAA91596.1| GAL4 [Saccharomyces
cerevisiae]|gi|1370511|emb|CAA97969.1| GAL4
[Saccharomyces cerevisiae]|gi|4995952|dbj|BAA78208.1|
Gal4 [Drosophila melanogaster]
Length = 881
Score = 243 bits (620), Expect = 1e-62
Identities = 188/677 (27%), Positives = 281/677 (40%), Gaps = 56/677 (8%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETALGR 365
CD CRL+K +CSKE+P C +C + N EC YSPK RSPLTR HLT VE RL E
Sbjct: 11 CDICRLKKLKCSKEKPKCAKCLKNNWECRYSPKTKRSPLTRAHLTEVESRLERLEQLFLL 70
Query: 366 LFPGG-----------------------------------------DLDATLR----SLL 410
+FP D+ TLR S
Sbjct: 71 IFPREDLDMILKMDSLQDIKALLTGLFVQDNVNKDAVTDRLASVETDMPLTLRQHRISAT 130
Query: 411 QNPEXXXXXXXXXXXXXXXXXXHXXXXXXXXXXXXXXXXDGFDWAEKEITLGDLTDGMAA 590
+ E H GFDW+E++ D++DG+
Sbjct: 131 SSSEESSNKGQRQLTVSIDSAAHHDNSTIPLDFMPRDALHGFDWSEED----DMSDGLPF 186
Query: 591 LSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPLKAQLLE--SAPSGLIE 764
L P G+FG S + LR++ GF P S L + + + S
Sbjct: 187 LKTDPNNNGFFGDGSLLCILRSI---GFK---PENYTNSNVNRLPTMITDRYTLASRSTT 240
Query: 765 QAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAWCIGDDSSD 944
+ ++ N+H P VH PT +N+ W IL N ILA+GAWCI +S+D
Sbjct: 241 SRLLQSYLNNFHPYCPIVHSPTLMMLYNNQIEIASKDQWQILFNCILAIGAWCIEGESTD 300
Query: 945 LDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMSL 1124
+D+ +YQ A+ +L + VFE+G+ S Y Q R K NT +N+ ++RMA+SL
Sbjct: 301 IDVFYYQNAKSHL-TSKVFESGSIILVTALHLLSRYTQWRQKTNTSYNFHSFSIRMAISL 359
Query: 1125 GLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHED 1304
GL+++ P E RRR+WW V+ ++ + +GR I L + ++ V D
Sbjct: 360 GLNRDLPSSFSDSSILEQRRRIWWSVYSWEIQLSLLYGRSIQLSQNTISFPSSV-----D 414
Query: 1305 ELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQR----LISSPSLTPEDTLKLQQP 1472
++ TTT GPTIY G+I AR + +Y+ + + L +
Sbjct: 415 DVQRTTT-------GPTIYHGIIETARLLQVFTKIYELDKTVTAEKSPICAKKCLMICNE 467
Query: 1473 IEEWYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASR 1652
IEE P +LQ ++ + L+ R L W+ +L I + R + +
Sbjct: 468 IEEVSRQAPKFLQMDISTTALTNLLKEHPWLSFTRFELKWKQLSLIIYVLRDFFTNFTQK 527
Query: 1653 RWAPLSGMPGTPDAGTEDPCETE-CRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYF 1829
+ D E + C + A+ T+ S+S YMDN+ T AW Y+
Sbjct: 528 K------SQLEQDQNDHQSYEVKRCSIMLSDAAQRTVMSVSSYMDNHNVTPYFAWNCSYY 581
Query: 1830 LFQAGLIPIVFFM----TDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRL 1997
LF A L+PI + ++ N E A LQ I T LL + + + +V+ +
Sbjct: 582 LFNAVLVPIKTLLSNSKSNAENNETAQLLQQINTVLMLLKKLATFKIQTCEKYIQVLEEV 641
Query: 1998 LGPPIPPTQVIPTANPH 2048
P + IP PH
Sbjct: 642 CAPFLLSQCAIPL--PH 656
>ref|XP_408471.1| hypothetical protein AN4334.2 [Aspergillus nidulans FGSC
A4]|gi|40741305|gb|EAA60495.1| hypothetical protein
AN4334.2 [Aspergillus nidulans FGSC A4]
Length = 1488
Score = 216 bits (549), Expect = 3e-54
Identities = 171/609 (28%), Positives = 274/609 (44%), Gaps = 4/609 (0%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETALGR 365
CDEC+ RK RCS + CL C + K C YS + ++ + E ++ E A
Sbjct: 848 CDECKRRKIRCSGDEN-CLNCLRDAKACRYSSPSHQLSKLQRRVQDCERLINEMEQAWAT 906
Query: 366 LFPGGDLDATLRSLLQNPEXXXXXXXXXXXXXXXXXXHXXXXXXXXXXXXXXXXDGFDWA 545
P DL LRS+ Q + H D +++
Sbjct: 907 YLPSVDLQGALRSIRQ--QDGSASVTSKKIKHQHELTHSTEQPPTSVAEHSNAED-YEFD 963
Query: 546 EKEITLGDLTDGMAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPLK 725
E + + TDGM L+I P AGY G S V L+ L L +P+ S
Sbjct: 964 ESQ-DFDNSTDGMGFLTIDPHKAGYTGPQSGVAALKFL--QSLPLYIPLGSFIPASSLDD 1020
Query: 726 AQLLESAPSGLIEQA--FIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNT 899
+ +S P + ++DA+F YH +YP +HE TFRA+ +
Sbjct: 1021 EDICDSGPPRKQSEVARYLDAYFTFYHPAYPILHEGTFRARVS----------------- 1063
Query: 900 ILALGAWCIGDDSSDLDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNT 1079
GA+ + + +DI F++EAR +L V E G+ +NY QKRNKPN
Sbjct: 1064 ----GAFAGDSNGTKMDIVFFKEARKHLSM-DVLEKGSLSYVQAIVLMANYLQKRNKPNA 1118
Query: 1080 GWNYLGLAVRMAMSLGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPE 1259
+ +G+ MA+++GLH+EF S E+RRR+WW +FIF SG GRP +
Sbjct: 1119 AFILVGIGFSMALAIGLHREFGMPSTSPFTMEVRRRVWWTLFIFVSGVQLILGRPAV--S 1176
Query: 1260 ESVMDAKQVRNIHEDELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSL 1439
++ + N+ + +L LP+ S GPTI S LIAQ + + N++ L++
Sbjct: 1177 LVGVNVRLPANVDDHDLAVDMEELPESSTGPTITSCLIAQVKLAKIANAIQVELLTHHLP 1236
Query: 1440 TPEDTLKLQQPIEEWYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILI 1619
TP L+Q I W+N LP+ H + ++P P R +LWR+++L I+I
Sbjct: 1237 TPAKAQDLEQRISSWHNDLPA---HFNTEVYLEPKFDIP------RRVVLWRSYHLRIVI 1287
Query: 1620 HRPILLRWASRRWAPLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICT 1799
+RP L + + + + L+ G P A CL A + ++SI ++++
Sbjct: 1288 NRPFLFQNITSK-SELNTSSG-PIAS------------CLAAADMCVTSICGFLESTDNR 1333
Query: 1800 RLG-AWYMLYFLFQAGLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGTNR-FATR 1973
+ G WY +L A + ++ P++P A +W I+ +A+ SLG++ A R
Sbjct: 1334 QRGLTWYATCWLLTASFVQATCYVYGPTHPLAESWRSHIQ--RAVDCLGSLGSSHDMALR 1391
Query: 1974 CYEVINRLL 2000
V+ ++L
Sbjct: 1392 ARAVLQKVL 1400
>ref|XP_409685.1| hypothetical protein AN5548.2 [Aspergillus nidulans FGSC
A4]|gi|40743063|gb|EAA62253.1| hypothetical protein
AN5548.2 [Aspergillus nidulans FGSC A4]
Length = 1261
Score = 187 bits (474), Expect = 1e-45
Identities = 160/531 (30%), Positives = 241/531 (45%), Gaps = 43/531 (8%)
Frame = +3
Query: 534 FDWAEKEITLGDLTDGMAALSIKPEGAGYFGASSSVVPLRA---LFDHGFDLNM-----P 689
F+W E + + DGM +L +GY G SS L+ LF L+ P
Sbjct: 353 FEW--NEASTSNPPDGMVSLPTASAESGYLGHSSGTRLLQTISDLFPENVTLDQQQDDSP 410
Query: 690 VRSARSGGVPLKAQLLESAPSGLIEQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPH 869
R +++G L LL A + +++ IDA+FL Y+ SYP +HE TFR ++ + P
Sbjct: 411 ARVSQTGSPSL---LLHFANTAVLDN-LIDAYFLWYNRSYPILHESTFREKYRNRQRIPS 466
Query: 870 GTAWHILLNTILALGAWC------IGDDSSDLDITFYQEARGYLQQASVFETGNXXXXXX 1031
++WH++ +LA+G W +G DS +Y AR + + E+G+
Sbjct: 467 RSSWHLIFYLVLAIGHWVSTGGMELGQDS------YYMAARSRMSMR-MLESGSLAAVQA 519
Query: 1032 X---------------XXXSNYAQKRNKPNTGWNYLGLAVRMAMSLGLHKEFP-GWKISL 1163
NY QKR++PNTG+NY+G+A RMA+ LGLH+E P G
Sbjct: 520 FLLMVCPDLPIAGCVLTFQGNYLQKRDRPNTGYNYIGIAYRMALGLGLHREPPSGQTNDS 579
Query: 1164 LQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHEDELTPTTTALPDES 1343
L E RR +WW V+ FDSG + T GRP+ + +S ++ + RNI +D ++ + LP
Sbjct: 580 LFNERRRAVWWVVYCFDSGFSLTTGRPV-MASDSFIETRLPRNI-DDSISALDSPLPSPV 637
Query: 1344 LGPTIYSGLIAQARFHILTNSVYQRLISSP---SLTPEDTLKLQQPIEEWYNGLPSYLQH 1514
PT YS +IA R + N ++ +IS P S + + ++ W LPSY
Sbjct: 638 DRPTTYSAIIAHGRLVSIGNRIFSEIISCPYRQSFSFRAPRSIDHQLKAWRLSLPSYFTG 697
Query: 1515 PQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRWAPLSGMPGTPDA 1694
+P P P R + W+ NL +LL W S+R D
Sbjct: 698 QDIP----PWFRGP------RAVVGWKEQNLR------MLLWWGSQRLC--------NDN 733
Query: 1695 GTEDPCETECRVRCLQNARLTISSISEYMDNYI-CTRLG-AWYMLYFLFQAGLI------ 1850
+ E R C A TI I+ + +Y +G +WY YFLFQA ++
Sbjct: 734 VSMSAEREEARNMCHLAAVETIQDITTFCLDYADAVHVGLSWYATYFLFQASVVLSIHHL 793
Query: 1851 -PIVFFMTDPSNPEAATWLQDIETTKALLTHPSLG-TNRFATRCYEVINRL 1997
P+ + WL +E +A SLG TNR A RC V++R+
Sbjct: 794 KPVPQLDRGMAAVNRELWLSSVE--RARRCFASLGATNRAAFRCLAVLDRI 842
>ref|XP_401329.1| hypothetical protein UM03714.1 [Ustilago maydis
521]|gi|46099659|gb|EAK84892.1| hypothetical protein
UM03714.1 [Ustilago maydis 521]
Length = 988
Score = 169 bits (428), Expect = 3e-40
Identities = 124/415 (29%), Positives = 189/415 (44%), Gaps = 5/415 (1%)
Frame = +3
Query: 774 IDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTILALGAW--CIGDDSSDL 947
+++FF + SYP +H TFRAQ P AW +L+N + ALGA + DL
Sbjct: 330 LESFFEFHLPSYPILHPATFRAQLQGTVCPPASAAWPVLVNMVFALGALERRVSAQELDL 389
Query: 948 DITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMSLG 1127
D FY+ A+ L A +FE +NY QK+N W LG A+RMA+SLG
Sbjct: 390 DTPFYERAK-QLYSAVLFEKAEIISVQALTLMANYCQKKNHFAAAWMVLGSALRMAVSLG 448
Query: 1128 LHKE--FPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHE 1301
L+ E + RE RRLW+ +F ++ + RP L + D +NI E
Sbjct: 449 LYSESTLQARDMPAFDREFGRRLWFTLFTMEADTCVSMARPNALLSINA-DVAPPQNIDE 507
Query: 1302 DELTPTTTALPDESLGPTIYSGLIAQARFHI-LTNSVYQRLISSPSLTPEDTLKLQQPIE 1478
++PT+ LP E T+ S L A+F ++ + RL+ + + E+ E
Sbjct: 508 TAMSPTSEELPAEVSEATLSSTLAVHAKFASEISMPLQARLMRGTNPSIEEVRAFDLKTE 567
Query: 1479 EWYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRRW 1658
E+ + LP Y+ P P S ++ RL WR N +++ RP LL A
Sbjct: 568 EFVDRLPDYMAEGY-------AGPRPASFSVASARLRWRCNNFRMVMFRPFLLSNAVAAA 620
Query: 1659 APLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLFQ 1838
A S P P + C + A I SIS + D++ + AW+ +YFL Q
Sbjct: 621 AARSRGETRP---VLRPAVKQAIALCRRMASCNIRSISTFWDSHPHNQAMAWHAIYFLTQ 677
Query: 1839 AGLIPIVFFMTDPSNPEAATWLQDIETTKALLTHPSLGTNRFATRCYEVINRLLG 2003
+ L+P+V + +P+ EA W+ ++T LL T +C + I +L G
Sbjct: 678 SALVPLVSLLDEPTGQEAREWVSLLQTCVRLLVDMGHIT-PIGAKCKDAIEKLAG 731
Score = 45.1 bits (105), Expect = 0.008
Identities = 22/63 (34%), Positives = 32/63 (49%), Gaps = 2/63 (3%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQC--RQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETAL 359
C EC+ RK RC P C +C K C Y RSP+TRQ+++ +E +L + +
Sbjct: 63 CTECKSRKMRCDGGVPFCGRCITHSRTKSCAYVDPPKRSPITRQYVSSLEAQLEEAKKMI 122
Query: 360 GRL 368
L
Sbjct: 123 AEL 125
>ref|NP_985390.1| AFL160Cp [Eremothecium gossypii]|gi|44984248|gb|AAS53214.1| AFL160Cp
[Eremothecium gossypii]
Length = 648
Score = 169 bits (427), Expect = 4e-40
Identities = 153/573 (26%), Positives = 235/573 (40%), Gaps = 9/573 (1%)
Frame = +3
Query: 186 CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFETALGR 365
CD CR RK +CSK P C +CR+ N+ CLYSPK+ RSPLTR HLT VE RL E L
Sbjct: 9 CDSCRRRKMKCSKTFPKCAKCREDNRVCLYSPKIRRSPLTRAHLTEVETRLGQMEQLLRN 68
Query: 366 LFPGGDLDA-TLRSLLQNPEXXXXXXXXXXXXXXXXXXHXXXXXXXXXXXXXXXXDGFDW 542
F DLDA TL +P+ F W
Sbjct: 69 AFV--DLDADTLLKHRHDPQLREILFGRTVDIGALSRTDEYMASPLPPLRQAE----FQW 122
Query: 543 AEKEITLGDLTDGMAALSIKPEGAGYFGASSSVVPLRALFDHGFDLNMPVRSARSGGVPL 722
E+ DL G G + PL +L + R ++ + L
Sbjct: 123 YERR----DLLTGT-------------GFNMKQSPLHSLVAEAQQEGLMARPSKYERMQL 165
Query: 723 KAQLLESAPSGLIEQAFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTAWHILLNTI 902
+ Q ++ A+F ++H YP V E F + DPS P W L+N +
Sbjct: 166 EEQSTTLR--------YMQAYFEHFHWLYPVVDEHEFYLLYEDPSTAPDACLWTGLVNVV 217
Query: 903 LALGAWCIGDDSSDLDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTG 1082
LALGAWC G + + +Y +A ++ + TG+ ++Y + NT
Sbjct: 218 LALGAWCAGAPPA-AHVFYYDKAESHV-LGRMLRTGDRVLAIALLLMAHYNYMTHHRNTA 275
Query: 1083 WNYLGLAVRMAMSLGLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEE 1262
W LGLA ++A SLGLH++ G + QR + + LWWG++ + A GRP LP
Sbjct: 276 WMLLGLASQLATSLGLHRDLQG--LPPKQRTLAQILWWGIYSTTTQVALELGRPSPLP-- 331
Query: 1263 SVMDAKQVRNIHEDELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLT 1442
R H D P T +YS L + I QR++ P+ +
Sbjct: 332 --------RIGHTDVPVPDMT--------NPLYSHLANEVNLVI----QLQRIMHVPNPS 371
Query: 1443 PEDTLKLQQPIEEWYNGL--------PSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRN 1598
L+ Q ++++Y+ + SYL L + T+ P+ L + R+ W+
Sbjct: 372 VAVCLQWYQTVKDFYHNIHPGAGDDRDSYL------LPREATDEQPNRL-FCKYRITWK- 423
Query: 1599 WNLTILIHRPILLRWASRRWAPLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEY 1778
+++ ++ IL S M P+ T C CL+ I+SI+ +
Sbjct: 424 FHICVV---SILFN---------SIMGNVPNVDT-----NSCYSMCLKFTNDIINSIATF 466
Query: 1779 MDNYICTRLGAWYMLYFLFQAGLIPIVFFMTDP 1877
+ Y L +WY + ++ +A +P+ + P
Sbjct: 467 FNTYSLPHLLSWYSINYMIRASTVPLYCLNSPP 499
>sp|P08657|LAC9_KLULA Lactose regulatory protein LAC9|gi|173307|gb|AAA35266.1| LAC9
protein|gi|298015|emb|CAA29565.1| unnamed protein product
[Kluyveromyces lactis]
Length = 865
Score = 167 bits (422), Expect = 1e-39
Identities = 126/401 (31%), Positives = 196/401 (48%), Gaps = 6/401 (1%)
Frame = +3
Query: 768 AFIDAFFLNYHTSYPFVHEPTFRAQFNDPSIRPHGTA-WHILLNTILALGAWCIGDDSSD 944
A+IDA+F +YH YP V + F AQ+ND I+P WHILLN +LALG+WC SS
Sbjct: 387 AYIDAYFKHYHALYPLVSKEMFFAQYND-QIKPENVEIWHILLNAVLALGSWCSNSCSSH 445
Query: 945 LDITFYQEARGYLQQASVFETGNXXXXXXXXXXSNYAQKRNKPNTGWNYLGLAVRMAMSL 1124
+ +YQ A YL A V ETG+ ++Y QK +KPNT W+ +GL MA SL
Sbjct: 446 HTL-YYQNALSYLSTA-VLETGSTDLTIALILLTHYVQKMHKPNTAWSLIGLCSHMATSL 503
Query: 1125 GLHKEFPGWKISLLQREIRRRLWWGVFIFDSGAAKTFGRPILLPEESVMDAKQVRNIHED 1304
GLH++ P I +++RR LWW ++ + GRP LLP +D
Sbjct: 504 GLHRDLPNSTIH--DQQLRRVLWWTIYCTGCDLSLETGRPSLLPNLQAIDIP-------- 553
Query: 1305 ELTPTTTALPDESLGPTIYSGLIAQARFHILTNSVYQRLISSPSLTPEDTLKLQQPIEE- 1481
P ++A E P+IYS +I ++++ + + Q+ +S+ S QQ E
Sbjct: 554 --LPASSATIKE---PSIYSSIIQESQW----SQILQQKLSNNS--------YQQSAGEC 596
Query: 1482 --WYNGLPSYLQHPQMPLSVQPTNPDPDSLALVRNRLLWRNWNLTILIHRPILLRWASRR 1655
W++ + ++L H P + + + AL +L W L ++ RP + S
Sbjct: 597 LSWFDSVQAFLDHWPTP------STEAELKALNETQLDW----LPLVKFRPYWMFHCS-- 644
Query: 1656 WAPLSGMPGTPDAGTEDPCETECRVRCLQNARLTISSISEYMDNYICTRLGAWYMLYFLF 1835
L + DA T++ C+ CLQ + I S++ ++ +Y L WY ++L
Sbjct: 645 LISLFSVFFEEDAPTDNNV-IRCKELCLQLSSRNIFSVATFVRSYAFNSLSCWYATHYLV 703
Query: 1836 QAGLIPIVFFMTDPSNPEAATWLQDIETTKA--LLTHPSLG 1952
++ L+P+ F +P+ A W ET KA L H ++G
Sbjct: 704 RSALVPLHF--ASRISPQHALW----ETVKAQLLSAHEAMG 738
Score = 79.3 bits (194), Expect = 4e-13
Identities = 36/76 (47%), Positives = 49/76 (64%)
Frame = +3
Query: 174 LHV*CDECRLRKSRCSKERPVCLQCRQLNKECLYSPKVSRSPLTRQHLTYVEDRLHSFET 353
+H CD CR +K +CSK P C C + N +C+YSP+V R+PLTR HLT +E+R+ E
Sbjct: 91 MHQACDACRKKKWKCSKTVPTCTNCLKYNLDCVYSPQVVRTPLTRAHLTEMENRVAELEQ 150
Query: 354 ALGRLFPGGDLDATLR 401
L LFP D+D L+
Sbjct: 151 FLKELFPVWDIDRLLQ 166
Database: BlastDB/NCBI/blast/db/FASTA/2004_07_04_01_00_0/nr
Posted date: Jul 5, 2004 10:54 PM
Number of letters in database: 629,712,918
Number of sequences in database: 1,895,298
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 629,712,918
effective HSP length: 134
effective length of database: 375,742,986
effective search space used: 254002258536
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)