11
Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5 Valerie Ducros 1 , Mirjam Czjzek 1 *, Anne Belaich 2 , Christian Gaudin 2 , Henri-Pierre Fierobe 2 , Jean-Pierre Belaich 2 , Gideon J Davies 3 and Richard Haser 1 llnstitut de Biologie Structurale et Microbiologie, Laboratoire de Cristallographie et Cristallisation des Macromolecules Biologiques, URA 1296, CNRS, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France, 21nstitut de Biologie Structurale et Microbiologie, Laboratoire de Bioenergetique et Ingenierie des Proteines, UPR 9036, CNRS, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France and 30epa rtment of Chemistry, University of York, Heslington, York, Y01 500, UK Background: Cellulases are glycosyl hydrolases - enzymes that hydrolyze glycosidic bonds. They have been widely studied using biochemical and micro- biological techniques and have attracted industrial inter- est because of their potential in biomass conversion and in the paper and textile industries. Glycosyl hydrolases have lately been assigned to specific families on the basis of similarities in their amino acid sequences. The cellulase endoglucanase A produced by ClostridiulII cellulolytiCIIIII (CeiCCA) belongs to family 5. Results: We have determined the crystal structure of the catalytic domain of CeiCCA at a resolution of 2.4 A and refined it to 1.6 A. The structure was solved by the multiple isomorphous replacement method. The overall structural fold, (ex/13)8' belongs to the TIM barrel motif superfamily. The catalytic centre is located at the C-terininal ends of the 13 strands; the aromatic residues, forming the substrate-binding site, are arranged along a long cleft on the surface of the globular enzyme. Conclusions: Strictly conserved residues within family 5 are described with respect to their catalytic function. The proton donor, Glu170, and the nucleophile, Glu307, are localized on 13 strands IV and VII, respectively, and are separated by 5.5 A, as expected for enzymes which retain the configuration of the substrate's anomeric carbon. Structure determination of the catalytic domain of CeiCCA allows a comparison with related enzymes belonging to glycosyl hydrolase families 2, 10 and 17, which also display an (a/13)8 fold. Structure 15 September 1995, 3:939-949 Key words: a/13 barrel, Clostridium cellulolyticum, family 5 cellulase, glycosyl hydrolase, X-ray structure Introduction Cellulolytic microorganisms, both bacteria and fungi, produce a wide variety of cellulases and xylanases, enzymes that hydrolyze glycosidic bonds and which play an important role in biomass conversion. Cellulases catalyze the hydrolysis of 13-1,4-glycosidic bonds of cellulose. Fungal organisms and some aerobic bacteria produce these enzymes in a non-clustered manner and secrete them directly into the external medium. In some anaerobic bacteria, such as clostridia, however, enzymes are secreted in the form of highly structured cellulolytic complexes, known as cellulosomes [1]. In most cases, cellulases have, in addition to the indepen- dently functioning catalytic domain, one or more domains involved in substrate binding or multienzyme complex formation [2]. Glycosyl hydrolases have been classified into 45 families on the basis of amino acid sequence similarities that were detected by hydrophobic cluster analysis (HCA) [3]. The catalytic domains of cellulases and xylanases are found in 11 of these 45 fami- lies. The 11 families are also referred to as cellulase fami- lies A-K [4,5]. Within a given family, enzymes may exhibit different substrate specificities but all the available *Corresponding author. data are consistent with a common active-site topology and catalytic mechanism that, in each family, leads to either inversion or retention of the configuration of the anomeric carbon [1,6-9]. The cellulase endoglucanase A from Clostridilllll cellll- lolytiCllIII (CelCCA), has a catalytic domain belonging to family 5 (cellulase family A) and a C-terminal domain containing a duplicated segment [10-12], which appears, in clostridial enzymes, to be involved in attachment of catalytic subunits to the scaffolding proteins of the cellu- losome [I]. The catalytic domain of CelCCA has been isolated by gene truncation and expressed, to a high level, in Escherichia coli [10]. The truncated enzyme exhibits similar properties to the wild-type enzyme although it is fourfold less efficient in the hydrolysis of crystalline cellu- lose. The optimum temperature for hydrolysis is in the range 37-50°C and the optimum pH is 6.0. Catalysis by enzymes of family 5 proceeds with net retention of the anomeric configuration, as has been shown by 1H NMR of the reaction products [12, I 3]. HCA plots have led to the identification of five segments that are particularly well conserved in the catalytic core, © Current Biology Ltd ISSN 0969-2126 939

Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

Embed Size (px)

Citation preview

Page 1: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

r

Crystal structure of the catalytic domain of abacterial cellulase belonging to family 5

Valerie Ducros1, Mirjam Czjzek1*, Anne Belaich2,

Christian Gaudin2, Henri-Pierre Fierobe2, Jean-Pierre Belaich2,

Gideon J Davies3 and Richard Haser1

llnstitut de Biologie Structurale et Microbiologie, Laboratoire de Cristallographie et Cristallisation des Macromolecules Biologiques,URA 1296, CNRS, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France, 21nstitut de Biologie Structurale et Microbiologie,Laboratoire de Bioenergetique et Ingenierie des Proteines, UPR 9036, CNRS, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20,

France and 30epartment of Chemistry, University of York, Heslington, York, Y01 500, UK

Background: Cellulases are glycosyl hydrolases ­enzymes that hydrolyze glycosidic bonds. They havebeen widely studied using biochemical and micro­biological techniques and have attracted industrial inter­est because of their potential in biomass conversion andin the paper and textile industries. Glycosyl hydrolaseshave lately been assigned to specific families on the basisof similarities in their amino acid sequences. Thecellulase endoglucanase A produced by ClostridiulIIcellulolytiCIIIII (CeiCCA) belongs to family 5.Results: We have determined the crystal structureof the catalytic domain of CeiCCA at a resolution of2.4 A and refined it to 1.6 A. The structure was solvedby the multiple isomorphous replacement method.The overall structural fold, (ex/13)8' belongs to the

TIM barrel motif superfamily. The catalytic centre islocated at the C-terininal ends of the 13 strands; thearomatic residues, forming the substrate-binding site,are arranged along a long cleft on the surface of theglobular enzyme.Conclusions: Strictly conserved residues within family5 are described with respect to their catalytic function.The proton donor, Glu170, and the nucleophile,Glu307, are localized on 13 strands IV and VII,respectively, and are separated by 5.5 A, as expected forenzymes which retain the configuration of thesubstrate's anomeric carbon. Structure determination ofthe catalytic domain of CeiCCA allows a comparisonwith related enzymes belonging to glycosyl hydrolasefamilies 2, 10 and 17, which also display an (a/13)8 fold.

Structure 15 September 1995, 3:939-949Key words: a/13 barrel, Clostridium cellulolyticum, family 5 cellulase, glycosyl hydrolase, X-ray structure

IntroductionCellulolytic microorganisms, both bacteria and fungi,produce a wide variety of cellulases and xylanases,enzymes that hydrolyze glycosidic bonds and which playan important role in biomass conversion. Cellulasescatalyze the hydrolysis of 13-1,4-glycosidic bonds ofcellulose. Fungal organisms and some aerobic bacteriaproduce these enzymes in a non-clustered manner andsecrete them directly into the external medium. In someanaerobic bacteria, such as clostridia, however, enzymesare secreted in the form of highly structured cellulolyticcomplexes, known as cellulosomes [1].

In most cases, cellulases have, in addition to the indepen­dently functioning catalytic domain, one or moredomains involved in substrate binding or multienzymecomplex formation [2]. Glycosyl hydrolases have beenclassified into 45 families on the basis of amino acidsequence similarities that were detected by hydrophobiccluster analysis (HCA) [3]. The catalytic domains ofcellulases and xylanases are found in 11 of these 45 fami­lies. The 11 families are also referred to as cellulase fami­lies A-K [4,5]. Within a given family, enzymes mayexhibit different substrate specificities but all the available

*Corresponding author.

data are consistent with a common active-site topologyand catalytic mechanism that, in each family, leads toeither inversion or retention of the configuration of theanomeric carbon [1,6-9].

The cellulase endoglucanase A from Clostridilllll cellll­lolytiCllIII (CelCCA), has a catalytic domain belonging tofamily 5 (cellulase family A) and a C-terminal domaincontaining a duplicated segment [10-12], which appears,in clostridial enzymes, to be involved in attachment ofcatalytic subunits to the scaffolding proteins of the cellu­losome [I]. The catalytic domain of CelCCA has beenisolated by gene truncation and expressed, to a high level,in Escherichia coli [10]. The truncated enzyme exhibitssimilar properties to the wild-type enzyme although it isfourfold less efficient in the hydrolysis of crystalline cellu­lose. The optimum temperature for hydrolysis is in therange 37-50°C and the optimum pH is 6.0. Catalysis byenzymes of family 5 proceeds with net retention of theanomeric configuration, as has been shown by 1H NMRof the reaction products [12, I 3].

HCA plots have led to the identification of five segmentsthat are particularly well conserved in the catalytic core,

© Current Biology Ltd ISSN 0969-2126

Supplied by The British Library - "The world's knowledge"

939

Page 2: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

940 Structure 1995, Vol 3 No 9

including seven strictly conserved residues [4]. Further­more, family 5 enzymes have been subdivided into sixsubgroups on the basis of further conserved residues;CelCCA is classified in subfamily 4 (referred to as family5-4) [14]. Extended biochemical studies and experimentsusing site-directed mutagenesis have been performed inorder to identify the residues involved in the catalyticreaction [11,15-19]. The results of these studies are con­sistent with there being one strictly conserved glutamate(corresponding to Glu307 in CelCCA) acting as thecatalytic nucleophile [18-20] and it has been proposedthat a second glutamate (Glu170) is the proton donor[15,19]. The importance of other conserved residues hasbeen shown, but there is no conclusive evidence of a rolefor them in catalysis. Although family 5 is the familycontaining the greatest number of cellulases, accountingfor 40% of all known sequences, there was, until recently,no detailed structural information about any member ofthis family. It has, however, been suggested that enzymesin family 5 will share both the same general (<</[3)8topology and a similar disposition of catalytic residues asthe structures of glycosyl hydrolases from families 2, 10and 17, whose structures are already known [21,22].

Three-dimensional (3D) structures have been determinedfor the catalytic domains of cellulases (cellobiohydrolases[also named exoglucanases] and endo- glucanases) andxylanases belonging to six different families: a cellobiohy­drolase II (CBHII) from Tric1loderma reesei [23] andendoglucanase E2 from 77lermomollospora fllsea [24] (bothbelonging to family 6); the endoglucanase CelD fromClostridillm thermocellllm (family 9) [25]; the endoglucanaseV from Hllmicola illsolens (family 45) [26]; the cellobiohy­drolase I (CBHI) from T. reesei (family 7) [27]; thexylanases from Bacilllls eircllialls [28], Trichoderma harzialllllll[29] and T. reesei [30] (all from family 11); a bifunctional[3-1,4-xylanase/glucanase Cex from Celllliomollas jimi[31]; and the xylanases from Pselldomollas jlllorescells [32]and Streptomyces lividalls [33] (all from family 10).

We report the high-resolution 3D structure of the cat­alytic domain of CelCCA from C. celllllolytiCllm. This

Table 1. Diffraction data and phasing statistics.

Diffraction data

structure determination provides an opportunity toobtain detailed structural information that can be used inthe interpretation of the mutational and biochemicalanalyses of enzymes from family 5.

Results and discussionThe crystal structure of the catalytic domain of CelCCAwas determined by multiple isomorphous replacement(MIR) at. 2.4 A resolution and has been refined againstsynchrotron diffraction data to ·1.6 A spacings (Tables1,2). The final (2Fo-Fc) map contoured at leT showed nodiscontinuity in the electron density and the side chainsare very well defined (Fig. 1).

Overall structureCelCCA is a globular protein of approximate dimensions50x60x60 A3 and has the architecture of the classiceightfold «/[3 barrel motif [34], a tertiary structurewhich has been observed for a number of enzymes dis­playing diverse catalytic functions [35,36]. Recently, thisfold has also been found in three [3-1,4-glycanases offamily 10 [31-33], the [3-1,3 and [3-1,3:1,4-glycanasesfrom family 17 [37] and in the catalytic domain of theE. coli [3-galactosidase from family 2 [38]. Furthermore,the structures of the two cellulases from family 6, a cel­lobiohydrolase and an endoglucanase, were described tohave an «/[3 barrel fold, but the eighth [3 strand is notlocated in the centre of the barrel, and they also displayother irregularities compared with the classical «/[3barrel motif [23,24].

The «/[3 barrel motif of the catalytic domain ofCelCCA, encompassing residues 1-380, is illustrated as aribbon diagram in Figure 2. Table 3 gives an assignmentof the residues involved in a helices and [3 strands. Theselection criteria used were those described by Kabschand Sander (DSSP program) [39]. A topological diagramshowing the arrangement of the secondary structuralelements is presented in Figure 3. The structure containsseven cysteines [10], none of which are involved indisulphide bonds.

Phasing statistics

Crystal Resolution >. (A)* No. of No. of Percent R-symt Derivative R-deriv* R-Cullis§ (%) Phasing(A) observations unique complete (%) (%) power#

reflections Centric Acentric Anomalous

Native 1.6 0.998/0.87* 35 5167 59967 98 7.2HgC12-1 2.5 1.54 57178 13946 88 4.1 HgC1 2-1 18.2 55 60 0.96 2.17HgC1 2-2 2.45 0.998 90871 17197 97 5.3 HgC1 2-2 24.5 44 48 0.68 2.94U02Ac2 2.3 1.54 55970 16462 89 3.4 U02Ac2 11.5 67 69 0.69 1.61

*>'=0.998 A on beamline X31; >'=0.87 A on beamline BW7B (see also the Materials and methods section).tR-sym=Ihk,1 Ij-<I> l/2hkl<l> where Ii is the scaled intensity of the ith observation and <I> is the mean intensity for that reflection.*R-deriv=I IFp~Fp 1/21 FpI, the mean relative isomorphous difference between the native protein (Fp) and the derivative (FpH) data.§R-Cullis=I II FpH±Fp I-FH(calc) 1/21 FpH-FpI where FH is the calculated heavy-atom structure factor contribution. #The phasingpower is the mean value of the heavy atom structure amplitude divided by the lack of closure error.

Supplied by The British Library - "The world's knowledge"

Page 3: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

A 5 bacterial cellul,ase Ducros et at.

*R factor=ll F0 I-I Fe I III F0 i, where !F0 i and IFe Irepresent the observed and the calculated structure factorampIitudes. respectively.

CelCCA has one additional helix at the N-terminalextremity, external to the TIM barrel motif. UrJike typi­cal alp barrel folds, 13 strands V and VI in CelCCA areconnected by a long loop containing only one helicalturn, not by an extended helix. The same feature wasrecently encountered in the TIM barrel domain of13-galactosidase from E. coli [38]. This results in the for­mation of a large cleft, constructed from the C-terminalends of the barrel, and running across the surface of themolecule, where it is exposed to the solvent. The activesites of 13 barrels are invariably found at the C-terminal

The cross-section of the 13 barrel core of CelCCA is ellip­tical, as described for the majority of triosephosphateisomerase (TIM) barrels [35], with the major axis locatednear 13 strand II. The core is hydrophobic and arranged inthree layers, which is also comparable with the TIM bar­rel [34]. The top layer is composed of the hydrophobicresidues Leu29, Tyr116, Tyr218 and Pro302, located onalternate 13 strands (I, III, V and VII), the side chainspointing inside the barrel. The intermediate and bottomlayers are composed of residues Ile164, Ile250, Leu336and HellS, Met220, Ile304 and Ile338, respectively.

\Ve compared the structure of CelCCA with those ofsome retaining and some inverting hydrolasesfrom families 10, 17 and 6 the all3 barrelmotif) and with TIM, which contains the first observed(a/[3)s,fold (Tabl~ The~ , was carrie~ o~tusmg tne Ca bacKbone at the structural foldS,overlapping the first 13 strand in CelCCA with the first13 strand in each of the other structures, the second withthe second, and so on (see the Materials and methodssection). The best superposition is observed within theeight-stranded 13 barrel and the largest deviations arefound in the loops and in some of the external helices.

ends of this of motif and therefore is alsothe location of CeICCA.

rurth,er:mc)re, the location of the Glu307 atthe C-terminal end of strand of the barrel confirmsthis As illustrated in 2, the C terminusof the on the side to the activesite, comes to lie the N termini of the 13 strands. Itis likely, therefore, that the domain of the dupli­cated segment will not obstruct the active site in theintact enzyme.

The greatest resemblance is found to the bifunctional13-1,4-xylanase/cellulase Cex and the 13-1,3:1,4­glucanase Ell, belonging to families 10 and 17, respec­tively. In fact, the eight residues suggested to be involvedin catalysis by the results of previous site-directed muta­genesis studies - His122, His123 (Trp84 in Cex),His254 (Gln203 in Cex), Glu170, Glu307, Tyr256(His205 in Cex), Asn169 and Trp340, where residuenumbering refers to that in CelCCA) - come to liewithin 1 A of the corresponding residues in the catalyticcentre of Cex. The disposition of particular catalyticresidues in these three families' will be discussed in thesection 'catalytic residues'.

The (a/[3)8 motif of CelCCA is more similar to theclassical TIM barrel than to the structures of both theendo- and exoglucanases belonging to family 6 (CBHIIand E2), which are not regular TIM folds. Moreover, incontrast to glycosyl hydrolase families 5, 10 and 17,CBHII and E2 are inverting enzymes. The orientation

F>O20.1%5990123.6%5983

7-1.6 A3035385

540.7%

F>2u19.1%5391323.15%

5380

0.0071\1.700

17.35 1\2Arg30

Table 2. Model refinement.

Final R factorNo. of reflections

Free R factor [53JNo. of reflections

Root mean square deviation in:Bond lengthsBond angies

Average B factorRamachandran outl iers

Resol ution range,'10. of non-hydrogen atoms,'10. of water moleculesDiscretely disordered residuesInItIa' R factor*

Fig. 1. Quality of the electron-densitymap in the region of othe catalytic site.(a) MIR map at 2.5 A resolution withthe model. (b) Final (2Fo-Fc) mapat .6 resolution. The map is con-toured at 'I u.

Supplied by The British Library - "The world's knowledge"

Page 4: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

942 Structure 1995, Vol 3 No 9

Table 3. 13 strands and a helices in CeICCA.

Fig. 2. The eightfold alp barrel of thecatalytic domain of the endoglucanaseA from C. cel/ulolyticum. (a) Ribbonrepresentation, viewed from the side, ofthe barrel showing the cleft formed bythe loops at the C-terminal extremity ofthe barrel. B strands are shown asgreen arrows' and the 0: helices as violetspirals. One additional 0: helix islocated at the N-terminal extremity andthis is coloured red. (Figure generatedusing TURBO-FRODO [50].) (b) StereoCa trace of CelCCA viewed along thebarrel axis.

the large cleft on the protein surface and located on theloops interconnecting the 13 strands, (Fig. 5). This cleft,approximately 10 A deep and 30 A wide, is similar tothose observed in other endoglycanases. The side chainsof Trp180 and Trp259 are arranged such that they areopposite and parallel to each other at the entrance of thecleft (Fig. Tryptophan residues are known to partici­pate in the binding of sugars through aromatic stackinginteractions with the glucopyranosyl rings [40]. A numberof aromatic residues, such as Trp57, Trp340, Phe42 andPhe352, are located on one side of the active site whilst

180, Trp181, Trp259, Trp287 and Tyr256 lie on theopposite side (Fig. 5). These residues line up to form ahydrophobic surface within the active-site cleft. The dis­tribution of tryptophans, located alternately on either sideof this cleft, seems well adapted for interactions with thenatural substrate, cellulose, as it is consistent with thearrangement of the hydrophobic faces of the pyranose

in the substrate [41]. The binding cleft is longenough to accommodate at least five glucosyl units. Thisobservation is in agreement with the results obtainedFierobe et al. [12], who determined the enzymatic para­meters of the catalytic domain of CelCCA on natural cel­lodextrins and p-nitrophenyl-cellodextrins. Their resultsshowed that both the affinity for the substrate and the rateof hydrolysis increase with the chain of cellodex­trin, at least up to a length of six glucosyl units.

Asp21-Leu29Thr63-Lys73Asp97-Asp112Met138-Arg155Ser187-Ala209Pro228-Thr232Ser279-Lys295Lys316-Ala332Pro365-Tyr374

aO0:10:20:3a4asa6a7a8

Gly32-Asn34Thr77-lle80Tyr116-Asn120Leu163-Glu166Leu219-Glu222Ile249-Va1253Va1303-Cys308Leu336-Glu339

131132133134135136137138

The active siteThe seven strictly conserved residues within family 5 ­Arg79, His122, Asn169, Glu170, His254, Tyr256 andGlu307 - are found in a spatial arrangement in whichthe residues are in close proximity to each other (Fig.They are all on the same side of the 13 barrel, situated in

of the substrate-binding sites in CBHII and E2 is rotatedby about 90° around the barrel axis with respect to thelarge cleft in CelCCA.

At present, there are no coordinates available to allowstructural comparisons with another member of family 5,although the structure of CelC from Clostridium thermocel­111m, also from this family, has recently been determinedby Alzari and co-workers (see note added in proof).

Supplied by The British Library - "The world's knowledge"

Page 5: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

A family 5 bacterial cellulase Ducros et 943

Fig. 3. Topological diagram of the sec­ondary structural elements of CeICCA.u helices are shown as cylinders and13 strands as arrows. The assignment ofthe secondary structure was carried outwith the program DSSP [39].

Table 4. Structural similarity of CelCCA with other TIM barrel motifs.

Total no. of Protein Family POB entry Ref. Ca atoms Rms deviation Ca atoms Rms deviationCn atoms (3 A) (2 A)

248 TIM lTIM [34] 153 2.17 64 1.46365 CBHII 6 3CBH [23J 69 2.19 63 .52286 E2 6 lTMl [24J 65 2. 3 70 ~~.::u312 Cex 10 2EXO 131 ] 212 1.95 124 1.43306 Ell 17 lGHR [37] 213 1.89 126 .25

(

Superposition of Cn positions was achieved in two steps permitting maximal deviations between corresponding atoms of 3 A at firstand then 2 A in a second step.

Fig. 4. A close-up view of the catalyticsite at the C-terminal extremity of thebarrel strands showing the arrangementof the seven strictly conserved residues(red and along with the trypto­phans (purple) at the entrance. The cat­alytic residues are in red.

Catalytic residuesThe cellulases of family 5 hydrolyze the glycosidic bondwith retention of the anomeric configuration, implyingthe participation of two acids at thesite. On the basis of sequence alignments and HCA pre­dictions, "vo glutamate residues are found to beconserved in 5. One is clearly identified as the

active-site :1Uc!eopnlJe by specific labelling [4,18-20],and site-specific mutant proteins led to the suggestionthat the other is the catalytic proton donor by demon-

that the replacement of the two con-served glutamates causes a dramatic loss m enzymeactivity [15-17, . In CelCCA to theresidues Glu307 and Giu 170, respectively.

Supplied by The British Library - "The world's knowledge"

Page 6: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

944 Structure 995, Vol 3 No 9

Fig. 5. Top view of the groove running along the surface of the(a/pis barrel showing the distribution of the aromatic residues(purple), which form the substrate-binding site, on alternate sidesof the groove. The two catalytic glutamates are shown blue.

Examination of the CelCCA 3D structure also supportsthe proposal that Glu170, located at the end of strand IV,is the proton donor. The separation between the carboxyoxygens of Glu170 and Glu307 (taken as the average ofthe possible distances between the four oxygens) is 5.5 A.This distance, observed in previously determined 3Dstructures of retaining f3-g1ycanases, allows the formationof a glycosyl-enzyme intermediate, situated between thetwo carboxy groups [9]. In contrast, for invertingenzymes a greater separation (9-9.5 is required for theinsertion, between the carboxy groups, of the nucle­ophilic water molecule involved in the catalytic reaction

. Furthermore, the structural comparison with Cexshowed that Glul70 and Glu307 superpose very wellwith the Cex proton donor (Glul27) and nucleophile(Glu233), respectively (Fig. 6).

Glu170 and Glu307 are located on opposite sides of theactive-site cleft, at the C termini of f3 strands IV and VII,respectively (Figs 4,5). This appears to be the samearrangement as was observed for other retaining l3-gly­canases with the alp barrel motif, such as C. jimixylanaselglucanase [31] P J1uorescens xylanase [32], S. livi­dam xylanase A [33] and the catalytic domain of E. colil3-galactosidase [38], as recently predicted by Henrissat etat. [21] and Jenkins et al. [22].

The environment of the two glutamates and theirinvolvement in hydrogen-bonding networks are conSIS­tent with the roles proposed for these residues on thebasis of results from site-directed mutagenesis studies(Fig. 7) [1 , 5-20]. A charged state for Glu307 isfavoured by a salt bridge with Arg79 and, in addition, theconserved Tyr256 is sufficiently close for it to form ahydrogen bond with Glu307 which may stabilize thecharge on the Glu307 carboxylate group. Furthermore,Glu307 is maintained in a specific position, with respect

to the active site, by strong geometric constraints: His254and Trp340 form a hydrophobic, sandwich-like stacking,with the side chain situated between theof the aromatic rings.

The mayimal activity ofCelCCA occurs around pH 6.0.At acidic pH, Glul70, which will act as a proton donorin the acid-base catalytic mechanism, should have a pK• . avalue higher than that of the catalytic nuc1eophile. Thecombination a histidine and an asparagine in theenvironment of Glu170 contributes to raising the pK

avalue of this residue. Indeed, Glu170 forms a hydrogenbond with the No 1 atom of His254 and forms a hydro­gen bond, via a water molecule, with Asn169 (Fig. 7).Examination of the more immediate environments of theproton donor residues within 5 revealed a con-served sequence Asn-Glu-Pro . The presence ofPro171 (this residue is strictly conserved in subfamily5-4) maintains the neighbouring Glu170 backbone in anorientation such that the Glu 170 side chain pointstowards the nucleophilic glutamate. Interestingly, theC. thermocellum Ce1C does not have a proline at this posi­tion and this results in an orientation which points theequivalent catalytic glutamate away from the active site(R Dominguez, personal communication). In Ce1CCA,the orientation of Glu170 is further constrained by thepresence of the hydrogen bond between its main-chaincarbonyl and the main-chain amide group of Argl72.

There are several tightly bonded and well ordered watermolecules in the catalytic site, which may have a func­tional role, in particular \Vat449, which is hydrogenbonded to Glu170 and Glu307 (see Table 5). There is anordered solvent structure element, situated below the tv\'O

catalytic residues, in which five water molecules form apentagonal ring, as previously observed in crambin [42].

The comparison of the structures of CelCCA and EII(family 17) provides two important insights (Fig. 8). Theproposed nucleophile in Ell, Glu232, overlaps perfectlywith Glu307 in CelCCA. Glu288, however, which hasbeen proposed as the Ell proton donor [37], does notcoincide with Glu170 in CelCCA, but comes to lie doseto another glutamate, Glu350, which is not directly situ­ated in the catalytic centre. In fact, it is Glu93 in Ellwhich superposes perfectly well with Glu170 in Cel­CCA. Glu93 is located on ~ strand IV at the expecteddistance of about 5 A from the nucleophile, as haspreviously been pointed out by Henrissat et al. [21] andJenkins et al. [22].

Other conserved residuesThe other residues strictly conserved throughout family 5- Arg79, His122 and His254 - are located at the C ter­minals of the J3 strands II, III and VI respectively (Fig. 7).

His254 is involved in forming a hydrogen bond withGIu170 and therefore seems to be implicated in thecatalytic reaction. Indeed, this position is found to becompletely intolerant to amino acid substitutions, in the

Supplied by The British Library - "The world's knowledge"

Page 7: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

A family 5 bacterial cellulase Ducras et al. 945

Fig. fl. Stereo superposition of the cat­alytic centre of CelCCA (blue) Cex

showing conserved residues in5 and 10, respectively. The cat­glutamates CeICCA, Clul70

and Glu307, overlap with those in Cex,Glu233 and Glu127. Moreover, the pro­ton donor in each structure is found inthe same conserved motif (Asn-Glul.Hisl22 and Hisl23 in CelCCA occupysimilar positions in the catalytic site toHis80 and Trp84 in Cex.

Fig. 7. Stereo diagram showing the envi­ronment of the catalytic residues thatform the active site. The probablehydrogen bonds are shown by dashedlines. The corresponding bond distancesare given in Table 5. The water mol­ecule is shown as a filled circle.

case of the catalytic activity of cellulase EGZ of Erwiniachrysanthemi [19].

The side chain of Arg79 is buried in the catalytic centreand is stabilized by hydrogen bonds with Glu166 andAsn34. Furthermore, Arg79 forms a salt bridge withGlu307 (2.54 A in length) and therefore seems to beinvolved in the catalytic activity. This arrangement wascorrectly proposed by Bartoli-German et al. [19] on thebasis of site-directed mutagenesis studies. In CelCCA,however, it has been shown that a lysine residue can sub­stitute for arginine, yielding a protein exhibiting 20% ofthe activity compared to the wild type [11] and, further­more, other substitutions of this residue in family 5members led to enzymes displaying a residual activity oforJy 5% [17 These results indicate that Arg79 isdoubly important: its charge is necessary for the forma­tion of the salt bridge to the nucleophilic glutamate,whilst the supplementary amine groups, not present inlysine, allow the formation of a hydrogen-bonding net­work and maintain the structure of the active site.

His122, also in the active site (at about 4 Afrom the closest catalytic glutamate, Glu307), is notinvolved in hydrogen bonding with either Glu170 orGlu307. Mutations of His122, however, led to a drasticdecrease in was less (12%activity) on substitution with [11,17,

In addition, the behaviour of the mutant was dependenton the nature of the substrate (p-nitropheny1cellobiosideagainst carboxymethylcellulose) [11,19]. The presence ofa ring in both histidine and phenylalanine side chains, therings being able to stack against sugar molecules, and theparticular orientation of His122 with respect to the activesite, support the proposal, by Bortoli-German et ai. [19],that His122 is involved in substrate binding. The activitydependence of the mutant on the nature of the substrate,however, seems to indicate that stacking effects are notthe only interactions of this residue with the substrate. Itis also noteworthy that, in the xylanase/glucanase Cex, ahistidine (His80) is found in the same place, as could beseen from the superposition of the two structures, andthis histidine was also expected to be involved in recogni­tion of substrate molecules within the active site (Fig. 6)[31]. Further investigation of the environment aroundHis122 reveals that the ring planes of His123 and His122are perpendicular to each other. In Cex, a tryptophan,Trp84, takes the place of His123, having the same rela­tive orientation of the ring planes with respect to thecatalytic glutamates. Mutations of His123 also affectenzyme activity [11]. However, of the roleof these two histidines is di±1icult in the absence of a sub­strate in the catalytic site.

For the cellulases and xylanases whose structures havebeen determined, relationships in their tertiary folding

Supplied by The British Library - "The world's knowledge"

Page 8: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

946 Structure 1995, Vol 3 No 9

I Table,5. Hydrogen bonds and salt bridges in the catalytic siteloICeICCA.

Atom 1

NTjl,Arg79N'fl ,Arg79NTll,Arg79NT)2,Arg79NE,Arg79N'll2,Arg79

NE2,Hisl22NE2,Hisl22

No2,Asn169N82,Asn'169001,Asn169N02,Asn169

Position

p2

f33a3

End of

134

Atom 2

N82,Asn120OE1,Glu166OE2,Glu166051,Asn34081,Asn34OE2,Glu307

No2,Asn169081,Asn169

N,Clu170NE2,Hisl22NE2,His122O,Wat449

Position

131

137

End ofp4

End of 134p3a3

3.343.102.833. 9') 7'L..,o

2.53

3.293.32

3.403.293.322.88

OE2,Glu170 End of No1,His254 p6 2.84O,Glu170 134 N,Argl72 134a4 3.24O,Clu170 NT) 1,Arg'l 72 134a4 2.99OE2,Glu170 O,Wat449 2.79OE1,Clu170 O,Wat487 2.82

No1,His254 136 OE2,Clu170 End of 134 2.84

I01l,Tyr256 p6a6 OE1,Clu307 137 2.660,Tyr256 N.Trp259 136a6 3.21

i

IOE1,Glu307 p7 01l,Tyr256 p6a6 2.660,Glu307 N,Ala255 p6a6 3.17

I

OE2,Glu307 NTl2,Arg79 132 2.54OE1,Glu307 0,Wat449 3.04

can be identified, even though they have low sequencesimilarities. Three of the seven enzymes with knownstructures exhibit variations of the (a/13)g barrel motif,the enzymes belonging to families 5 (A), 6 (B) and 10 (F)[23,24,31-33]. Other folds, that contain mainly 13 sheetmotifs, have been observed in families 7 (c), 11 (G) and45 (K) [26-30] and an a/a barrel was observed for theonly representative of family 9 (E) [25]. The existence ofa common topology and active site for different glycosylhydrolase families has previously been demonstrated forthe enzymes from families 7 and 16, which both sharethe plant legume lectin topology and have similarcatalytic centres [27]. These observations raise thequestion of evolutionary relationship, which has beenwidely discussed for the a/13 barrel fold, as 10% of theknown protein structures display this structural fold[35,43]. Our results underline the similarities that existbetween the enzymes of families 5, 10 and 17, whichhave already been stated by Henrissat et al. [21] andJenkins et al. [22] on the basis of HCA, sequencealignments and structural comparisons. Enzymes fromthese three families share the same structural fold and asimilar spatial arrangement of their catalytic residues,despite their rather different substrate specificities, adiversification resulting from evolutionary events.

Fig. 8. Superposition of the catalytic centres of CelCCA and Ell.The Ca traces and the residues are blue for CelCCA and yellowfor Ell. Glu232 is the nucleophile in Ell and it overlaps very wellwith Glu307. Glu288 (Ell) overlaps with Glu350 (CeICCAl, bothbeing about 9 Afrom their respective nucleophiles. Glu93, in Eil,is separated by 5 Afrom Glu232, and is located at the same posi­tion as Glu170. It is therefore likely to be the Ell proton donor.

Biological implicationsCellulose is the Inost abundant cOInponent ofplant cell walls. During the last few years, thepotential use of cellulases in various biotechnologyprocesses has Inade theIn the focus of nUInerousbiocheInical, genetic and physiological studies.Endoglucanase A is a cOInponent of the cellu­lolytic cOInplex of Clostridium cellulolyticum, aInesophilic anaerobic bacteriuIn able to grow oncrystalline cellulose and xylan. This enzYIne, alsoknown as CeICCA, has a catalytic dOInain belong­ing to faInily 5 of the glycosyl hydrolases, and cat­alyzes the cleavage of 13-1,4-g1ycosidic bonds witha InechanisIn that involves retention of theanoIneric configuration.

The three-dimensional (3D) structure of CelCCAprovides a structural basis for explaining the wealthof biochemical inforInation that is available forfamily 5 enzymes. CelCCA has the (Cl/I3)g topol­ogy first observed in triosephosphate isoInerase.The catalytic site is situated in a long groove at theC-terIninal extremity of the 13 barrel and the twocatalytic glutaInates are located on either side ofthe cleft, being separated by a distance of 5.5 A,consistent with a retaining Inechanism. The pre­sent biological knowledge about these enzyInes,together with structural cOInparisons, allows a 3Dstructural of the involvement of individualresidues in the catalytic process.

The thermostability of the cellulases is of Inajorindustrial iInportance. Comparative studies show­ing the high degree of sequence siInilarity with

Supplied by The British Library - "The world's knowledge"

Page 9: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

thermostable enzymes, such as CelE fromClostridium thermocellum, belonging to the samefamily, should allow improvement of the ther­mostability of the mesophilic enzyme CelCCA bysite-directed mutagenesis.

The. structure of CelCCA shows strong similarity .to those observed in glycosyl hydrolase families 10and 17 and has a similar arrangement of catalyticresidues. Interesting evolutionary questions arisewhen analyzing these enzymes, which show a lowlevel of sequence similarity and different substratespecificities but display related folds and probablyact by the same catalytic mechanism, involvingidentical residues.

Materials and methodsCrystallizationPurification and crystallization of the native CelCCA catalyticdomain have been described elsewhere [44]. The solvent con­tent in the crystals was calculated to be 53%, according toMatthews [45]. The space group of the crystals is P2J2)2), withcell dimensions a=52.4 A, b=76.2 Aand c=113.5 A. There isone molecule per asymmetric unit.

Data collection and processingMercury and uranyl derivative crystals were prepared by con­ventional soaking experiments. The data sets for a first mercuryderivative and an uranyl derivative crystal were measured withCuKa X-rays using a Rigaku RU-200 rotating anode, operat­ing at 40 kV and 80 rnA, with a graphite monochromator.Data were collected with a MARresearch imaging plate detec­tor at 15°C. Integration of the data sets was performed usingXDS [46], all further processing was done with programs fromthe CCP4 package [47].

Diffraction data for the native enzyme and the data for asecond mercury derivative crystal were collected with synchro­tron radiation, on beamline X31 (>"=0.998 A) and the high res­olution native data were collected on wiggler beamline BW7B(>"=0.870 A) at the EMBL Hamburg outstation. The wave­length of 0.998 A was chosen so as to optimize the f" compo­nent of the anomalous scattering from the mercury derivative.These data were integrated with the program DENZO [48]and further processing.was carried out, together with the other

~w

A family 5 bacterial cellulase Ducros et al. 947

data sets, with programs from the CCP4 package. The programAGROVATA (from the CCP4 package) was used to analyzethe data and to perform the data reduction. The statistics onthe data sets are given in Table 1.

Phase calculationThe positions of the mercury atoms, for the data set collectedwith a 0.998 A wavelength, were established by manual inspec­tion of the anomalous difference Patterson and confirmed byanalysis of isomorphous difference Patterson maps. Interpretationwas problematic as the majority of significant Harker sectionpeaks come from cross-vectors between different heavy-metalsites (Fig. 9). Major sites of mercury and uranyl atoms, for thedata collected with conventional X-radiation, were also locatedby the analysis of isomorphous difference Patterson maps, andminor sites were found in difference Fourier maps. Heavy-atomparameters were refined using the program MLPHARE [47].Anomalous scattering data from all the derivatives \\-ere includedin the phasing calculation and were also used to detennine thecorrect enantiomorph. The two mercury derivatives had fourheavy-atom 'sites in common, but showed significant differencesin the occupation of the sites. The second mercury derivativealso had two supplementary sites. The mean figure of merit foracentric/centric phases to 2.44 A was 0.695/0.795.

Model building and refinementThe 2.44 A multiple isomorphous replacement anomalousscattering (MIRAS) map showed a clear definition of solventand protein and no phase refinement was performed. The mapwas skeletonized using the graphics program 0 [49], and thea/~ barrel structure was immediately apparent in the skele­tonized map. The main chain and side chains were constructedwith the program 0, on the basis of the sequence, and usingthe aromatic residues as starting points. The final model build­ing and the fitting of the side chains to the density wereperformed using the graphics program TURBO-FRODO[50]. The R factor for the initial model was 40.7% and wasreduced to 24.9% after one cycle of simulated annealing withX-PLOR [51]. Bond lengths, angles and other parameters usedto define standard geometry restraints were from the set refer­enced by Engh and Huber [52]. 10% of the data were set asidefor cross-validation analysis [53]. This first refinement cyclewas followed by alternate graphical inspection and inclusion ofwater molecules and simulated annealing cycles. The finalrefinement parameters are given in Table 2.

Quality of the final model structureThe final 1.6 Anative data merged from 355167 observationsof 59967 unique reflections, constituting a mean multiplicity

Itl

Fig. 9. Harker section V=1/2 of theanomalous Patterson for CelCCA usingthe A=0.998 data. The Patterson mapwas calculated with all data between3.0 Aand loA. The labelling indicatesthe calculated positions of both the self­vectors (labelled Hl and those thatoccur as a result of cross-vectorsbetween the six different heavy-metalsites (labelled Xl.

tu

ItlSection V=ltl

Supplied by The British Library - "The world's knowledge"

Page 10: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

948 Structure 1995, Vol 3 No 9

AcknowledgemeHts: We would like to thank Keith Wilson for assis­tance and for kindly providing us with synchrotron beam time atthe ElvlliL Hamburg outstation, Zbyszek Dauter for stimulatingdiscussions and Veronique Roig-Zamboni for growing crystals.This project has been supported the CEE in a BIOTECHcontract (BI02-CT-94-3018). \Ve the European Union forsupport of the work at EMBL Hamburg through the HCMP toLarge Installations Project, contract no, CHGE-CT93-0040.

ReferencesBeguin, P, & Aubert, J.P. (1994). The biological degradation of cellu­lose. FEMS Microbiol, Rev. 13, 25-58.

2. Gilkes, N.R., Henrissat, S., Kilburn, D.G., ,\~iller, R.C. & Warren,R.A.j. (1991). Domains in microbial 13-1,4-glycanases: sequenceconservation, function, and enzyme families. lvlicrobiol. Rev. 55,303-315.

3. Henrissat, B. & Bairoch, A. (1993). New families in the classificationof glycosy! hydrolases based on amino acid sequence similarities.Biochem. j. 293,781-788.

4. Henrissat, B., Claeyssens, M., Tomme, P., lemesle, l. & Momon,J.P. (1989). Cellulase families revealed by hydrophobic clusteranalysis. Gene 81,83-95.

5. Henrissat, B. (1991). A classification of glycosyl hydrolases based onamino acid sequence similarities. Biochem.}. 280, 309-316.

6. Claeyssens, M. & Henrissat, B. (1992). Specificity mapping of cellu­lolytic enzymes - classification into families of structurally relatedproteins confirmed by biochemical analysis. Protein Sci. 1,1293-1297,

7. Gebler, j., et al., & Withers, S.G. (1992). Stereoselective hydrolysiscatalyzed by related 13-1,4-glucanases and 13-1,4-xylanases. j. BioI.Chern, 267, 12559-12561.

8. Sinnott, M.L. (1990). Catalytic mechanisms of enzymic glycosyltransfer. Chem, Rev. 90, 1171-1202.

9. McCarter, j.D. & Withers, S.G. (1994). Mechanisms of enzymaticglycoside hydroiysis. Curro Opin. Struct. Bioi. 4, 885-892.

10. Fierobe, H.P" et al., & Bela'ich, J.P. (1991). Characterization ofendogiucanase A from Clostridium cellulolyticum. j. B;;eteriol. 173,7956-7962.

11. Bela'ich, A., et al., & Bela"ich, J.P. (1992). The catalytic domain ofendoglucanase A from Clostridium cellulolyticum: effects of arginine79 and histidine 122 mutations on catalysis. }. BacteriDI. 174,4677-4682.

12. Fierobe, H.P., et al., & Bela'ich, J.P. (1993). Purification and charac­terization of endoglucanase C from Clostridium cellulo/yticum. Cat­alytic comparison with endoglucanase A. Eur. }. Biochem. 217,557-565.

13. Barras, F., et al., & Henrissat, B. (1992). Stereochemistry of thehydrolysis reaction catalyzed by endoglucanase Z from Erwiniachrysanthemi. FEBS Lett. 300,145-148.

14. Beguin, P. (1990). Molecular biology of cellulose. Annu. Rev.Microbiol. 44,219-248.

15. Baird, S.D., et al., & Seligy, V.l. (1990). The Glu residue in the con­served Asn-Glu-Pro sequence of two highly divergent endo-I3-1 ,4­giucanases Is essential for enzymatic activity. Biochem. Biophys.Res. Commun. 169, 035-1039.

16. B., Bartoli-German, i., Haiech, J., Chippaux, M. & Barras, F.991). Cellulase EGZ of En-vinia chrysanthemi: structural organiza­

tion and importance of His98 and Glu133 residues for catalysis.Protein Eng. 4, 325-333.

17. Navas, J. & Beguin, P, (1992). Site-directed mutagenesis of con­served residues of Clostridium thermocellum endoglucanase CelC.Biochem. Biophys. Res. Commun. 189, 807-812.

18. Wang, Q., et ai., & Withers, S.G. 993). Glu280 is the nucleophilein the active site of Clostridium thermocellum CeIC, a family Aendo-f'-1 ,4-glucanase. j. Bioi. Chern. 268, 14096-14102.

19. Bortoli-German, Haiech, J" Chippaux, ,\1. & Barras, F. (1995).Informational suppression to investigate structural, functional andevolutionary aspects of the Erwinia chrysanthemi ceilulase EGZ.j, Mol. 8iol. 246, 82-94.

20. Withers, S.G., et al., & Aebersold, R. (1990). Unequivocal demon­stration of the involvement of a glutamate residue as a nucleophilein the mechanism of a retaining glycosidase, j. Am. Chern, Soc. 112,5887-5889.

Souchon, SPM

our article was submitted (R Dominguez, HSpinelli, Z vauter, KS Wilson, S Chauvaux, Pi\.lzari, "'"ature Structural Blolc;gy 1995, 2:569-576).

The final model structure, encompassing 380 residues and 385water molecules, has rms deviations from stereochemicalideality of 0.007 A and 1.70" for bonds and angles, respec­tively. 87% of the residues are in the most favoured regions ofthe Ramachandran plot (Fig. 10), with only one non-glycineresidue, Arg30, in a disallowed region. This residue is locatedat the end of the first helix with its side chain completelyexposed to the solvent. The corresponding electron density,however, is dear and unambiguous in both the MlRAS andthe 2F0 -Fc maps.

fig. 10. Distribution of the main-chain dihedral angles forCelCCA [54]. Arg30 is shown as it is the only residue in the disal­lowed region. The plot was produced with the programPROCHECK [55].

Note added in proofThe structure of an endoglucanase belonging to farnily5, CeiC from ClostridluIn thermocel/um, has been published since

of 5.9 observations/reflection. The R-merge is 0.072 with avalue of 0.264 in the outer resolution shell (1.69-1.6 A). Thefinal overall R factor, resulting from the refinement involving53913 reflections (for which F>2aF) in the resolution range7-1.6 A, is 19.1%. The corresponding free R factor, for 5380reflections, is 23.15%.

The atomic coordinates of the refined structural model havebeen deposited with the Brookhaven Protein Data Bank (entrynumber 1EGD).

The determination of the rms deviations and the comparisonof the different a/13 barrel structures with respect to CelCCAwas accomplished by the superposition of the Ca backbonetraces with the option for rigid-body refinement using thegraphics program TURBO-FRODO. In a first visual step,Ca atoms of corresponding 13 strands were chosen for asimple rigid-body superposition: the first and last residues ofthe !3 strands I, II, IV and VI of each structure were over­lapped. Then the superposition was optimized by using ageometrical cut-off in the rigid-body refinement, includingall pairs of CO( atoms closer than 3 A and in a second step allthose closer than 2 A(Table 4).

Supplied by The British Library - "The world's knowledge"

Page 11: Crystal structure of the catalytic domain of a bacterial cellulase belonging to family 5

21. Henrissat, B., Callebaud, I., Fabrega, 5., lehn, P., Momon, J.P. &Davies, G. (1995). Conserved catalytic machinery and the predic­tion of a common fold for several families of glycosyl hydrolases.Proc. Natl. Acad. Sci. USA 92, 7090-7094.

22. Jenkins, J.A., lo leggio, l., Harris, G.W. & Pickersgill, R.W. (1995).J3-glucosidase, J3-galactosidase, family A cellulases, family Fxylanases and two barley glycanases form a superfamily of enzymeswith 8-fold J3/a architecture and with two conserved glutamates nearthe carboxy-terminal ends of J3-strands four and seven. FEBS Lett.362,281-285.

23. Rouvinen, T., Bergfors, T., Teeri, T., Knowles, J.K.C & Jones, T.A.(1990). Three-dimensional structure of cellobiohydrolase II fromTrichoderma reesei. Science 249,380-385.

24. Spezio, M., Wilson, D.B. & Karplus, P.A. (1993); Crystal structure ofthe catalytic domain of thermophilic endocellulase. Biochemistry32,9906-9916.

25. Juy, M., et al., & Aubert, J.P. (1992). Three-dimensional structure ofa thermostable bacterial cellulase. Nature 357, 89-91. .

26. Davies, G.J., et al., & Schulein, M. (1993). Structure and function ofendoglucanase V. Nature 365, 362-364.

27. Divne, C, et al., & Jones, T.A. (1994). The three-dimensional crystalstructure of the catalytic core of cellobiohydrolase I from Tricho­derma reesei. Science 265, 524-528.

28. Wakarchuk, W.W., Campbell, R.l., Sung, W.l., Davoodi, J. &Yaguchi, M. (1994). Mutational and crystallographic analysis of theactive-site residues of the Bacillus circulans xylanase. Protein Sci. 3,467-475.

29. Campbell, R.l., Rose, D.R., Wakarchuk, W.W., To, R., Sung, W.l. &Yaguchi, M. (1993). A comparison of the 20 kDa xylanases from Tri­choderma harzianum and Bacillus circulans. In Proceedings of theSecond TRICEL Symposium on Trichoderma reesei Cellulases andother Hydrolases. (Suominen, P. & Reinikainen, T., eds), pp. 63-72,Foundation for Biotechnical and Industrial Fermentation Research.

30. Torronen, A., Harkki, A. & Rouvinen, J. (1994). Three-dimensionalstructure of endo-l,4-J3-xylanase II from Trichoderma reesei: twoconformational states in the active site. EMBO j. 13, 2493-2501.

31. White, A., Withers, S.G., Gilkes, N.R. & Rose, D.R. (1994). Crystalstructure of the catalytic domain of the J3-1,4-glucanase Cex fromCellulomonas fimi. Biochemistry 33, 12546-12552.

32. Harris, G.W., et al., & Pickersgill, R.W. (1994). Structure of the cat­alytic core of the family F xylanase from Pseudomonas fluorescensand identification of the xylopentaose-binding sites. Structure 2,1107-1116.

33. Derewenda, U., et al., & Derewenda, Z.S. (1994). Crystal structure,at 2.6 A resolution, of the Streptomyces Iividans xylanase A, a mem­ber of the F family of J3-1,4-o-glucanases. j. BioI. Chem. 269,20811-20814.

34. Alber, T., et al., & Wilson, LA. (1981). On the three-dimensionalstructure and catalytic mechanism of triose phosphate isomerase.Phi/os. Trans. R. Soc. Land. B 239, 159-171.

35. Br:inden, C-1. (1991). The TIM barrel- the most frequently occur­ring folding motif in proteins. Curro Opin. Struct. BioI. 1, 978-983.

36. Br:inden, C-1. (1980). Relation between structure and function ofa/J3 proteins. Q. Rev. Biophys. 13, 317-338.

A family 5 bacterial cellulase Ducros et al. 949

37. Varghese, J.N., Garrett, T.P.J., Colman, P.M., Chen, l., Hoj., P.B. &Fincher, G.B. (1994). Three-dimensional structures of two l3-glucanendohydrolases with distinct substrate specificities. Proc. Natl. Acad.Sci. USA 91,2785-2789.

38. Jacobson, R.H., Zhang, X.J., DuBose, R.F. & Matthews, B.W. (1994).Three-dimensional structure of J3-galactosidase from E. coli. Nature369,761-766.

39. Kabsch, W. & Sander, C (1983). Dictionary of protein secondarystructure: pattern recognition of hydrogen-bonded and geometricalfeatures. Biopolymers 22,2577-2637.

40. Vyas, N.K. (1991). Atomic features of protein--<:arbohydrate interac­tions. Curro Opin. Struct. BioI. 1, 732-740.

41. Gebler, K., Kraub, N., Steiner, T., Betze!, c., Sandmann, C. &Saenger, W. (1994). Crystal structure of J3-D-cellotetraose hemi­hydrate with implications for the structure of cellulose II. Science266,1027-1029.

42. Teeter, M.M., (1984). Water structure of a hydrophobic protein atatomic resolution: pentagon rings of water molecules in crystals ofcrambin. Proc. Nat!. Acad. Sci. USA 81,6014-6018.

43. Farber, G.K. (1993). An a/J3 barrel full of evolutionary trouble. CurroOpin. Struct. BioI. 3, 409-412.

44. Roig, V., et al., & Haser, R. (1993). Crystallization and preliminaryX-ray analysis of the catalytic domain of endoglucanase fromClostridium cellulolyticum. j. Mol. BioI. 233, 325-327.

45. Matthews, B.W. (1968). Solvent content of protein crystals. j. Mol.BioI. 33, 491-497.

46. Kabsch, W. (1988). Evaluation of single crystal X-ray diffraction datafrom a position-sensitive detector. j. Appl. Cryst. 21, 916-924.

47. Collaborative computational project, No.4. (1994). The CCP4 suite:programs for protein crystallography. Acta. Cryst. D 50, 760-763.

48. Otwinowski, Z. (1993). DENZO: an oscillation data processing pro­gram for macromolecular crystallography. Yale University, NewHaven, CT.

49. Jones, T.A., Zou, J.Y., Cowan, S.W. &. Kjeldgaard, M. (1991).Improved methods for building models in electron density mapsand the location of errors in the models. Acta. Cryst. A 47,110-119.

50. Roussel, A. & Cambillau, C. (1992). TURBO-FRODO. Biographics,lCCMB, Marseille, France.

51. Brunger, A.T. (1993). X-PLOR, Version 3.1. Yale University, NewHaven, CT.

52. Engh, R.A. & Huber, R. (1991). Accurate bond and angle para­meters for X-ray protein structure refinement. Acta Cryst. A 47,392-400.

53. Brunger, A.T. (1992). Free R value: a novel statistical quantity forassessing the accuracy of crystal structures. Nature 355, 472-475.

54. Ramachandran, G.N. & Sasisekharan, V. (1968). Conformation ofpolypeptides and proteins. Adv. Protein Chem. 23,283-437.

55. laskowski, R.A., MacArthur, M.W., Moss, D.S. & Thornton, J.M.(1993). PROCHECK: a program to check the stereochemistry ofprotein structures. j. Appl. Cryst. 26, 283-291.

Received: 22 May 1995; revisions requested: 12 Jun 1995;revisions received: 28 Jun 1995. Accepted: 3 Ju11995.

l

Supplied by The British Library - "The world's knowledge"