GAPDS

Glyceraldehyde-3-Phosphate Dehydrogenase (PDB ID: 3PFW) from Homo sapiens

Created by: Felicia White

Sperm-specific glyceraldehyde-3-phosphate dehydrogenase (abbreviated GAPDS, PDB ID: 3pfw) is a glycolytic enzyme that catalyzes the oxidative phosphorylation of D-G3P (D-glyceraldehyde 3-phosphate) to form DPG (1,3-diphosphoglycerate). In addition, GAPDS also has other biological functions such as nuclear RNA transport, DNA replication and repair, membrane fusion and apoptosis (1). The human GAPD protein is also associated with several diseases, including Huntington’s or Alzheimer’s among others, that affect the neurological system.  The protein is present in mammals in two different forms that have a significant sequence identity but perform different functions. The somatic counterpart (GAPDH) is active all over the body in various cell types. GAPDH is the protein responsible for functions such as RNA transport, DNA replication during mitosis, and most membrane fusion and cellular apoptosis. The somatic form also influences the neurological diseases. Mammals also possess a sperm-specific counterpart that acts only in the sperm and do not have functions in the rest of the body. This protein, GAPDS, is associated strongly with sperm motility and therefore, fertility, of the male. The movement of the sperm cells, powered by their flagella, requires a large amount of ATP which is generated through glycolysis. The GAPDS shows enzymatic activity in a reaction that eventually leads to ATP formation. This protein contains a unique N-terminal 72-residue polyproline extension, not found in the GAPDH protein, which is needed for the tight association of GAPDS with the flagellum. Through this association of the flagella and the protein, the protein is able to provide the needed ATP for movement via glycolysis. Researchers are interested in studying this particular protein as a form of contraception to provide an alternative to hormone therapies and surgical options that currently exist. If the GAPDS was somehow inhibited from catalyzing the glycolysis reaction (without changing the GADPH enzymatic activity) or if the association of the GAPDS and flagella did not occur then the sperm cells would be left without the required energy for movement, obviously reducing their motility and therefore their ability to fertilize the female ovum.

            The ExPasy server was used to determine the weight and isoelectric point for the human GAPDS protein, the weight is reported at 44501.1 Da and the isoelectric point was found to be pH of 8.39 (3) so the protein is charged at physiological pH. The PSI-Blast yielded three proteins for comparison based on sequence similarity. One protein, a GAPDS from Rattus norvegicus (PDB ID: 2vyn) has an e-value of 0.0 which indicates that this protein has the potential to have 100 percent sequence identity with the query protein (PDB ID: 3pfw). Experimental comparisons of the sequence of the Rattus norvegicus GAPDS and the human GAPDS have confirmed 87% sequence identity and 95% similarity (2). The blast also highlighted two other proteins with very similar sequences. These proteins are from Trypanosoma cruzi (PDB ID: 3dmt) and Trypanosoma brucei (PDB ID: 2x0n) each with very low e-values approaching zero (4). Additionally, the Dali server yielded three proteins based on tertiary structure similarity to the POI. Two also turned up in the blast search, protein 3dmt (z-score = 50.0) and 2vyn (z-score = 56.1), however the third was not high in the list of the blast search (5). This protein (PDB ID: 4o59 z-score = 54.3)  was isolated from Bos taurus. The z-scores are considered significant above a value of 2, so the z-scores yielded from the Dali server all are significant and the larger value for the z-score usually translates to a more similar tertiary structure. Given this information it is prudent to consider this GAPDS from the Rattus norvegicus (PBD ID: 2vyn) as one of the comparison proteins since determining the active site could be done with more confidence given that the Rattus norvegicus GAPDS and the human GAPDS should have very similar, if not the same, active sites with the same functionally important residues present and have very similar tertiary structure which is determined in part by the manner in which the protein folds into secondary components. The GAPDH from Bos taurus (PDB ID: 4o59) will also be considered a comparison protein given the considerable z-score (50.0) as well as the close structural alignment. The difference in comparison proteins lies mostly in ligands as well as the length of the amino acid chain. The GAPDH from Trypanosoma cruzi has the same ligands but has a modified residue and the residues included in the active site are conserved but they are not necessarily in the same numerical positions in the sequence. This somatic GAPDH can be found in either the closed or open conformation, however the closed conformation is of more importance since this resembles the only possible conformation of the POI. Also, the Trypanosoma GAPDH contains 351 amino acids where the human GAPDS contains 346. The Rattus norvegicus GAPDS contains 334 residues. This protein also has formic acid in place of the glycerol ligand but does contain the NAD+ ligands. This difference in ligands could relate to catalysis of different sinlge step reactions in the production of ATP. The Rattus norvegicus GAPDS is the testis-specific protein and is only found in the closed conformation; a trait that is shared with the POI.  Much like the Trypanosoma protein, the residues in the active site of the Rattus norvegicus GAPDS are mostly conserved but may not be in the same numerical location along the amino acid chain.

           The ligands of GAPDS (PBD ID: 3pfw) are NAD+ and glycerol. The function of the NAD+ is to accept a proton and a pair of electrons during the glycolysis reaction. Glycerol serves an altogether different function. Glycerol is a sugar alcohol and an intermediate in carbohydrate and lipid metabolism (6). Glucose is a carbohydrate and the protein is catalyzing part of the glycolysis reaction so glycerol might be present as a sort of inhibitor, preventing the reaction from proceeding too quickly or too far towards the products in accordance with Le Chateliers principle. Additionally, in some organisms, glycerol can enter the glycolysis pathway directly and provide energy for cellular metabolism so glycerol could be attached as a ligand only to further the production of ATP (6). Human GAPDS (PBD ID: 3pfw) in composed of 418 amino acids (including the tail), the sequence is comprised of about 10% acidic residues and 12.9% basic residues lending itself to a higher pI. Also, given the relative amount of hydrophobic residues, the protein adopts a globular shape to shield them from water.Additionally, the protein has regions of alpha helices, beta sheets, and 3/10 helices as well as random coil segments. GAPDS has two subunitsper chain named O and P, for a total of four subunits, and is known as a homotetramer (7). The GAPDS ligands are imbeded in the structure. This protein is not bound to any metals and does not appear to have specifically known ionic interactions or hydrogen bonds among the residues. The structure was of the protein was determined by over expression of human GAPDS and removing the N-terminal polyproline chain. Crystal structures were determined while the protein was in complex with NAD+ and phosphate and NAD+ and glycerol using X-ray diffraction (1,7).  The secondary structure consists of multiple α-helices and β-strands and contains many turns and bends according to the ProtParam tool (8). The PDB only classifies about 56% of the protein into either an α-helix or β-strand so it is probable that some of the turns or bends that the ProtParam tool lists are actually sections of random coil or 3/10 helices. This implies that GAPDS will have segments associated with other subunits or proteins to stabilize it. ProtParam also estimates the half-life of the protein to be about thirty hours.

As mentioned previously, GAPDS (PBD ID: 3pfw) functions to catalyze glycolysis reactions. First and foremost, the N-terminus contains a polyproline tail that is approximately 72 residues long (1,9). The tail binds to the cytoskeletal fibrous sheath of the flagella via anchoring proteins and accounts for most of the sequence dissimilarity between the human GADPS and the somatic counter-part. Without this tail the protein would not be able to attach itself to the sperm and provide it with the energy (ATP) needed for motility, therefore completely neglecting the functional importance of this protein. Residues Asp-106, Lys-151, Tyr-173, Ser-193, and Asn-388 are residues that are functionally important since they are binding sites for NAD+ (binding pocket) and according to the primary citation the reaction that the protein catalyzes is dependent on NAD+ which is converted to NADH over the course of the reaction (1, 9). Additionally, nucleotide binding to NAD takes place at Arg-85 and Ile-86. Residue Cys-224 is the nucleophilic active site (space-filling) that bonds to the inorganic phosphate which becomes bonded to D-glyceraldehyde 3-phosphate during the reaction to yield 3-phospho-D-glyceroyl phosphate as one of the products (9). The D-glyceraldehyde 3-phosphate is bound to residues Thr-254 and Arg-306. An acid-base-nucleophile triad is important for enzymatic catalysis and involves the interaction of an acidic residue with a basic residue to polarize the base. The base will then deprotonate with the nucleophile and increase its reactivity toward an electrophile encouraging bonding. In GAPDS Asp-106 acts as the acidic residue aligning and polarizing the His-251 residue. The Histidine then polarizes and deprotonates the thiol group of Cys-224 to increase its reactivity in binding with the inorganic phosphate which it will later donate to D-glyeraldehyde-3-phosohate. Given all of the residues and reactants that must be in or very near the active site, it is clear that the substrates must fit into the protein site in a specific conformation and that donor for the inorganic phosphate should be rather small. Otherwise, it seems as though the rate of the reaction would be slowed by steric hindrance. Most likely the phosphate donor is not really a donor so much as a free phosphate group. The small NAD binding pocket in which 2 glycerol molecules and all the necessary substrates must fit limits the sort of phosphate that can used as well as the possibility of catalyzing other reactions.