You're browsing as a guest โ€” progress won't be saved.

Browsing as Guest
Back to Biochemistry for Health Sciences

Protein Folding & Molecular Chaperones

~5 min read

Lesson 11 of 12

Notes

Protein folding is the process by which a polypeptide chain spontaneously attains its native three-dimensional structure. This process is thermodynamically driven, kinetically complex, and in the cell is assisted by molecular chaperones. Failure of folding quality control underlies a broad class of diseases termed protein conformational disorders or proteinopathies.

Hierarchy of Protein Structure

Primary structure: the linear sequence of amino acids linked by peptide bonds, determined by the gene. All subsequent structural information is encoded here โ€” demonstrated by Anfinsen's classic experiment with ribonuclease A (denaturation then renaturation restored full activity, proving primary structure determines fold).

Secondary structure: local regular structural elements formed by hydrogen bonding between backbone amide (N-H) and carbonyl (C=O) groups. Alpha-helix: right-handed, 3.6 residues per turn, 0.54 nm pitch, i to i+4 hydrogen bonding, side chains point outward. Beta-sheet: extended strands connected by inter-strand hydrogen bonds โ€” parallel (same Nโ†’C direction, weaker bonds) or antiparallel (opposite directions, stronger, more linear bonds). Turns: short loops connecting secondary structure elements, often containing glycine (small, flexible) or proline (forces turn). Intrinsically disordered regions (IDRs): lack stable secondary structure under physiological conditions; important in signalling proteins.

Tertiary structure: the overall three-dimensional fold of a single polypeptide. Stabilised by: hydrophobic interactions (burial of nonpolar side chains โ€” the dominant force), hydrogen bonds (backbone-backbone and side chain-side chain), electrostatic interactions/salt bridges (opposite charges attract, pH-dependent), van der Waals forces, and disulfide bonds (covalent S-S between cysteine residues โ€” formed in oxidising environment of ER lumen; broken by beta-mercaptoethanol). Protein domains are independently folding, functional units โ€” many proteins consist of multiple domains (e.g., immunoglobulins, kinases).

Quaternary structure: assembly of two or more polypeptide subunits (protomers). Stabilised primarily by non-covalent interactions. Can be homomeric (identical subunits, e.g., haemoglobin S fibrils) or heteromeric (different subunits, e.g., haemoglobin alpha2beta2). Functional advantages: cooperativity (haemoglobin O2 binding), active site formation at subunit interface (many enzymes), allosteric regulation, larger structures than a single chain allows. Haemoglobin allosteric transitions (Tโ†”R state, Bohr effect, 2,3-BPG binding) are paradigmatic of quaternary structure-function relationships.

Protein Folding Thermodynamics and Kinetics

Folding is driven thermodynamically by the hydrophobic effect โ€” the entropy gain from releasing ordered water molecules around hydrophobic groups as they are buried in the protein core. The native state represents the global free-energy minimum: ฮ”G = ฮ”H โˆ’ Tฮ”S. The folding funnel energy landscape model describes how the ensemble of unfolded conformations progressively moves toward the native state, with many pathways converging on the native fold. Transient kinetic intermediates (molten globule states) have significant secondary structure but non-native tertiary packing.

Levinthal's paradox: if a 100-residue protein sampled all possible conformations randomly, it would take longer than the age of the universe to find the native state. In reality, folding occurs in microseconds to seconds via cooperative, pathway-directed processes guided by local secondary structure formation and hydrophobic collapse.

Molecular Chaperones

Molecular chaperones are proteins that assist folding or prevent aggregation of other proteins without being part of the final structure. They are particularly important for large, multi-domain proteins that transiently expose hydrophobic regions during folding and for stress-denatured proteins (heat shock proteins โ€” HSPs โ€” named for their induction by heat stress).

Hsp70 (DnaK in bacteria): binds exposed hydrophobic segments on nascent or partially unfolded proteins in an ATP-dependent cycle. ATP binding โ†’ open conformation (low affinity). Substrate binding โ†’ ATP hydrolysis โ†’ closed conformation (high affinity). Nucleotide exchange factor (NEF/GrpE) releases ADP โ†’ ATP rebinding โ†’ substrate release, giving the polypeptide another chance to fold. Works with Hsp40 (J-domain cochaperone, stimulates ATPase activity) and NEF. Prevents aggregation during ribosomal translation.

GroEL/GroES (Hsp60/Hsp10 system in bacteria; TRiC/CCT in eukaryotes): a large barrel-shaped chaperonin complex that provides an isolated folding chamber. GroEL: 14 subunits in two heptameric rings, each ring having a central cavity. GroES: heptameric cap. Mechanism: unfolded protein binds GroEL apical domains (hydrophobic contacts). GroES + ATP bind โ†’ cavity expands and hydrophilic residues lining the inner wall favour correct folding. ATP hydrolysis in cis ring โ†’ GroES released โ†’ protein either native or rebounds for another cycle (up to ~200 cycles for some substrates). Eukaryotic TRiC is required for actin, tubulin, and many other essential proteins.

Hsp90: late-stage chaperone for specific client proteins including many signalling proteins (steroid hormone receptors, kinases, HER2). Forms a homodimer with a complex cochaperone system (Hsp70, Hop, p23, Aha1). Hsp90 inhibitors (geldanamycin analogues) destabilise kinase clients, explored as cancer therapeutics.

Protein disulfide isomerase (PDI): in the ER lumen, catalyses formation, reduction, and isomerisation of disulfide bonds, enabling correct disulfide pattern for secreted and membrane proteins. Requires oxidising environment (provided by Ero1). Calnexin and calreticulin: ER lectins that bind monoglucosylated N-linked glycans on newly synthesised glycoproteins, retaining them in the ER until folding is complete. BiP (GRP78, an Hsp70): ER-resident chaperone central to the unfolded protein response (UPR).

Ubiquitin-Proteasome System (UPS)

Irreversibly misfolded proteins are targeted for degradation by the UPS. Ubiquitin (76 aa) is covalently attached to lysine residues of the substrate via an E1 (ubiquitin activating)โ†’E2 (conjugating)โ†’E3 (ligase) cascade. Polyubiquitin chains (typically K48-linked) mark proteins for the 26S proteasome (20S catalytic barrel + 19S regulatory caps). The 19S cap unfolds and deubiquitinates substrates; the 20S barrel has chymotryptic, tryptic, and caspase-like peptidase activities. Proteasome inhibitors (bortezomib) are used clinically in multiple myeloma โ€” plasma cells, with massive immunoglobulin production, are particularly dependent on proteasome function.

Prion Diseases

Prions (proteinaceous infectious particles, Stanley Prusiner, Nobel 1997) are misfolded forms of the prion protein (PrP) that act as templates to convert normal cellular PrPC (alpha-helix-rich) to the aberrant PrPSc conformation (beta-sheet-rich). PrPSc is resistant to proteases, accumulates in brain, and causes fatal spongiform encephalopathies: Creutzfeldt-Jakob disease (sCJD โ€” sporadic; fCJD โ€” familial PRNP mutations; vCJD โ€” variant, from BSE exposure), Gerstmann-Strรคussler-Scheinker syndrome, fatal familial insomnia, kuru (ritual cannibalism). The prion mechanism demonstrates that protein conformation, not just sequence, can be heritable โ€” a paradigm shift in biology.

Protein Aggregation Diseases

Amyloid: beta-sheet-rich fibrillar aggregates with characteristic Congo red staining and apple-green birefringence under polarised light. Cross-beta structure: beta-strands perpendicular to fibre axis. Alzheimer's disease: Aฮฒ peptides (cleaved from APP by beta-secretase/BACE1 and gamma-secretase/presenilin) aggregate as senile plaques; tau (hyperphosphorylated) aggregates as neurofibrillary tangles. Parkinson's disease: alpha-synuclein aggregates as Lewy bodies and neurites; loss of dopaminergic neurons in substantia nigra. Huntington's disease: polyglutamine (polyQ) expansion in huntingtin protein โ†’ aggregation and nuclear inclusions; dominant gain-of-function toxicity. Systemic amyloidoses: AL amyloidosis (immunoglobulin light chains in plasma cell dyscrasias), AA amyloidosis (serum amyloid A, from chronic inflammation), transthyretin amyloidosis (ATTR โ€” wild-type in senile cardiac amyloid; hereditary variants cause ATTR cardiomyopathy/neuropathy). Tafamidis stabilises transthyretin tetramers, preventing dissociation and misfolding โ€” approved for ATTR cardiomyopathy.

What to study next