Biochemistry Basics: A Complete Guide to the Chemistry of Life

Updated May 2026
Biochemistry is the branch of science that studies the chemical processes and substances occurring within living organisms. It bridges biology and chemistry by examining how molecules such as proteins, nucleic acids, carbohydrates, and lipids interact to sustain life, from the reactions inside a single cell to the metabolic pathways that power an entire organism.

What Is Biochemistry

Biochemistry occupies the intersection of biology and chemistry, applying chemical principles to understand the molecular machinery of living systems. While general chemistry deals with elements and reactions in isolation, and biology studies organisms and ecosystems at a larger scale, biochemistry narrows the focus to the specific molecules and reactions that allow cells to grow, reproduce, and respond to their environment.

The field emerged in the early twentieth century when scientists began isolating and characterizing the molecules responsible for biological functions. Friedrich Wohler's 1828 synthesis of urea from inorganic compounds challenged the prevailing belief that organic molecules could only be produced by living organisms. Eduard Buchner's demonstration in 1897 that cell-free yeast extracts could ferment sugar proved that biological reactions did not require intact living cells, opening the door to enzyme chemistry.

Modern biochemistry spans a wide range of subdisciplines. Structural biochemistry determines the three-dimensional shapes of proteins and nucleic acids using techniques such as X-ray crystallography and cryo-electron microscopy. Metabolic biochemistry maps the networks of chemical reactions that cells use to extract energy from nutrients. Molecular biology, a close relative, focuses on the flow of genetic information from DNA to RNA to protein. Clinical biochemistry applies these principles to human health, using blood tests and biomarkers to diagnose disease.

At its core, biochemistry seeks to answer a fundamental question: how do collections of inanimate atoms and molecules give rise to the complex, self-regulating systems we call life? Every process in a living organism, from muscle contraction to immune defense to the transmission of nerve impulses, can be described in chemical terms. Understanding those chemical terms is what biochemistry is all about.

The Four Classes of Biomolecules

Living organisms are built from four major classes of organic molecules: proteins, nucleic acids, carbohydrates, and lipids. Each class plays distinct roles, yet they constantly interact within cells to maintain life. Together, these molecules account for the vast majority of the dry weight of any cell.

Proteins are polymers of amino acids linked by peptide bonds. They serve as enzymes, structural components, transport carriers, signaling molecules, and immune defenders. The human body contains tens of thousands of different proteins, each with a specific three-dimensional shape that determines its function. A single misfolded protein can cause disease, as seen in conditions like Alzheimer's and sickle cell anemia.

Nucleic acids, specifically DNA and RNA, store and transmit genetic information. DNA holds the long-term blueprint for building proteins, while various forms of RNA carry out the instructions. Messenger RNA (mRNA) copies the DNA template, transfer RNA (tRNA) delivers amino acids to the ribosome, and ribosomal RNA (rRNA) forms the structural core of the ribosome itself. The discovery of DNA's double helix structure by Watson and Crick in 1953 remains one of the most important milestones in all of science.

Carbohydrates range from simple sugars like glucose and fructose to complex polysaccharides like starch, glycogen, and cellulose. They serve as the primary short-term energy source for most organisms. Glucose, a six-carbon monosaccharide, is the central molecule of cellular metabolism. Cells break it down through glycolysis and the citric acid cycle to generate ATP, the universal energy currency. Complex carbohydrates also serve structural roles, as in the cellulose that strengthens plant cell walls and the chitin that forms the exoskeletons of insects.

Lipids are a chemically diverse group defined by their hydrophobic nature. They include fatty acids, triglycerides, phospholipids, and steroids. Phospholipids form the bilayer membranes that surround every cell, creating a selective barrier between the cell's interior and its environment. Triglycerides store energy more efficiently than carbohydrates, packing roughly nine kilocalories per gram compared to four for carbohydrates. Steroids such as cholesterol serve as precursors for hormones including testosterone, estrogen, and cortisol.

Proteins and Amino Acids

Proteins are arguably the most versatile class of biomolecules. They are constructed from a set of 20 standard amino acids, each sharing a common backbone with an amino group, a carboxyl group, and a variable side chain (the R group) attached to a central alpha carbon. The R group determines each amino acid's chemical properties: some are hydrophobic, some are hydrophilic, some carry positive or negative charges at physiological pH, and some contain reactive functional groups like sulfhydryl or hydroxyl groups.

Protein structure is described at four levels. The primary structure is the linear sequence of amino acids in the polypeptide chain. This sequence is encoded directly by the gene that specifies the protein. The secondary structure refers to local folding patterns stabilized by hydrogen bonds between backbone atoms, primarily alpha helices and beta sheets. The tertiary structure is the overall three-dimensional shape of a single polypeptide, determined by interactions among the R groups, including hydrophobic packing, ionic bonds, hydrogen bonds, and disulfide bridges between cysteine residues. The quaternary structure describes the arrangement of multiple polypeptide subunits into a functional complex, as seen in hemoglobin's four-subunit assembly.

The relationship between structure and function in proteins cannot be overstated. An enzyme's ability to catalyze a specific reaction depends entirely on the precise geometry of its active site. A transport protein's ability to carry oxygen depends on the exact arrangement of its iron-containing heme groups. When proteins lose their three-dimensional structure through a process called denaturation, caused by heat, extreme pH, or chemical agents, they typically lose their function as well.

Protein synthesis occurs at the ribosome, where mRNA is translated into an amino acid sequence. The newly synthesized polypeptide then folds into its functional shape, sometimes with the assistance of chaperone proteins. Post-translational modifications such as phosphorylation, glycosylation, and ubiquitination further regulate protein activity, localization, and lifespan within the cell.

Enzymes as Biological Catalysts

Enzymes are proteins (with some notable RNA exceptions called ribozymes) that accelerate biochemical reactions by lowering the activation energy required for the reaction to proceed. Without enzymes, most metabolic reactions would occur far too slowly to sustain life. A typical enzyme can increase a reaction rate by a factor of a million or more, and some enzymes approach the theoretical speed limit set by the rate at which substrates can diffuse into the active site.

The lock-and-key model, proposed by Emil Fischer in 1894, was the first attempt to explain enzyme specificity. It suggested that the enzyme's active site and the substrate fit together like a key in a lock. The more accurate induced-fit model, developed by Daniel Koshland in 1958, recognizes that the active site changes shape slightly upon substrate binding, optimizing the fit and promoting catalysis.

Enzyme activity is influenced by several factors. Temperature affects the kinetic energy of molecules: moderate increases in temperature raise reaction rates, but excessive heat denatures the enzyme. Most human enzymes function optimally near 37 degrees Celsius. pH affects the ionization state of amino acid residues in the active site, so each enzyme has a pH optimum. Pepsin, which works in the acidic stomach environment, is most active near pH 2, while trypsin, which operates in the small intestine, prefers pH 8.

Enzyme kinetics, the quantitative study of reaction rates, is described by the Michaelis-Menten equation. The two key parameters are Km, the substrate concentration at which the reaction rate is half-maximal, and Vmax, the maximum rate achieved when all enzyme molecules are saturated with substrate. A low Km indicates high substrate affinity, meaning the enzyme reaches half its maximum speed at a low substrate concentration.

Cells regulate enzyme activity through several mechanisms. Allosteric regulation involves molecules binding at sites other than the active site, changing the enzyme's shape and either activating or inhibiting it. Competitive inhibitors occupy the active site and block substrate binding. Noncompetitive inhibitors bind elsewhere and reduce catalytic efficiency regardless of substrate concentration. Cells also control enzyme abundance through gene expression and targeted protein degradation.

DNA, RNA, and Nucleic Acids

Nucleic acids are polymers of nucleotides, each consisting of a nitrogenous base, a five-carbon sugar, and one or more phosphate groups. DNA (deoxyribonucleic acid) uses the sugar deoxyribose and the bases adenine (A), guanine (G), cytosine (C), and thymine (T). RNA (ribonucleic acid) uses ribose and substitutes uracil (U) for thymine. The phosphodiester bonds linking nucleotides create the sugar-phosphate backbone, while the bases project inward and pair with complementary bases on the opposing strand.

DNA's double helix stores genetic information in the sequence of its bases. The two strands run antiparallel, with one oriented 5' to 3' and the other 3' to 5'. Hydrogen bonds between complementary base pairs, A with T (two bonds) and G with C (three bonds), hold the strands together. This complementarity is the basis for DNA replication: each strand serves as a template for synthesizing a new complementary strand, ensuring that genetic information is faithfully copied before cell division.

RNA plays multiple roles beyond simply carrying messages from DNA to the ribosome. Ribosomal RNA forms the catalytic core of the ribosome, directly facilitating peptide bond formation. Transfer RNA acts as an adapter, matching amino acids to their corresponding mRNA codons through its anticodon loop. Small interfering RNA (siRNA) and microRNA (miRNA) regulate gene expression by targeting specific mRNAs for degradation or translational repression. The discovery that RNA can both store information and catalyze reactions supports the RNA world hypothesis, which proposes that early life relied on RNA before the evolution of DNA and proteins.

The central dogma of molecular biology, articulated by Francis Crick in 1958, describes the flow of genetic information: DNA is transcribed into RNA, which is translated into protein. While there are exceptions, such as reverse transcription in retroviruses, this general framework remains a foundational principle of biochemistry and molecular biology.

Metabolism and Energy Transfer

Metabolism encompasses all the chemical reactions occurring within a living organism. These reactions are organized into metabolic pathways, sequences of enzyme-catalyzed steps that convert starting materials into products. Metabolic pathways fall into two broad categories: catabolic pathways break down complex molecules into simpler ones, releasing energy in the process, while anabolic pathways use energy to build complex molecules from simpler precursors.

The energy released by catabolic reactions is captured in the form of ATP (adenosine triphosphate), the cell's primary energy currency. ATP consists of adenine, ribose, and three phosphate groups linked by high-energy phosphoanhydride bonds. When the terminal phosphate is hydrolyzed, releasing ADP and inorganic phosphate, the reaction releases approximately 7.3 kilocalories per mole of free energy under standard conditions. Cells use this energy to drive thermodynamically unfavorable reactions, transport molecules across membranes, and power mechanical work like muscle contraction.

Electron carriers play a central role in energy metabolism. NAD+ (nicotinamide adenine dinucleotide) and FAD (flavin adenine dinucleotide) accept electrons from substrates during catabolic reactions, becoming NADH and FADH2. These reduced carriers then deliver their electrons to the electron transport chain in the mitochondrial inner membrane, where the energy of electron transfer is used to pump protons across the membrane, creating a proton gradient that drives ATP synthesis.

Metabolic regulation ensures that cells produce the right amounts of each molecule at the right time. Key regulatory mechanisms include allosteric control of enzymes at committed steps in pathways, hormonal signaling that adjusts metabolic rates across entire tissues, and the compartmentalization of reactions within specific organelles. The liver, for example, serves as a metabolic hub, switching between glucose storage, glucose release, fatty acid synthesis, and amino acid catabolism depending on the body's nutritional state and hormonal signals.

Cellular Respiration and ATP

Cellular respiration is the process by which cells extract energy from glucose and other organic fuels, converting it into ATP. The complete aerobic oxidation of one molecule of glucose yields approximately 30 to 32 molecules of ATP, depending on the shuttle system used to transfer cytoplasmic NADH into the mitochondria. The process occurs in three main stages: glycolysis, the citric acid cycle (also called the Krebs cycle or TCA cycle), and oxidative phosphorylation.

Glycolysis takes place in the cytoplasm and does not require oxygen. It splits one six-carbon glucose molecule into two three-carbon pyruvate molecules, producing a net gain of 2 ATP and 2 NADH. The pathway involves ten enzyme-catalyzed steps, with key regulatory enzymes at hexokinase, phosphofructokinase (PFK-1), and pyruvate kinase. PFK-1 is the primary control point: it is activated by AMP and fructose-2,6-bisphosphate and inhibited by ATP and citrate, linking glycolytic rate to the cell's energy status.

The citric acid cycle operates in the mitochondrial matrix. Pyruvate is first converted to acetyl-CoA by the pyruvate dehydrogenase complex, releasing one CO2 and one NADH per pyruvate. Acetyl-CoA then enters the eight-step cycle, where its two carbons are oxidized to CO2. Each turn of the cycle produces 3 NADH, 1 FADH2, 1 GTP (equivalent to 1 ATP), and 2 CO2. Since each glucose yields two pyruvate molecules, the cycle turns twice per glucose.

Oxidative phosphorylation takes place at the inner mitochondrial membrane and accounts for the vast majority of ATP production. The electron transport chain consists of four protein complexes (I through IV) and two mobile electron carriers, ubiquinone and cytochrome c. Electrons from NADH enter at Complex I, while those from FADH2 enter at Complex II. As electrons pass through the chain, energy is released and used to pump protons from the matrix into the intermembrane space. The resulting proton gradient, called the proton-motive force, drives ATP synthase (Complex V), a molecular turbine that catalyzes the phosphorylation of ADP to ATP as protons flow back into the matrix.

In the absence of oxygen, cells can still generate ATP through fermentation. Lactic acid fermentation, which occurs in exercising muscle cells and certain bacteria, converts pyruvate to lactate while regenerating NAD+. Alcoholic fermentation, used by yeast, converts pyruvate to ethanol and CO2. Both pathways yield only the 2 ATP from glycolysis, far less than aerobic respiration, but they allow cells to continue generating energy when oxygen is unavailable.

Carbohydrates and Lipids

Carbohydrates serve as the body's most readily accessible energy source and play important structural roles as well. Monosaccharides are the simplest carbohydrates: glucose, fructose, and galactose are the most common six-carbon sugars. Disaccharides form when two monosaccharides join through a glycosidic bond: sucrose (glucose + fructose), lactose (glucose + galactose), and maltose (glucose + glucose) are familiar examples. Polysaccharides are long chains of monosaccharides: starch and glycogen serve as energy storage in plants and animals respectively, while cellulose provides structural rigidity to plant cell walls.

Glycogen metabolism is tightly regulated by hormones. Insulin, released when blood glucose is high, stimulates glycogen synthesis (glycogenesis) in the liver and muscles. Glucagon, released when blood glucose drops, triggers glycogen breakdown (glycogenolysis) in the liver, releasing glucose into the bloodstream. Epinephrine activates glycogenolysis in muscle during exercise, providing a rapid fuel supply for intense activity. These hormonal controls ensure that blood glucose remains within a narrow range of approximately 70 to 110 milligrams per deciliter.

Lipids perform diverse functions despite their shared hydrophobic character. Fatty acids, the simplest lipids, are long hydrocarbon chains with a carboxyl group at one end. Saturated fatty acids have no carbon-carbon double bonds and pack tightly, making fats like butter solid at room temperature. Unsaturated fatty acids contain one or more double bonds that introduce kinks in the chain, preventing tight packing and producing oils that are liquid at room temperature. Trans fats, produced by partial hydrogenation of vegetable oils, have straightened chains that mimic saturated fats and have been linked to increased cardiovascular risk.

Phospholipids are the structural foundation of all biological membranes. Each phospholipid has two fatty acid tails (hydrophobic) and a phosphate-containing head group (hydrophilic). In water, phospholipids spontaneously arrange into bilayers with the tails facing inward and the heads facing outward, creating the selectively permeable barriers that define cells and organelles. Cholesterol, a steroid lipid, inserts into animal cell membranes and modulates their fluidity, preventing them from becoming too rigid at low temperatures or too fluid at high temperatures.

Cell Signaling and Regulation

Cells constantly communicate with one another through chemical signals, and the biochemical machinery that detects and responds to these signals is remarkably sophisticated. Signal transduction pathways convert extracellular signals into intracellular responses, often amplifying the signal dramatically along the way. A single hormone molecule binding to a cell surface receptor can ultimately activate thousands of enzyme molecules inside the cell.

Cell surface receptors fall into several major families. G protein-coupled receptors (GPCRs) are the largest family, with over 800 members in the human genome. When a ligand binds a GPCR, the receptor activates an associated G protein, which in turn activates or inhibits downstream effector enzymes such as adenylyl cyclase or phospholipase C. Receptor tyrosine kinases (RTKs), used by growth factors like insulin and epidermal growth factor, dimerize upon ligand binding and phosphorylate each other's intracellular domains, creating docking sites for signaling proteins.

Second messengers are small molecules that relay signals within the cell. Cyclic AMP (cAMP), produced by adenylyl cyclase, activates protein kinase A (PKA), which phosphorylates target proteins to alter their activity. Calcium ions, released from the endoplasmic reticulum, bind to calmodulin and activate calcium-dependent enzymes. Inositol trisphosphate (IP3) and diacylglycerol (DAG) are produced by phospholipase C and trigger calcium release and protein kinase C activation, respectively.

Protein phosphorylation is the most common post-translational modification used in signaling. Kinases add phosphate groups to serine, threonine, or tyrosine residues, while phosphatases remove them. This reversible switch can activate or inactivate enzymes, change protein localization, or create binding sites for other proteins. Dysregulation of kinase signaling is a hallmark of cancer, which is why many modern cancer drugs are kinase inhibitors.

Vitamins and cofactors serve as essential partners for many enzymes involved in signaling and metabolism. B vitamins, for example, are precursors to coenzymes such as NAD+ (from niacin), FAD (from riboflavin), and coenzyme A (from pantothenic acid). Without adequate vitamin intake, the enzymes that depend on these cofactors cannot function properly, leading to metabolic disorders and deficiency diseases like pellagra, beriberi, and scurvy.

Biochemistry in Medicine and Research

The principles of biochemistry underpin nearly every aspect of modern medicine. Clinical biochemistry uses blood and urine tests to measure metabolite and enzyme levels, helping physicians diagnose diseases ranging from diabetes to liver failure. Elevated blood glucose indicates diabetes mellitus. High levels of cardiac troponin signal heart muscle damage. Abnormal liver enzyme levels (ALT, AST) suggest hepatocellular injury. Lipid panels measuring total cholesterol, LDL, HDL, and triglycerides assess cardiovascular risk.

Drug design relies heavily on biochemical knowledge. Most pharmaceuticals work by interacting with specific proteins, either inhibiting enzymes, blocking receptors, or mimicking natural signaling molecules. Statins lower cholesterol by inhibiting HMG-CoA reductase, the rate-limiting enzyme in cholesterol synthesis. ACE inhibitors treat hypertension by blocking the enzyme that converts angiotensin I to the vasoconstrictor angiotensin II. Protease inhibitors treat HIV infection by blocking the viral protease needed to process viral polyproteins into functional components.

Biotechnology tools derived from biochemistry have transformed research and medicine. Polymerase chain reaction (PCR) amplifies specific DNA sequences from tiny samples, enabling genetic testing, forensic identification, and pathogen detection. Gel electrophoresis separates proteins or nucleic acids by size, allowing researchers to analyze gene expression and protein composition. CRISPR-Cas9 gene editing, adapted from a bacterial immune system, allows precise modifications to DNA sequences in living cells, opening possibilities for treating genetic diseases.

The field continues to advance rapidly. Structural biology techniques like cryo-electron microscopy now resolve protein structures at near-atomic resolution without the need for crystallization. Metabolomics profiles the complete set of small molecules in a biological sample, revealing metabolic signatures of disease. Proteomics catalogs all proteins in a cell or tissue, identifying changes associated with cancer, neurodegeneration, and other conditions. These tools are making it possible to understand biological systems at a level of detail that was unimaginable even a few decades ago.

Explore Biochemistry Topics

Foundations

Biomolecules

Enzymes

Metabolism and Energy

Protein Biology

Cell Signaling and Regulation

Methods and Applications