← Writing
Geometry of Curved Spaces · Part 1

The Geometry of Curved Spaces

You are standing on a flat x-y plane. To measure the distance of a tiny step, you square each coordinate change and add them together: ds2=dx2+dy2ds^2 = dx^2 + dy^2. This is the simplest metric imaginable, Pythagoras applied infinitesimally. Every step in every direction is governed by the same formula, uniformly, everywhere.

Now look at the surface. It is the same plane, but with a bump. Spin it and watch the coordinate grid. Near the peak, a step of the same coordinate size covers more actual surface area than the same step on the flat rim. The coordinate step and the real distance have come apart. The formula ds2=dx2+dy2ds^2 = dx^2 + dy^2 no longer works; it would lie to you about distances near that peak.

This is the problem Riemannian geometry was invented to solve. This post is meant to be visual explainer of basics of Riemannian geometry.


Manifolds: what is a curved space?

Intuition

Imagine a crumpled piece of paper. To an ant walking on it, the surface feels flat locally, but globally it is curved. A manifold is like this: locally flat everywhere, globally curved. You can assign coordinates to each small patch, just like street addresses on a city map, even though the whole surface bends. It’s the stage on which the geometry takes place.

Imagine an ant living on the surface of a sphere. It cannot fly off the surface and look at the sphere from outside. Yet it can still discover that its world is curved: if it draws a large enough triangle, the angles add up to more than 180°. If it walks two paths that start out parallel, they will eventually meet. Curvature is not a property of how the surface sits inside some larger space; it is an intrinsic property of the surface itself, discoverable from within.

To do any calculation, the ant needs a way to label points. It lays down coordinates xμx^\mu at every location, a coordinate chart. On a flat plane these are just xx and yy. On the bump, they might be latitude and longitude, or radial distance from the peak, or any other convenient labelling. The choice is arbitrary. The coordinates are a filing system we impose on the surface, not a feature of the surface itself.

The abstract object, the surface, without any particular coordinate labels, is called a manifold, written MM. Formally, a manifold is a smooth space with the property that any small enough patch looks flat: if you zoom in close enough to any point, the surface is locally indistinguishable from an ordinary Euclidean plane. Globally it can curve, fold, and close on itself. But locally, coordinates always behave like xx and yy.

Formally: a manifold MM is a set of points that locally looks like Rn\mathbb{R}^n. Around every point there exists a small open neighborhood that can be mapped smoothly to Rn\mathbb{R}^n and back — that neighborhood is a coordinate patch, or chart. Within it, points carry coordinates (x1,x2,,xn)(x^1, x^2, \ldots, x^n) that behave just like ordinary Cartesian coordinates.

The surface in the animation is the manifold MM. The blue region U1U_1 and the amber region U2U_2 are two such coordinate patches. Inside each one the local axes (x1,x2)(x^1, x^2) are drawn — two perpendicular directions spanning R2\mathbb{R}^2 at that location. The purple strip is U1U2U_1 \cap U_2, the overlap where both charts are simultaneously valid. On that overlap, the coordinates assigned by U1U_1 and those assigned by U2U_2 are smooth functions of each other. That compatibility condition is precisely what makes MM a smooth manifold rather than an arbitrary set of points.


The metric tensor: how do you measure distances?

Intuition

Imagine stretching a piece of paper more in some directions than others. Your ruler starts behaving oddly: distances change depending on where you place it and which direction you measure. The metric is this ruler, carried to every point on the surface. It encodes how lengths and angles work at each location, even when the surface curves or when time behaves differently from space.

Here is the problem the bump creates precisely. On a flat plane, a step dx1dx^1 in the first coordinate direction always covers the same physical distance, no matter where you are. On the bump, the same coordinate step dx1dx^1 near the peak covers more actual surface than the same step on the flat rim. The surface has stretched the coordinate grid unevenly. If we try to use ds2=(dx1)2+(dx2)2ds^2 = (dx^1)^2 + (dx^2)^2, we get the wrong answer near the peak.

What we need is a correction factor at every point: a symmetric matrix gμν(x)g_{\mu\nu}(x) that encodes exactly how much a coordinate step stretches into real distance. On the flat plane this matrix is the identity; no correction is needed. On the bump, each entry varies from point to point, tracking the stretching of the surface. The corrected distance formula is:

ds2=gμν(x)dxμdxνds^2 = g_{\mu\nu}(x) \, dx^\mu \, dx^\nu

Here μ\mu and ν\nu each range over 1 and 2, and we use the Einstein summation convention: whenever an index appears twice in a single term, once up, once down, it is summed over. So the right-hand side expands to g11(dx1)2+g12dx1dx2+g21dx2dx1+g22(dx2)2g_{11}(dx^1)^2 + g_{12} \, dx^1 dx^2 + g_{21} \, dx^2 dx^1 + g_{22}(dx^2)^2. From here on, repeated indices always mean sum.

Reading the formula term by term:

  • gμν(x)g_{\mu\nu}(x) is the metric tensor, a symmetric matrix at every point xMx \in M. Its entries encode how coordinate steps scale into real distances at that location.
  • dxμdx^\mu and dxνdx^\nu are infinitesimal displacements in the coordinate directions.

For our bump surface, g11g_{11} stretches distances along x1x^1, g22g_{22} along x2x^2, and g12=g21g_{12} = g_{21} captures coupling when the coordinate directions are not orthogonal. Two standard forms anchor the intuition:

Euclidean R3 ⁣:ds2=dx2+dy2+dz2gμν=δμν\text{Euclidean }\mathbb{R}^3\colon \quad ds^2 = dx^2 + dy^2 + dz^2 \qquad g_{\mu\nu} = \delta_{\mu\nu}

Spacetime  ⁣:ds2=dt2+dx2+dy2+dz2gμν=diag(1,+1,+1,+1)\text{Spacetime }\colon \quad ds^2 = -dt^2 + dx^2 + dy^2 + dz^2 \qquad g_{\mu\nu} = \operatorname{diag}(-1,+1,+1,+1)

The Euclidean case is Pythagoras at infinitesimal scale. The spacetime case introduces a minus sign for the time direction — and that sign changes everything.

Riemannian vs semi-Riemannian

When all eigenvalues of gμνg_{\mu\nu} are positive, the metric is Riemannian: ds20ds^2 \geq 0 always, distances are real, and the geometry is that of an ordinary curved surface. Our bump lives here.

When one or more eigenvalues are negative, the metric is semi-Riemannian (or pseudo-Riemannian). The physically important case is Lorentzian: one negative eigenvalue for the time direction. In that setting ds2ds^2 can be positive (spacelike separation, two events that are purely spatial), zero (lightlike, the path a photon takes), or negative (timelike, the path a massive object follows). The metric tensor is the same object in both cases — the only difference is the sign pattern of its eigenvalues, called the signature.

The full metric tensor is one symmetric matrix at every point of MM. From it alone you can derive distances, angles, volumes, and every geometric quantity that follows. The component above shows g11g_{11}, g12g_{12}, g22g_{22}, and ds2ds^2 live when you click two nearby points.


Signature: what kind of metric is this?

Intuition

Think of a video game map: every direction you can move is a direction in space. Now add a dimension for time, which behaves differently: you can never turn around in it. The signature tells you which directions are time-like and which are space-like. In relativity the signature is (−, +, +, +): one time direction and three spatial ones, and that single minus sign is what makes the geometry of spacetime different from ordinary curved space.

At any point on MM, the metric matrix gμνg_{\mu\nu} is symmetric and can be diagonalised. The signs of its eigenvalues, all positive or some negative, classify the geometry at that point.

For our bump surface, both eigenvalues are positive at every point. The signature is (+,+)(+,+). This is a Riemannian metric: ds20ds^2 \geq 0 always, distances are real and positive, and two distinct points are always separated by a positive distance.

Flip one sign, allow one negative eigenvalue, and you get a Lorentzian metric. The Minkowski metric of special relativity has signature (,+,+,+)(-,+,+,+): one time direction and three spatial directions, with ds2=c2dt2+dx2+dy2+dz2ds^2 = -c^2 dt^2 + dx^2 + dy^2 + dz^2. Distances can be zero between distinct events, or imaginary. The null interval, ds2=0ds^2 = 0, is exactly what light travels along. These are not pathological cases; they are the geometry of spacetime.

This section earns its place by naming the vocabulary. The follow-up post on the Schwarzschild solution opens with a Lorentzian metric, one sign flipped from everything built here, and the geometry of that single sign change is what makes black holes possible.


Curvature: does the space bend?

Intuition

Roll a marble on three different surfaces. On a flat table it goes straight. On a sphere it curves inward, parallel paths that start out side by side will eventually meet. On a saddle it curves outward, parallel paths that start together spread apart. Curvature tells you which of these three worlds you are in, and it shapes how everything moves through the space, just as gravity shapes how objects fall.

The simplest measure of curvature at a point is Gaussian curvature, written KK, a single number. At the peak of the bump: K>0K > 0, the surface curves inward like the top of a sphere. On the flat rim: K=0K = 0. Between the peak and the flat rim, the surface bends outward in one direction and inward in another, a saddle shape, giving K<0K < 0. The heat map on the component above shows this variation across the surface.

The remarkable theorem that makes curvature intrinsic is Gauss’s Theorema Egregium: KK can be computed from gμνg_{\mu\nu} and its derivatives alone. No knowledge of how the surface sits in 3D space is needed. For the bump surface, write the height as f(x1,x2)f(x^1, x^2); then:

K=f11f22f122(1+f12+f22)2K = \frac{f_{11} f_{22} - f_{12}^2}{\left(1 + f_1^2 + f_2^2\right)^2}

where fμ=f/xμf_\mu = \partial f / \partial x^\mu and fμν=2f/xμxνf_{\mu\nu} = \partial^2 f / \partial x^\mu \partial x^\nu. The numerator involves second derivatives of the height, the rate at which the slope itself is changing. The denominator is the square of the metric determinant factor. Every piece of this formula is computable by an ant on the surface who can measure distances but has never seen the third dimension.

For spaces of higher dimension, KK is no longer sufficient; we need a richer object. The Riemann curvature tensor RλμνρR^\lambda{}_{\mu\nu\rho} is a rank-4 tensor derived from gμνg_{\mu\nu} and its second derivatives; it encodes the curvature in every pair of directions simultaneously. Successive compressions extract coarser information: the Ricci tensor Rμν=RλμλνR_{\mu\nu} = R^\lambda{}_{\mu\lambda\nu} contracts one pair of indices, leaving a symmetric matrix. The Ricci scalar R=gμνRμνR = g^{\mu\nu} R_{\mu\nu} contracts again, leaving a single number. (Here gμνg^{\mu\nu} denotes the matrix inverse of gμνg_{\mu\nu}, the “raised-index” version of the metric, satisfying gμλgλν=δνμg^{\mu\lambda} g_{\lambda\nu} = \delta^\mu_\nu.) Each compression loses detail but becomes more tractable. Einstein’s field equations, the equations governing general relativity, are written in terms of RμνR_{\mu\nu} and RR, with the metric gμνg_{\mu\nu} as the fundamental unknown. Hover any point on the surface above to read KK at that location.


Geodesics: what is a straight line?

Intuition

Aeroplanes don’t fly straight lines on a flat map, they fly great circles, the shortest path on a sphere. What looks like a curve on the map is actually the straightest route the surface allows. Geodesics are these “straightest possible paths” in curved space. In general relativity, free-falling objects, planets, light, anything not being pushed, follow geodesics through spacetime.

On a flat plane, the shortest path between two points is a straight line. On a curved surface, “straight” needs a new definition. A path that bends to follow the surface geometry, never turning left or right relative to the surface itself, is called a geodesic. It is the shortest path between two points, or more precisely, the path that locally minimises length.

Parameterise a path on MM as γ(t)\gamma(t), where tt runs from 0 to 1 and γ\gamma traces a curve on the surface. The velocity in coordinates is x˙μ=dxμ/dt\dot{x}^\mu = dx^\mu/dt. To find the shortest path we minimise the arc-length integral using the calculus of variations. The result is the geodesic equation:

x¨μ+Γνρμx˙νx˙ρ=0\ddot{x}^\mu + \Gamma^\mu_{\nu\rho} \, \dot{x}^\nu \dot{x}^\rho = 0

The term Γνρμ\Gamma^\mu_{\nu\rho}, pronounced “Christoffel symbol”, corrects for the curvature of the surface. It bends the path to follow the geometry rather than the coordinate grid. Think of it as the term that says: account for the fact that the coordinate basis vectors themselves are changing direction as you move across the curved surface. We will define Γνρμ\Gamma^\mu_{\nu\rho} precisely in the next section.

On the bump, a geodesic between two points on opposite sides does not always pass over the peak. The path over the top may be longer in the intrinsic metric gμνg_{\mu\nu} than an arc that curves around the side, even though the route over the peak looks shorter when you draw it on a flat map. This is the core lesson of geodesics: the shortest path is determined by the geometry, not by the coordinates. Click two points on the surface above to trace the geodesic between them.


Connections and covariant derivatives

On flat space, moving a vector from point AA to point BB is trivial: just slide it along without rotating it. On a curved surface, this breaks down. The tangent plane at AA, the flat plane that best approximates the surface there, is a different plane from the tangent plane at BB. There is no obvious way to compare a vector at AA with a vector at BB, because they live in different planes. We need a rule for transporting vectors between points in a way that is consistent with the geometry.

That rule is a connection. For a Riemannian manifold the natural choice is the Levi-Civita connection: the unique connection that is torsion-free (Γνρμ=Γρνμ\Gamma^\mu_{\nu\rho} = \Gamma^\mu_{\rho\nu}, so the order of the lower indices does not matter) and metric-compatible (g=0\nabla g = 0, meaning it preserves lengths and angles under transport). Both conditions together pin down a single connection. Its concrete expression, the Christoffel symbols, is:

Γνρμ=12gμλ(νgρλ+ρgνλλgνρ)\Gamma^\mu_{\nu\rho} = \frac{1}{2} \, g^{\mu\lambda} \left(\partial_\nu g_{\rho\lambda} + \partial_\rho g_{\nu\lambda} - \partial_\lambda g_{\nu\rho}\right)

This is the same Γ\Gamma that appeared in the geodesic equation. It was hiding in gμνg_{\mu\nu} all along: the connection is nothing more than the metric’s own derivatives, assembled in the right combination. Knowing gμνg_{\mu\nu} at every point is enough to recover Γνρμ\Gamma^\mu_{\nu\rho} everywhere.

Parallel transport is what happens when you carry a vector ww along a path γ\gamma using this connection, moving it without rotating it relative to the surface at each step. Mathematically, the transported vector satisfies:

dwμdt=Γνρμx˙νwρ\frac{dw^\mu}{dt} = -\Gamma^\mu_{\nu\rho} \, \dot{x}^\nu \, w^\rho

On a curved surface, something unexpected happens: if you parallel-transport a vector around a closed loop, it returns rotated. The rotation angle is called the holonomy of the loop, and it is directly proportional to the Gaussian curvature enclosed by the path. An ant living on the surface can discover exactly how much the space curves by carrying a gyroscope around a closed path and measuring the rotation when it returns home. The component above lets you draw a closed path; the arrow shows the transported vector and updates the holonomy angle when the loop closes.

The covariant derivative XY\nabla_X Y is the infinitesimal version of this idea: it measures how a vector field YY changes in the direction of XX, corrected for the connection. In coordinates:

(XY)μ=XννYμ+ΓνρμXνYρ(\nabla_X Y)^\mu = X^\nu \partial_\nu Y^\mu + \Gamma^\mu_{\nu\rho} \, X^\nu Y^\rho

The first term XννYμX^\nu \partial_\nu Y^\mu is the ordinary directional derivative. On a flat space that would be the whole story. On a curved surface it is not a tensor; if you change coordinates, it picks up spurious extra terms from the fact that the coordinate basis vectors are changing direction. The Γ\Gamma term exactly subtracts those spurious contributions, leaving a quantity that genuinely measures how YY changes relative to the geometry, independent of the coordinate choice.


A Riemannian manifold (M,gμν)(M, g_{\mu\nu}) is a smooth space MM equipped with a metric tensor gμνg_{\mu\nu}. The metric defines distances, angles, and volumes at every point. Its signature classifies the geometry, Riemannian or Lorentzian. From gμνg_{\mu\nu} alone you can derive the Riemann curvature tensor RλμνρR^\lambda{}_{\mu\nu\rho}, which measures how the space bends in every direction. Geodesics, the solutions to x¨μ+Γνρμx˙νx˙ρ=0\ddot{x}^\mu + \Gamma^\mu_{\nu\rho} \dot{x}^\nu \dot{x}^\rho = 0, are the shortest paths. The Levi-Civita connection Γνρμ\Gamma^\mu_{\nu\rho} encodes how to differentiate vector fields, carrying the geometry’s effect into calculus itself.

In plain language

  • Manifold — a surface that is curved globally but looks flat up close, like the Earth beneath your feet.
  • Metric tensor — a ruler that adjusts itself at every point so you always measure real distance, not just coordinate distance.
  • Signature — the sign pattern that tells you whether you are in ordinary curved space or spacetime, where time behaves differently from every spatial direction.
  • Curvature — a number (or tensor) that captures how much the surface bends: zero on a flat table, positive on a sphere, negative on a saddle.
  • Geodesics — the straightest paths available on a curved surface, the routes a planet, a photon, or an aeroplane traces when nothing is pushing it off course.
  • Connection — the rule for sliding a vector across a curved surface without rotating it, and the correction term that makes calculus work correctly in curved coordinates.
Bloch Sphere
|ψ⟩ = |0⟩
P(|0⟩) = cos²(θ/2)
1.000
P(|1⟩) = sin²(θ/2)
0.000
θ
φ

drag to orbit · sliders to move · click states in text to navigate