History of string theory
The history of string theory spans several decades of intense research including two superstring revolutions. Through the combined efforts of many researchers, string theory has developed into a broad and varied subject with connections to quantum gravity, particle and condensed matter physics, cosmology, and pure mathematics.
1943–1959: S-matrix theoryEdit
String theory is an outgrowth of S-matrix theory, a research program begun by Werner Heisenberg in 1943 (following John Archibald Wheeler's 1937 introduction of the S-matrix), picked up and advocated by many prominent theorists starting in the late 1950s and throughout the 1960s, which was discarded and marginalized in the mid 1970s to disappear by the 1980s. It was forgotten because some of its mathematical methods were alien, and because quantum chromodynamics supplanted it as an experimentally better qualified approach to the strong interactions.
The theory was a radical rethinking of the foundation of physical law. By the 1940s it was clear that the proton and the neutron were not pointlike particles like the electron. Their magnetic moment differed greatly from that of a pointlike spin-½ charged particle, too much to attribute the difference to a small perturbation. Their interactions were so strong that they scattered like a small sphere, not like a point. Heisenberg proposed that the strongly interacting particles were in fact extended objects, and because there are difficulties of principle with extended relativistic particles, he proposed that the notion of a space-time point broke down at nuclear scales.
Without space and time, it is difficult to formulate a physical theory. Heisenberg believed that the solution to this problem is to focus on the observable quantities—those things measurable by experiments. An experiment only sees a microscopic quantity if it can be transferred by a series of events to the classical devices that surround the experimental chamber. The objects that fly to infinity are stable particles, in quantum superpositions of different momentum states.
Heisenberg proposed that even when space and time are unreliable, the notion of momentum state, which is defined far away from the experimental chamber, still works. The physical quantity he proposed as fundamental is the quantum mechanical amplitude for a group of incoming particles to turn into a group of outgoing particles, and he did not admit that there were any steps in between.
The S-matrix is the quantity that describes how a collection of incoming particles turn into outgoing ones. Heisenberg proposed to study the S-matrix directly, without any assumptions about space-time structure. But when transitions from the far-past to the far-future occur in one step with no intermediate steps, it is difficult to calculate anything. In quantum field theory, the intermediate steps are the fluctuations of fields or equivalently the fluctuations of virtual particles. In this proposed S-matrix theory, there are no local quantities at all.
Heisenberg proposed to use unitarity to determine the S-matrix. In all conceivable situations, the sum of the squares of the amplitudes must be equal to 1. This property can determine the amplitude in a quantum field theory order by order in a perturbation series once the basic interactions are given, and in many quantum field theories the amplitudes grow too fast at high energies to make a unitary S-matrix. But without extra assumptions on the high-energy behavior, unitarity is not enough to determine the scattering, and the proposal was ignored for many years.
Heisenberg's proposal was reinvigorated in the 1956 when Murray Gell-Mann recognized that dispersion relations—like those discovered by Hendrik Kramers and Ralph Kronig in the 1920s (see Kramers–Kronig relations)—allow a notion of causality to be formulated, a notion that events in the future would not influence events in the past, even when the microscopic notion of past and future are not clearly defined. He also recognized that these relations might be useful in computing observables for the case of strong interaction physics. The dispersion relations were analytic properties of the S-matrix, and they imposed more stringent conditions than those that follow from unitarity alone. This development in S-matrix theory was based on Murray Gell-Mann and Marvin Leonard Goldberger's (1954) discovery of crossing symmetry, another condition that the S-matrix had to fulfil.
Prominent advocates of the new "dispersion relations" approach were Stanley Mandelstam and Geoffrey Chew, both at UC Berkeley at the time. Mandelstam discovered the double dispersion relations, a new and powerful analytic form, in 1958, and believed that it would be the key to progress in the intractable strong interactions.
1959–1968: Regge theory and bootstrap modelsEdit
By the late 1950s, many strongly interacting particles of ever higher spins had been discovered, and it became clear that they were not all fundamental. While Japanese physicist Shoichi Sakata proposed that the particles could be understood as bound states of just three of them (the proton, the neutron and the Lambda; see Sakata model), Geoffrey Chew believed that none of these particles are fundamental (for details, see Bootstrap model). Sakata's approach was reworked in the 1960s into the quark model by Murray Gell-Mann and George Zweig by making the charges of the hypothetical constituents fractional and rejecting the idea that they were observed particles. At the time, Chew's approach was considered more mainstream because it did not introduce fractional charge values and because it focused on experimentally measurable S-matrix elements, not on hypothetical pointlike constituents.
In 1959, Tullio Regge, a young theorist in Italy, discovered that bound states in quantum mechanics can be organized into families known as Regge trajectories, each family having distinctive angular momenta. This idea was generalized to relativistic quantum mechanics by Mandelstam, Vladimir Gribov and Marcel Froissart, using a mathematical method (the Sommerfeld–Watson representation) discovered decades earlier by Arnold Sommerfeld and Kenneth Marshall Watson: the result was dubbed the Froissart–Gribov formula.
In 1961, Geoffrey Chew and Steven Frautschi recognized that mesons had straight line Regge trajectories (in their scheme, spin is plotted against mass squared on a so-called Chew–Frautschi plot), which implied that the scattering of these particles would have very strange behavior—it should fall off exponentially quickly at large angles. With this realization, theorists hoped to construct a theory of composite particles on Regge trajectories, whose scattering amplitudes had the asymptotic form demanded by Regge theory.
In 1967, a notable step forward in the bootstrap approach was the principle of DHS duality introduced by Richard Dolen, David Horn, and Christoph Schmid in 1967, at Caltech (the original term for it was "average duality" or "finite energy sum rule (FESR) duality"). The three researchers noticed that Regge pole exchange (at high energy) and resonance (at low energy) descriptions offer multiple representations/approximations of one and the same physically observable process.
1968–1974: dual resonance modelEdit
The first model in which hadronic particles essentially follow the Regge trajectories was the dual resonance model that was constructed by Gabriele Veneziano in 1968, who noted that the Euler beta function could be used to describe 4-particle scattering amplitude data for such particles. The Veneziano scattering amplitude (or Veneziano model) was quickly generalized to an N-particle amplitude by Ziro Koba and Holger Bech Nielsen (their approach was dubbed the Koba–Nielsen formalism), and to what are now recognized as closed strings by Miguel Virasoro and Joel A. Shapiro (their approach was dubbed the Shapiro–Virasoro model).
In 1969–70, Yoichiro Nambu, Holger Bech Nielsen, and Leonard Susskind presented a physical interpretation of the Veneziano amplitude by representing nuclear forces as vibrating, one-dimensional strings. However, this string-based description of the strong force made many predictions that directly contradicted experimental findings.
In 1971, Pierre Ramond and, independently, John H. Schwarz and André Neveu attempted to implement fermions into the dual model. This led to the concept of "spinning strings", and pointed the way to a method for removing the problematic tachyon (see RNS formalism).
Dual resonance models for strong interactions were a relatively popular subject of study between 1968 and 1973. The scientific community lost interest in string theory as a theory of strong interactions in 1973 when quantum chromodynamics became the main focus of theoretical research (mainly due to the theoretical appeal of its asymptotic freedom).
1974–1984: bosonic string theory and superstring theoryEdit
In 1974, John H. Schwarz and Joel Scherk, and independently Tamiaki Yoneya, studied the boson-like patterns of string vibration and found that their properties exactly matched those of the graviton, the gravitational force's hypothetical messenger particle. Schwarz and Scherk argued that string theory had failed to catch on because physicists had underestimated its scope. This led to the development of bosonic string theory.
String theory is formulated in terms of the Polyakov action, which describes how strings move through space and time. Like springs, the strings tend to contract to minimize their potential energy, but conservation of energy prevents them from disappearing, and instead they oscillate. By applying the ideas of quantum mechanics to strings it is possible to deduce the different vibrational modes of strings, and that each vibrational state appears to be a different particle. The mass of each particle, and the fashion with which it can interact, are determined by the way the string vibrates—in essence, by the "note" the string "sounds." The scale of notes, each corresponding to a different kind of particle, is termed the "spectrum" of the theory.
Early models included both open strings, which have two distinct endpoints, and closed strings, where the endpoints are joined to make a complete loop. The two types of string behave in slightly different ways, yielding two spectra. Not all modern string theories use both types; some incorporate only the closed variety.
The earliest string model has several problems: it has a critical dimension D = 26, a feature that was originally discovered by Claud Lovelace in 1971; the theory has a fundamental instability, the presence of tachyons (see tachyon condensation); additionally, the spectrum of particles contains only bosons, particles like the photon that obey particular rules of behavior. While bosons are a critical ingredient of the Universe, they are not its only constituents. Investigating how a string theory may include fermions in its spectrum led to the invention of supersymmetry (in the West) in 1971, a mathematical transformation between bosons and fermions. String theories that include fermionic vibrations are now known as superstring theories.
In 1977, the GSO projection (named after Ferdinando Gliozzi, Joel Scherk, and David I. Olive) led to a family of tachyon-free unitary free string theories, the first consistent superstring theories (see below).
1984–1994: first superstring revolutionEdit
The first superstring revolution is a period of important discoveries that began in 1984. It was realized that string theory was capable of describing all elementary particles as well as the interactions between them. Hundreds of physicists started to work on string theory as the most promising idea to unify physical theories. The revolution was started by a discovery of anomaly cancellation in type I string theory via the Green–Schwarz mechanism (named after Michael Green and John H. Schwarz) in 1984. The ground-breaking discovery of the heterotic string was made by David Gross, Jeffrey Harvey, Emil Martinec, and Ryan Rohm in 1985. It was also realized by Philip Candelas, Gary Horowitz, Andrew Strominger, and Edward Witten in 1985 that to obtain supersymmetry, the six small extra dimensions (the D = 10 critical dimension of superstring theory had been originally discovered by John H. Schwarz in 1972) need to be compactified on a Calabi–Yau manifold. (In string theory, compactification is a generalization of Kaluza–Klein theory, which was first proposed in the 1920s.)
In 1987, Eric Bergshoeff, Ergin Sezgin and Paul Townsend showed that there are no superstrings in eleven dimensions (the largest number of dimensions consistent with a single graviton in supergravity theories), but supermembranes.
1994–2003: second superstring revolutionEdit
In the early 1990s, Edward Witten and others found strong evidence that the different superstring theories were different limits of an 11-dimensional theory that became known as M-theory (for details, see Introduction to M-theory). These discoveries sparked the second superstring revolution that took place approximately between 1994 and 1995.
The different versions of superstring theory were unified, as long hoped, by new equivalences. These are known as S-duality, T-duality, U-duality, mirror symmetry, and conifold transitions. The different theories of strings were also related to M-theory.
In 1995, Joseph Polchinski discovered that the theory requires the inclusion of higher-dimensional objects, called D-branes: these are the sources of electric and magnetic Ramond–Ramond fields that are required by string duality. D-branes added additional rich mathematical structure to the theory, and opened possibilities for constructing realistic cosmological models in the theory (for details, see Brane cosmology).
In 1997–98, Juan Maldacena conjectured a relationship between string theory and N = 4 supersymmetric Yang–Mills theory, a gauge theory. This conjecture, called the AdS/CFT correspondence, has generated a great deal of interest in high energy physics. It is a realization of the holographic principle, which has far-reaching implications: the AdS/CFT correspondence has helped elucidate the mysteries of black holes suggested by Stephen Hawking's work and is believed to provide a resolution of the black hole information paradox.
In 2003, Michael R. Douglas's discovery of the string theory landscape, which suggests that string theory has a large number of inequivalent false vacua, led to much discussion of what string theory might eventually be expected to predict, and how cosmology can be incorporated into the theory.
- Rickles 2014, p. 28 n. 17: "S-matrix theory had enough time to spawn string theory".
- Heisenberg, W. (1943). "Die „beobachtbaren Größen" in der Theorie der Elementarteilchen". Zeitschrift für Physik. 120 (7): 513–538. Bibcode:1943ZPhy..120..513H. doi:10.1007/bf01329800.
- Wheeler, John Archibald (1937). "On the Mathematical Description of Light Nuclei by the Method of Resonating Group Structure". Phys. Rev. 52 (11): 1107–1122. Bibcode:1937PhRv...52.1107W. doi:10.1103/physrev.52.1107.
- Rickles 2014, p. 113: "An unfortunate (for string theory) series of events terminated the growing popularity that string theory was enjoying in the early 1970s."
- Rickles 2014, p. 4.
- Gell-Mann, M. G. (1956). "Dispersion relations in pion-pion and photon-nucleon scattering." In J. Ballam, et al. (eds.), High energy nuclear physics, in: Proceedings of the Sixth Annual Rochester Conference Rochester: New York, USA, April 3-7, 1956 (pp. 30–6). New York: Interscience Publishers.
- Rickles 2014, p. 29.
- Gell-Mann, M., and Goldberger, M. L. (1954). "The scattering of low energy photons by particles of spin 1/2." Physical Review, 96, 1433–8.
- Mandelstam, S. (1958). "Determination of the pion-nucleon scattering amplitude from dispersion relations and unitarity general theory". Physical Review. 112 (4): 1344–1360. Bibcode:1958PhRv..112.1344M. doi:10.1103/physrev.112.1344.
- Chew, G. F.; Goldberger, M. L.; Low, F. E.; Nambu, Y. (1957). "Application of dispersion relations to low energy meson-nucleon scattering". Physical Review. 106 (6): 1337–1344. Bibcode:1957PhRv..106.1337C. doi:10.1103/physrev.106.1337.
- Sakata, S. (1956). "On a composite model for the new particles". Progress of Theoretical Physics. 16 (6): 686–688. Bibcode:1956PThPh..16..686S. doi: .
- Chew, G. (1962). S-Matrix theory of strong interactions. New York: W.A. Benjamin, p. 32.
- Kaiser, D (2002). "Nuclear democracy: Political engagement, pedagogical reform, and particle physics in postwar America". Isis. 93 (2): 229–268. doi:10.1086/344960.
- Regge, Tullio, "Introduction to complex angular momentum," Il Nuovo Cimento Series 10, Vol. 14, 1959, p. 951.
- White, Alan. R. (2000). "The Past and Future of S-Matrix Theory".
- Chew, Geoffrey; Frautschi, S. (1961). "Principle of Equivalence for all Strongly Interacting Particles within the S-Matrix Framework". Physical Review Letters. 7 (10): 394–397. Bibcode:1961PhRvL...7..394C. doi:10.1103/PhysRevLett.7.394.
- Dolen, R.; Horn, D.; Schmid, C. (1967). "Prediction of Regge-parameters of rho poles from low-energy pi-N scattering data". Physical Review Letters. 19 (7): 402–407. Bibcode:1967PhRvL..19..402D. doi:10.1103/physrevlett.19.402.
- Rickles 2014, pp. 38–9.
- Veneziano, G (1968). "Construction of a crossing-symmetric, Reggeon-behaved amplitude for linearly rising trajectories". Il Nuovo Cimento A. 57: 190–197. Bibcode:1968NCimA..57..190V. doi:10.1007/BF02824451.
- Koba, Z.; Nielsen, H. (1969). "Reaction amplitude for N-mesons: A generalization of the Veneziano-Bardakçi-Ruegg-Virasoro model". Nuclear Physics B. 10 (4): 633–655. Bibcode:1969NuPhB..10..633K. doi:10.1016/0550-3213(69)90331-9.
- Virasoro, M (1969). "Alternative constructions of crossing-symmetric amplitudes with Regge behavior". Physical Review. 177 (5): 2309–2311. Bibcode:1969PhRv..177.2309V. doi:10.1103/physrev.177.2309.
- Shapiro, J. A. (1970). "Electrostatic analogue for the Virasoro model". Physics Letters B. 33 (5): 361–362. Bibcode:1970PhLB...33..361S. doi:10.1016/0370-2693(70)90255-8.
- Chan, H. M.; Paton, J. E. (1969). "Generalized Veneziano Model with Isospin". Nucl. Phys. B. 10: 516. Bibcode:1969NuPhB..10..516P. doi:10.1016/0550-3213(69)90038-8.
- Rickles 2014, p. 5.
- Nambu, Y. (1970). "Quark model and the factorization of the Veneziano amplitude." In R. Chand (ed.), Symmetries and Quark Models: Proceedings of the International Conference held at Wayne State University, Detroit, Michigan, June 18–20, 1969 (pp. 269–277). Singapore: World Scientific.
- Nielsen, H. B. "An almost physical interpretation of the dual N point function." Nordita preprint (1969); unpublished.
- Susskind, L (1969). "Harmonic oscillator analogy for the Veneziano amplitude". Physical Review Letters. 23 (10): 545–547. Bibcode:1969PhRvL..23..545S. doi:10.1103/physrevlett.23.545.
- Susskind, L (1970). "Structure of hadrons implied by duality". Physical Review D. 1 (4): 1182–1186. Bibcode:1970PhRvD...1.1182S. doi:10.1103/physrevd.1.1182.
- Ramond, P. (1971). "Dual Theory for Free Fermions". Phys. Rev. D. 3: 2415. Bibcode:1971PhRvD...3.2415R. doi:10.1103/PhysRevD.3.2415.
- Neveu, A.; Schwarz, J. (1971). "Tachyon-free dual model with a positive-intercept trajectory". Physics Letters. 34B (6): 517–518. Bibcode:1971PhLB...34..517N. doi:10.1016/0370-2693(71)90669-1.
- Rickles 2014, p. 97.
- Rickles 2014, pp. 5–6, 44.
- Rickles 2014, p. 77.
- Rickles 2014, p. 11 n. 22.
- Scherk, J.; Schwarz, J. (1974). "Dual models for non-hadrons". Nuclear Physics B. 81 (1): 118–144. Bibcode:1974NuPhB..81..118S. doi:10.1016/0550-3213(74)90010-8.
- Yoneya, T. (1974). "Connection of dual models to electrodynamics and gravidynamics". Progress of Theoretical Physics. 51 (6): 1907–1920. Bibcode:1974PThPh..51.1907Y. doi:10.1143/ptp.51.1907.
- Zwiebach, Barton (2009). A First Course in String Theory. Cambridge University Press. p. 582.
- Lovelace, Claud (1971), "Pomeron form factors and dual Regge cuts", Physics Letters B, 34 (6): 500–506, Bibcode:1971PhLB...34..500L, doi:10.1016/0370-2693(71)90665-4.
- Sakata, Fumihiko; Wu, Ke; Zhao, En-Guang (eds.), Frontiers of Theoretical Physics: A General View of Theoretical Physics at the Crossing of Centuries, World Scientific, 2001, p. 121.
- Rickles 2014, p. 104.
- J. L. Gervais and B. Sakita worked on the two-dimensional case in which they use the concept of "supergauge," taken from Ramond, Neveu, and Schwarz's work on dual models: Gervais, J.-L.; Sakita, B. (1971). "Field theory interpretation of supergauges in dual models". Nuclear Physics B. 34 (2): 632–639. Bibcode:1971NuPhB..34..632G. doi:10.1016/0550-3213(71)90351-8.
- Gliozzi, F.; Scherk, J.; Olive, D. I. (1977). "Supersymmetry, Supergravity Theories and the Dual Spinor Model". Nucl. Phys. B. 122 (2): 253. Bibcode:1977NuPhB.122..253G. doi:10.1016/0550-3213(77)90206-1.
- Rickles 2014, p. 147: "Green and Schwarz's anomaly cancellation paper triggered a very large increase in the production of papers on the subject, including a related pair of papers that between them had the potential to provide the foundation for a realistic unified theory of both particle physics and gravity."
- Rickles 2014, p. 157.
- Green, M. B.; Schwarz, J. H. (1984). "Anomaly cancellations in supersymmetric D = 10 gauge theory and superstring theory". Physics Letters B. 149: 117. Bibcode:1984PhLB..149..117G. doi:10.1016/0370-2693(84)91565-X.
- Johnson, Clifford V. D-branes. Cambridge University Press. 2006, pp. 169–70.
- Gross, D. J.; Harvey, J. A.; Martinec, E.; Rohm, R. (1985). "Heterotic string". Physical Review Letters. 54 (6): 502–505. Bibcode:1985PhRvL..54..502G. doi:10.1103/physrevlett.54.502. PMID 10031535.
- Schwarz, J. H. (1972). "Physical states and pomeron poles in the dual pion model". Nuclear Physics B. 46 (1): 61–74. Bibcode:1972NuPhB..46...61S. doi:10.1016/0550-3213(72)90201-5.
- Candelas, P.; Horowitz, G.; Strominger, A.; Witten, E. (1985). "Vacuum configurations for superstrings". Nuclear Physics B. 258: 46–74. Bibcode:1985NuPhB.258...46C. doi:10.1016/0550-3213(85)90602-9.
- Rickles 2014, p. 89 n. 44.
- Green, M. B., Schwarz, J. H. (1982). "Supersymmetrical string theories." Physics Letters B, 109, 444–448 (this paper classified the consistent ten-dimensional superstring theories and gave them the names Type I, Type IIA, and Type IIB).
- This was demonstrated in Nahm, Werner, "Supersymmetries and their representations". Nuclear Physics B 135 no 1 (1978) pp 149-166, doi:10.1016/0550-3213(78)90218-3
- E. Bergshoeff, E. Sezgin, P. K. Townsend, "Supermembranes and Eleven-Dimensional Supergravity," Phys. Lett. B 189: 75 (1987).
- It was Edward Witten who observed that the theory must be a 11-dimensional one in Witten, Edward (1995). "String theory dynamics in various dimensions". Nuclear Physics B. 443 (1): 85–126. arXiv: . Bibcode:1995NuPhB.443...85W. doi:10.1016/0550-3213(95)00158-O.
- Duff, Michael (1998). "The theory formerly known as strings". Scientific American. 278 (2): 64–9. Bibcode:1998SciAm.278b..64D. doi:10.1038/scientificamerican0298-64.
- When Witten named it M-theory, he did not specify what the "M" stood for, presumably because he did not feel he had the right to name a theory he had not been able to fully describe. The "M" sometimes is said to stand for Mystery, or Magic, or Mother. More serious suggestions include Matrix or Membrane. Sheldon Glashow has noted that the "M" might be an upside down "W", standing for Witten. Others have suggested that the "M" in M-theory should stand for Missing, Monstrous or even Murky. According to Witten himself, as quoted in the PBS documentary based on Brian Greene's The Elegant Universe, the "M" in M-theory stands for "magic, mystery, or matrix according to taste."
- Rickles 2014, p. 208 n. 2.
- Polchinski, J (1995). "Dirichlet branes and Ramond-Ramond charges". Physical Review D. 50 (10): R6041–R6045. arXiv: . Bibcode:1995PhRvL..75.4724P. doi:10.1103/PhysRevLett.75.4724.
- Rickles 2014, p. 212.
- Maldacena, Juan (1998). "The Large N limit of superconformal field theories and supergravity". Advances in Theoretical and Mathematical Physics. 2 (4): 231–252. arXiv: . Bibcode:1998AdTMP...2..231M. doi:10.1023/A:1026654312961.
- Rickles 2014, p. 207.
- Rickles 2014, p. 222.
- Maldacena, Juan (2005). "The Illusion of Gravity" (PDF). Scientific American. 293 (5): 56–63. Bibcode:2005SciAm.293e..56M. doi:10.1038/scientificamerican1105-56. PMID 16318027. Archived from the original (PDF) on 2013-11-10. (p. 63.)
- Douglas, Michael R., "The statistics of string / M theory vacua", JHEP 0305, 46 (2003). arXiv:hep-th/0303194
- The most commonly quoted number is of the order 10500. See: Ashok S., Douglas, M., "Counting flux vacua", JHEP 0401, 060 (2004).
- Rickles 2014, pp. 230–5 and 236 n. 63.
- Paul Frampton (1974). Dual Resonance Models. Frontiers in Physics, W. A. Benjamin. ISBN 978-0-8053-2581-2.
- Joel A. Shapiro (2007). "Reminiscence on the Birth of String Theory". arXiv: [hep-th].
- John H. Schwarz (2012). "The Early History of String Theory and Supersymmetry". arXiv: [physics.hist-ph].
- Andrea Cappelli; Elena Castellani; Filippo Colomo; Paolo Di Vecchia (2012). The Birth of String Theory. Cambridge University Press. ISBN 978-0-521-19790-8.