Quantum Structures and the Nature of Reality

Diederik Aerts

Quantum Structures and the Nature of Reality

Diederik Aerts

1999, Springer eBooks

visibility

…

description

251 pages

link

1 file

QUANTUM STRUCTURES AND THE NATURE OF REALITY EINSTEIN MEETS MAGRITTE: An Interdisciplinary Reflection on Science, Nature, Art, Human Action and Society Series Editor Diederik Aerts, Center Leo Apostel, Vrije Universiteit Brussel, Belgium Volume 1 Einstein Meets Magritte: An Interdisciplinary Reflection The White Book of 'Einstein Meets Magritte' Edited by DiederikAerts, Jan Broekaert and Ernest Mathijs Volume 2 Science and Art The Red Book of 'Einstein Meets Magritte' Edited by Diederik Aerts, Ernest Mathijs and Bert Mosselmans Volume 3 Science. Technology. and Social Change The Orange Book of 'Einstein Meets Magritte' Edited by DiederikAerts, Serge Gutwirth, Sonja Smets and Luk Van Langenhove Volume 4 World Views and the Problem of Synthesis The Yellow Book of 'Einstein Meets Magritte' Edited by Diederik Aerts, Hubert Van Belle and Jan Van der Veken Volume 5 A World in Transition; Humankind and Nature The Green Book of 'Einstein Meets Magritte' Edited by DiederikAerts, Jan Broekaert and Willy Weyns Volume 6 Metadebates on Science The Blue Book of 'Einstein Meets Magritte' Edited by Gustaaf C. Cornelis, Sonja Smets, Jean Paul Van Bendegem Volume 7 Quantum Structures and the Nature of Reality The Indigo Book of 'Einstein Meets Magritte' Edited by Diederik Aerts and Jarosl'aw Pykacz Volume 8 The Evolution of Complexity The Violet Book of 'Einstein Meets Magritte' Edited by Francis Heylighen, J ohan Bollen and Alexander Riegler VOLUME 7 Quantum Structures and the Nature of Reality The Indigo Book of "Einstein Meets Magritte" Edited by DiederikAerts and Jaroslaw Pykacz Brussels Free University, University of Gdansk VUB UNIVERSITY PRE S S VRIJE UNIVERSITEIT BRUSSEL BELGIUM SPRINGER-SCIENCE+BUSINESS MEDIA, B.V. Library of congress Cataloging-in-Publication Data ISBN 978-90-481-5243-8 DOI 10.1007/978-94-017-2834-8 ISBN 978-94-017-2834-8 (eBook) Sold and distributed in Belgium by VUB University Press, Waversteenweg 1077, B-l160 Brussels, Belgium Printed an acid-free paper AlI Rights Reserved © 1999 Springer Science+Business Media Dordrecht Originally published by Kluwer Academic Publishers in 1999 No part of the material protected by this copyright notice may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording or by any information storage and retrieval system, without written permis sion from the copyright owner. Table of contents General Introduction Diederik Aerts vii Editorial Introduction: Quantum Structures and the Nature of Reality Diederik Aerts and J aroslaw Pykacz xv 1. A Half-Century of Quantum Logic. What Have We Learned? D.J. Foulis 1 2. Quantum Mechanical Measurements S. Gudder 37 3. From Logic to Physics: The Logico-Algebraic Foundations of Quantum Theory G. Cattaneo and F. Laudisa 53 4. Non-Classical Logics, Non-Classical Sets, and Non-Classical Physics J. Pykacz 67 5. Against "Paradoxes": A New Quantum Philosophy for Quantum Mechanics C. Garola 103 6. Quantum Mechanics: Structures, Axioms and Paradoxes D. Aerts 141 7. Orthogonality Relations: From Classical to Quantum T. Durt 207 8. Quantum Logical Semantics, Historical Truths and Interpretations in Art M.L. Dalla Chiara and R. Giuntini 225 Index 235 v DIEDERIK AERTS THE GENERAL INTRODUCTION OF EINSTEIN MEETS MAGRITTE The series of books 'Einstein meets Magritte' presented here originates from an international interdisciplinary conference with the same title, which took place in Brussels in spring 1995. On the eve of the third millennium, we assembled scientists and artists to reflect together on the deep nature of reality and the knowledge and skill humankind has gathered in this field. We had decided to call this meeting 'Einstein meets Magritte' because we believed that meaningful keys could be found at the place where the two meet. It is the way of the world that has made Einstein and Magritte into icons of our culture. The purpose of the conference was to reflect and debate without fear on the most profound and timeless questions. On one of those evenings, when the talks and discussions were long and exhausting and the press were doing all they could to get Albert Einstein and Rene Magritte in front of the microphones and cameras, a few of my most loyal aides and myself succeeded in getting them safely and quietly to a taxi, which then carried us off into the Brussels night. We got out at Manneken Pis, since that was on Einstein's list, and we concealed ourselves among the many tourists who were coming and going, expressing their wonder in every language under the sun at the famous little statue. And one of us was taking pictures: Einstein and Magritte leaning against the railings, with us beside them, and one more, arm in arm, and then another in case the first was no good, when suddenly I felt a heavy slap on my shoulder: "How you doing, mate?" It was Jacky and his inseparable girlfriends Nicole and Sylvie, and everyone embraced everyone else. I introduced Albert and Rene, and interest was immediately shown, and I had my heart in my mouth, because Jacky was a painter, poet and urban philosopher. We walked together through the alleys of Brussels in dismal Belgian rain, over cobblestones that glistened in the street lamps. When we had provided for the inner man with 'Rabbit in Beer' and 'Mussels with Fries', and finally a 'Dame Blanche' topped with warm chocolate sauce as apotheosis, Jacky enticed us to his house in the Rue Haute where we threw ourselves into deep, soft armchairs. Albert and Rene were offered the best places and as always Jacky told the story VB © 1999 Kluwer Academic Publishers. viii DIEDERIK AERTS of his life and discussed his rightness, as he did repeatedly, with a confidence and suppleness that distinguished him so sharply from modern science. Albert listened enthralled and Rene was fascinated, and once more my heart was in my mouth, but Nicole winked reassuringly, and Sylvie brought us snacks on cushions of Brussels lace and sweet white wine in tall, old-fashioned crystal glasses. The topic of discussion for the evening turned out to be 'the doubts of modern science'. In science there is not a single hypothesis for which one cannot find two groups of hard-working scientists, one of which can 'prove' the hypothesis while the other can 'prove' its negation. And the more fundamental and important the question is, the more clearly the situation turns out like this. "It's crazy," maintained Jacky, "In fact science states that one doesn't know anything anymore." "That's right," said Albert, "Truth is not a simple concept, and I believe that the history of science makes it clear how often erroneous hypotheses have been believed over the centuries." "A good thing too," replied Rene, "Things can only happen as a result of the movement brought about by that constant doubt." Meanwhile Sylvie came to join us and handed round pictures of the exhibitions of Jacky's paintings and poems. Jacky suddenly got very excited, as if something had inspired him, and he leapt up and vanished into his studio. A few minutes later he returned with his palette and brush poised. Before I could stop him he had started painting violently right at the spot where Albert and Rene were sitting. A large, gossamerthin piece of Brussels lace gradually took shape and Albert and Rene vanished. Fortunately, my young assistants, Jacky's girlfriends and myself got away with just a few vicious daubs of paint in the face. The series of eight volumes introduced here are not just the results of the conference, as would be the case with a record of the proceedings. The authors were invited to write with the events at the conference in the back of their mind, so that the books would form a second phase in the process of thought set in motion at the conference. A second ph,ase/ more clearly crystallised than the self-organising forum that arose during the conference, but one which focuses on the same timeless questions and problems. The whole ensemble was already streamlined at the conference into a number of main topics named after the colours of the rainbow - red, orange, yellow, green, blue, indigo and violet, as well as white, the synthesis of all colours. This order was maintained and led to eight separate books in the series. EINSTEIN MEETS MAGRITTE ix Volume 1: Einstein meets Magritte: an Interdisciplinary Reflection The White Book of Einstein meets Magritte The white book contains more fully developed versions of the contributions made by the keynote speakers at the conference. So this white book covers various scientific topics. In his article, 'Basically, it's purely academic', John Ziman asks himself what 'basic research' really is in today's world. In his contribution, 'The manifest image and the scientific image', Bas Van Fraassen analyses the considerable differences between the theoretical scientific description of the world and the way it appears to us. He argues that most formulations of this problem may themselves be tendentious metaphysics, full of false contrasts, and that insistence on a radical separation between science and what we have apart from science, and on the impossibility of accommodating science without surrender, may be a way of either idolising or demonising science rather than understanding it. In the 'Microdynamics of incommensurability: philosophy of science meets science studies', Barbara Herrnstein-Smith examines the bemusing but instructive logical, rhetorical and cognitive dynamics of contemporary theoretical controversy about science. In his contribution 'Subjects, objects, data and values', Robert Pirsig proposes a radical integration of science and value that does no harm to either. It is argued that values can exist as a part of scientific data, but outside any subject or object. This argument opens a door to a 'metaphysics of value' that provides a fundamentally different but not unscientific way of understanding the world. Ilya Prigogine discusses in 'Einstein and Magritte: a study of creativity', the global transformation of a classical science which was based on certainties into a new science that takes possibilities as its basic concepts. Constantin Piron demonstrates in his contribution 'Quanta and relativity: two failed revolutions' that none of the two great revolutions in physics, quantum mechanics and relativity theory, have actually been digested by the physics community. He claims that the vast majority of physicists still cling to the idea of a non-existent void full of little particles, in the spirit of Leibniz or Descartes. Rom Harre reflects on the significance of the theory of relativity. In his article 'The redundancy of spacetime: relativity from Cusa to Einstein', he defends the hypothesis that relativity theory is best interpreted as a grammar for coordinating narratives told by different observers. In his contribution 'The stuff the world is made of: physics and reality', Diederik Aerts analyses the consequences of the recent advances in quantum mechanics, theoretically as well as experimentally, for the nature of reality. He analyses the deep conceptual paradoxes in the light of these recent data and tries to picture a coherent model of the world. In his contribution 'Da- x DIED ERIK AERTS sein's brain: phenomenology meets cognitive science', Francisco Varela puts forward the hypothesis that the relation between brain processes and living human experience is the really hard problem of consciousness. He argues that science needs to be complemented by a deep scientific investigation of experience itself to move this major question beyond the sterile oppositions of dualism and reductionism. In his contribution 'What creativity in art and science tell us about how the brain must work' William Calvin defends the prospects for a mental Darwinism that operates on the milliseconds to minutes time scale, forming novel ideas and sentences never previously expressed. Adolf Griinbaum in his article 'The hermeneutic versus the scientific conception of psychoanalysis: an unsuccessful effort to chart a via media for the human sciences' argues that the so called 'hermeneutic' reconstruction of psychoanalytic theory and therapy proposed by Karl Jaspers, Paul Ricoeur and Jiirgen Habermas fails both as a channel and as alleged prototype for the study of human nature. In his article 'Immortality, biology and computers', Zygmunt Bauman analyses the shift that post modern society has provoked regarding the concept of immortality. He points out that strategies of collective and individual immortality have shifted from the modern deconstruction of death to a postmodern deconstruction of immortality, and points out that the possible consequences of this process need to be taken into consideration. Brian Arthur, in his article 'The end of certainty in economics', points out that our economy is very non-classical, meaning that it is based on essentially self-referential systems of beliefs about future economic conditions. He argues that our economy is inherently complex, subjective, ever-changing, and to an unavoidable degree ill-defined. Volume 2: Science and Art The Red Book of Einstein meets M agritte And then Magritte comes in. Many obvious differences exist between science and art. But the Science and Art volume of this series addresses not only these differences but also the possibilities of crossing several of the gaps between science and art. Several contributions deal with sociological and philosophical elaborations of the similarities and differences between science and art, while others approach science from an artistic point of view and art from a scientific point of view. The volume also considers several approaches that attempt to go beyond the classical dichotomy between the two activities. In a special section, attention is paid to the particular role played by perception in both science and art as a regulator of human understanding. Together, these contributions strive for an intensive interaction between science and art, and to a con- EINSTEIN MEETS MAGRITTE Xl sideration of them as converging rather than diverging. It is to be hoped that both science and art will benefit from this attempt. Volume 3: Science, Technology and Social Change The Orange Book of Einstein meets Magritte The major subject of the orange book is that society as a whole is changing, due to changes in technology, economy and the changing strategies and discourses of social scientists. The collected articles in the orange stream discuss a range of specific societal problems related to the subject of social change, the topics of the articles range from the scale of for instance sociology of health and psychohistory to more specific social problems like for instance anorexia nervosa, art academies and the information superhighway. Although the authors approach different subject matters from dissimilar perspectives and work with various methods, all the papers are related to the theme of science, technology and social change. In the orange book the reader will find a lot of arguments and hints pertaining to questions like: To what exactly will this social change lead in the 21st century? What kind of society lies ahead? She/he will be confronted to a plethora of enriching conceptions of the relationships between social sciences and social changes. Volume 4: World Views and the Problem of Synthesis The Yellow Book of Einstein meets M agritte A rapidly evolving world is seen to entail ideological, social, political, cultural and scientific fragmentation. Many cultures, subcultures and cultural fragments state their views assertively, while science progresses in increasingly narrowly defined areas of inquiry, widening not only the chasm between specialists and the layman, but also preventing specialists from having an overall view of their discipline. What are the motive forces behind this process of fragmentation, what are its effects? Are they truly inhospitable to the idea of synthesis, or do they call out, more urgently than ever before, for new forms of synthesis? What conditions would have to be met by contemporary synthesis? These and related questions will be addressed in the yellow book of Einstein meets Magritte. Volume 5: A World in Transition; Humankind and Nature The Green Book of Einstein meets Magritte IA World in Transition. Humankind and Nature' is appropriately entitled after its aim for an intrinsic property of reality: change. Of major concern, in this era of transformation, is the extensive and profound interaction of humankind with nature. The global scaled, social and technological project of humankind definitely involves a myriad of changes of xii DIEDERIK AERTS the ecosphere. This book develops, from the call for an interdisciplinary synthesis and respect for plurality, acknowledging the evolving scientific truth, the need for an integrated but inevitably provisional world view. Contributors from different parts of the world focus on four modes of change: i) Social change and the individual condition, ii) Complex evolution and fundamental emergent transformations, iii) Ecological transformation and responsibility inquiries, iv) The economic-ecological and socio-technical equilibria. Primarily reflecting on the deep transformations of humankind and on the relationship between humans and nature it addresses major points of contemporary concern. Volume 6: Metadebates The Blue Book of Einstein meets Magritte This book provides a meta-disciplinary reflection on science, nature, human action and society. It pertains to a dialogue between scientists, sociologists of science, historians and philosophers of science. It covers several topics: (1) the relation between science and philosophy, (2) new approaches to cognitive science, (3) reflections on classical thinking and contemporary science, (4) empirical epistemology, (5) epistemology of quantum mechanics. Indeed, quantum mechanics is a discipline which deserves and receives special attention here, for it still is a fascinating and intriguing discipline from a historiographical and philosophical point of view. This book does not only contain articles on a general level, it also provides new insights and bold, even provocative theories on the meta-level. That way, the reader gets acquainted with 'science in the making', sitting in the front row. Volume 7: Quantum Structures and the Nature of Reality The Indigo Book of Einstein meets M agritte This book refers to the satellite symposium that was organised by the International Quantum Structure Association (IQSA) at Einstein meets Magritte. The IQSA is a society for the advancement and dissemination of theories about structures based on quantum mechanics in their physical, mathematical, philosophical, applied and interdisciplinary aspects. The book contains several contributions presenting different fields of research in quantum structures. A great effort has been made to present some of the more technical aspects of quantum structures for a wide audience. Some parts of the articles are explanatory, sketching the historical development of research into quantum structures, while other parts make an effort to analyse the way the study of quantum structures has contributed to an understanding of the nature of our reality. EINSTEIN MEETS MAGRITTE XIll Volume 8: The Evolution of Complexity The Indigo Book of Einstein meets Magritte The violet book collects the contributions that consider theories of evolution and self-organisation, on the one hand, and systems theory and cybernetics, on the other hand. Both can add to the development of an integrated world view. The basic idea is that evolution leads to the spontaneous emergence of systems of higher and higher complexity or "intelligence": from elementary particles, via atoms, molecules, living cells, multicellular organisms, plants, and animals to human beings, culture and society. This perspective makes it possible to unify knowledge from presently separate disciplines: physics, chemistry, biology, psychology, sociology, etc. The volume thus wishes to revive the transdisciplinary tradition of general systems theory by integrating the recently developed insights of the "complex adaptive systems" approach, pioneered among others by the Santa Fe Institute. Even these books only signify a single phase in the ever-recurring process of thought and creation regarding the basic questions on the reality that surrounds us and our place in it. Brussels, July 17, 1998. D. AERTS & J. PYKACZ QUANTUM STRUCTURES AND THE NATURE OF REALITY This book grew out of the Satellite Symposium "Quantum Structures and the Nature of Reality" organised by the International Quantum Structures Association (IQSA) during the conference "Einstein Meets Magritte: an Interdisciplinary Reflection on Science, Nature, Art, Human Action, and Society" which was organised in Brussels at the Vrije Universiteit Brussel from May 29 to June 3, 1995. The purpose of the Symposium (and of the present book) was to acquaint the possibly widest audience consisting of people interested in foundations of quantum physics but not necessarily physically or even mathematically experienced with the variety of subjects considered within the IQSA, aims, problems, and methods used to solve them. The International Quantum Structures Association was established in 1991 and it gathers researchers interested in studying various aspects of logico-algebraic structures encountered in the very foundations of quantum physics. Quantum structures (formerly called quantum logics) are situated on the Map of Science in that place where quantum physics (the foundations of quantum mechanics), mathematics (mathematical logic, abstract algebra, theory of ordered structures, measure theory, probability theory, and fuzzy set theory), logic (many-valued, modal, intuitionistic, and paraconsistent), and philosophy (esp. philosophy of science) meet. Therefore, activities of the members of the IQSA are really interdisciplinary, which was one more reason for including the IQSA Satellite Symposium within the scope of the Interdisciplinary Conference "Einstein meets Magritte" . Although the present book consists of papers written independently by various authors which unavoidably leads to some repetitions, we tried to organise it in such a way that it could serve as a possibly self-contained introduction to the theory of quantum structures oriented towards an inexperienced reader. Therefore, the book begins with the review paper by D. Faulis (University of Massachusetts, Amherst, USA) A HalfCentury of Quantum Logics- What Have We Learned, which in its first part contains a brief exposition of the historical development of quantum structures from their prehistory traced back to Leibniz and Boole till the most recent papers concerning effect algebras, with the exclusion, however, of the "parallel stream" within the theory of quantum strucxv © 1999 Kluwer Academic Publishers. D. AERTS & J. PYKACZ XVI tures connected with many-valued logics and fuzzy setsl. In the main part of his paper Foulis illustrates practically all fundamental notions encountered within the theory of quantum structures, and relations between them on the very elementary example of a firefly in the box with windows using only the simplest mathematical tools, i.e., sets and functions. Almost an equally elementary example is used in the next paper Quantum Mechanical Measurements by S. Gudder (University of Denver, USA) to illustrate the notion of an effect algebra. Studying effect algebras became nowadays popular within the IQSA since these structures are more general, therefore, also more "flexible" than traditional "quantum logics", i.e., orthomodular partially ordered sets or lattices. Their introduction allowed a lot of "fresh air" to come into the theory of quantum structures2 , which, after the period of a rapid development in the sixties and the seventies, in the eighties was seen by many as a "decaying" theory without much future. The paper Fmm Logic to Physics: The Logico-Algebmic Foundations of Quantum Theory by G. Cattaneo and F. Laudisa (Universita di Milano and Universita di Firenze, Italy) also contains a brief description of the historical development of quantum logic ideas. In particular, the reader will find here a more detailed outline of the seminal Birkhoff and von Neumann's paper The Logic of Quantum Mechanics generally regarded as a cornerstone of this branch of Science. The Cattaneo and Laudisa paper is finished with the notion of an effect introduced (already encountered in the paper by S. Gudder), however, with the aid of Zadeh's idea of a fuzzy set, i.e., a set that admits gradual rather than abrupt transition from membership to non-membership. Fuzzy set theory is as closely related to Lukasiewicz infinite-valued logic as traditional set theory is related to the classical, two-valued logic. These relations are exposed in the paper Non-Classical Logics, NonClassical Sets, and Non-Classical Physics by J. Pykacz (Uniwersytet Gdanski, Gdansk, Poland) in which it is also shown how these two "nonclassical" branches of mathematics can be utilised in the "non-classical" branch of physics, i.e., in quantum mechanics. Also this paper contains a brief survey of the historical development of all "non-classical" theories that it deals with. 1 2 The brief history of this "stream" is contained in the paper by Pykacz. and also was one of the reasons of changing the name of this branch of Science from "theory of quantum logics" to the more general "theory of quantum structures" . QU ANTUM STRUCTURES AND REALITY xvii The paper by C. Garola (Universita di Lecce, Italy) Against "Paradoxes": A New Quantum Philosophy for Quantum Mechanics, shows that changing the basic philosophical premises (in this case abandoning the verificationist theory of truth in favour of the classical (Tarskian) theory of truth and introducing a new conception of physical laws) implies changes in the interpretation of quantum theory and provides new solutions to some old "paradoxes" encountered in the very foundations of quantum physics. The paper by D. Aerts (Brussels Free University, Belgium) Quantum Mechanics: Structures, Axioms and Paradoxes, gives an overview of some of the results in quantum structures research that have been obtained in the Brussels group. Working within a realist approach to quantum mechanics-following the inspiration that was outlined in the Geneva approach-the Brussels group tries to elaborate a generalised quantum mechanics, where some of the basic axioms of standard quantum mechanics are abandoned (e.g., the superposition principle). The reason for this generalisation is standard quantum mechanics' impossibility to describe the situation of two separated quantum entities, a shortcoming directly linked to the Einstein-Podolsky-Rosen paradox. Aerts shows the different steps of this generalised axiomatic quantum mechanics and the related problems to be solved tentatively by means of a simple macroscopic model entailing a quantum mechanical structure. Together with this alternative quantum mechanics an explanation for the probabilities of quantum mechanics-their origin residing in a lack of knowledge about the interaction between the measuring apparatus and the system-is exposed by means of the simple example. The paper Orthogonality Relations and the f.-Model by T. Durt (Vrije Universiteit Brussel, Belgium) is also written in the spirit of the Brussels approach and devoted to various orthogonality relations that are quantum-logical generalisations of the classical negation. These relations are studied within the f.-model of quantum spin-measurements developed recently in Brussels, where a continuous transition from quantum to classical mechanics can be modelled. This description also needs the more general framework, developed in Brussels and presented in the paper of Aerts, where some of the standard quantum mechanical axioms are not fulfilled. The final paper of the book Quantum Logical Semantics, Historical Truths, and Interpretations in Arts by M.L. Dalla Chiara and R. Giuntini (Universita di Firenze, Italy) shows that notions and constructions typical to the theory of quantum structures, and non-classical logics implied by them can find application even in such remote fields XVlll D. AERTS & J. PYKACZ as History and Art. This shows that the methods elaborated within the theory of quantum structures do not apply exclusively to microworld and quantum physics and justifies once more the idea of including the IQSA Symposium within the very broad scope of the Interdisciplinary Conference "Einstein Meets Magritte" . Diederik Aerts J aroslaw Pykacz DAVID J. FOULIS A HALF-CENTURY OF QUANTUM LOGIC WHAT HAVE WE LEARNED? 1. INTRODUCTION This expository paper comprises my personal response to the question in the title. Before giving my answers to this question, I discuss the utility of quantum logic in Section 2, offer a succinct review of the history of quantum logic in Section 3, and present in Section 4 two simple thought experiments involving a firefly in a box. The two thought experiments are pursued in Sections 5 through 9, where they give rise to natural and (I hope) compelling illustrations of the basic ideas of quantum logic. In Section 10, I replace the firefly by a "quantum firefly," and in Section 11, I summarize the lessons that we have (or should have) learned. This paper is written for so-called laypersons (although some of the ideas presented here have yet to be fully appreciated even by some expert quantum logicians). Thus, in using the firefly box to motivate and exemplify the fundamental notions of quantum logic, I need only the simplest mathematical tools, i.e., sets and functions. This does not necessarily imply that a casual reading of the narrative will guarantee an adequate understanding of the basic principles-a certain amount of attentiveness to detail is still required. 2. QUA N TUM LOG I C: W HAT GOO DIS IT? Before proceeding, I should address a question germane to any meaningful discussion of what we have learned, namely the related question what good is quantum logic? Until now, quantum logic has had little or no impact on mainstream physics; indeed some physicists go out of their way to express a contempt for the subject l , Whether or not the insights achieved by quantum logicians contribute directly to an achievement of 1 Witness the following ill-natured remark, arrogantly inserted in the review (Mathematical Reviews of the American Mathematical Society, Nov.-Dec., 1991, 91k 81010) of a paper written by a well-known Polish physicist who (in the opinion of the reviewer, a "mainline physicist" of some repute) had the temerity to concern himself with matters pertaining to the foundations of physics: " ... a small but persistent core of diehards who find fault with quantum mechanics is still active today. The journal Foundations of Physics serves to give them somewhere to publish." 1 © 1999 Kluwer Academic Publishers. 2 DAVID J. FOULIS whatever the Holy Grail of contemporary or future physicists happens to be 2 , quantum logic has already made significant contributions to the philosophy of science and to both mathematical and philosophical logic. Prior to Galileo's celebrated declaration that the Great Book of Nature is written in mathematical symbols, what we now call physical science was commonly referred to as natural philosophy. Quantum logic offers the possibility of reestablishing some of the close bonds between physics and philosophy that existed before the exploitation of powerful techniques of mathematical analysis changed not only the methods of physical scientists, but their collective mindset as well. The hope is that quantum logic will enable the mathematics of Descartes, Newton, Leibniz, Euler, Laplace, Lagrange, Gauss, Riemann, Hamilton, Levi-Civita, Hilbert, Banach, Borel, and Cartan, augmented by the mathematical logic of Boole, Tarski, Church, Post, Heyting, and Lukasiewicz to achieve a new and fertile physics/philosophy connection. The mathematics of quantum mechanics involves operators on infinite dimensional vector spaces. Quantum logic enables the construction of finite, small, easily comprehended mathematical systems that reflect many of the features of the infinite dimensional structures, thus considerably enhancing our understanding of the latter. For instance, a finite system of propositions relating to spin-one particles constructed by Kochen and Specker [50] settled once and for all an aspect of a long standing problem relating to the existence of so-called hidden variables [5J. Another example is afforded by the work of M. KHiy in which a finite model casts considerable light on the celebrated paradox of Einstein, Podolski, and Rosen [47J. One facet of quantum logic, yet to be exploited, is its potential as an instrument of pedagogy. In introductory quantum physics classes (especially in the United states), students are informed ex cathedra that the state of a physical system is represented by a complex-valued wavefunction 7/J, that observables correspond to self-adjoint operators, that the temporal evolution of the system is governed by a Schrodinger equation, and so on. Students are expected to accept all this uncritically, as their professors probably did before them. Any question of why is dismissed with an appeal to authority and an injunction to wait and see how well it all works. Those students whose curiosity precludes blind compliance with the gospel according to Dirac and von Neumann are told that they have no feeling for physics and that they would be better off studying 2 Just now, the desired consummation of theoretical physics seems to be a so called "theory of everything," i.e., a master theory encompassing both quantum mechanics and the general theory of relativity A HALF-CENTURY OF QUANTUM LOGIC 3 mathematics or philosophy. A happy alternative to teaching by dogma is provided by basic quantum logic, which furnishes a sound and intellectually satisfying background for the introduction of the standard notions of elementary quantum mechanics. Quantum logic is a recognized, autonomous, and rapidly developing field of mathematics3 and it has engendered related research in a number of fields such as measure theory [11, 17, 18, 20, 23, 26, 34, 36, 45, 70,71,74, 76] and functional analysis [4, 11, 19, 22, 23, 25, 35, 72]. The recently discovered connection between quantum logic and the theory of partially ordered abelian groups [29, 33] promises a rich cross fertilization between the two fields. Also, quantum logic is an indispensable constituent of current research on quantum computation and quantum information theory [24]. 3. A B R I E F HIS TOR Y 0 F QUA N TUM LOG I C In 1666, G.W. Leibniz envisaged a universal scientific language, the characteristica universalis, together with a symbolic calculus, the calculus ratiocinator, for formal logical deduction within this language. Leibniz soon turned his attention to other matters, including the creation of the calculus of infinitesimals, and only partially developed his logical calculus. Nearly two centuries later, in Mathematical Analysis of Logic (1847) and Laws of Thought (1854), G. Boole took the first decisive steps toward the realization of Leibniz's projected calculus of scientific reasoning4. From 1847 to the 1930's, Boolean algebra, which may be considered as a classical precursor of quantum logic, underwent further development in the hands of De Morgan, Jevons, Peirce, Schroder, et al, and received its modern axiomatic form thanks to the work of Huntington, Birkhoff, Stone, et al. Nowadays, Boolean algebras are studied either as special kinds of lattices [9], or equivalently as special kinds of rings 5 . In 1933, Kolmogorov, building upon an original idea of Frechet, established the modern theory of probability using Boolean sigma-algebras of sets as a foundation [51]. 3 Papers on quantum logic are reviewed in Sections 03G and 81P of the American Mathematical Society's Mathematical Reviews. 4 One of Boole's primary motivations was to construct a mathematical foundation for a theory of probability. Indeed, the full title of his 1854 masterpiece is An Investigation Into the Laws of Thought, On Which are Founded the Mathematical Theories of Logic and Probabilities. 5 A Boolean algebra can be defined either as a complemented distributive lattice or as a ring with unit in which every element is idempotent. 4 DAVID J. FOULIS The genesis of quantum logic is Section 5, Chapter 3 of J. von Neumann's 1932 book on the mathematical foundations of quantum mechanics [59]. Here von Neumann argued that certain linear operators, the projections defined on a Hilbert space6 , could be regarded as representing experimental propositions affiliated with the properties of a quantum mechanical system. He wrote, " ... the relation between the properties of a physical system on the one hand, and the projections on the other, makes possible a sort of logical calculus with these." In 1936, von Neumann, now in collaboration with G. Birkhoff, published a definitive article on the logic of quantum mechanics [10]. Birkhoff and von Neumann proposed that the specific quantum logic of projection operators on a Hilbert space should be replaced by a general class of quantum logics governed by a set of axioms, much in the same way that Boolean algebras had already been characterized axiomatically. They observed that, for propositions P, Q, R pertaining to a classical mechanical system, the distributive law P & (Q or R) = (P & Q) or (P & R) holds, they gave an example to show that this law can fail for propositions affiliated with a quantum mechanical system, and they concluded that, " ... whereas logicians have usually assumed that properties of negation were the ones least able to withstand a critical analysis, the study of mechanics points to the distributive identities as the weakest link in the algebra of logic." Birkhoff and von Neumann went on to argue that a quantum logic ought to satisfy only a weakened version of the distributive law called the modular law7 ; however, they pointed out that projection operators on a Hilbert space can fail to satisfy even this attenuated version of distributivity. Much of von Neumann's subsequent work on continuous geometries [60J and rings of operators [61 J was motivated by his desire to construct logical calculi satisfying the modular law. In 1937, K. Husimi [41J discovered that projection operators on a Hilbert space satisfy a 6 A Hilbert space is a vector space (over the reals, the complexes, or the quaternions) equipped with an inner product, and complete with respect to the metric arising from the inner product. 7 A lattice L satisfies the modular law iff, for p, q, r E L, p S r implies that p V (q 1\ r)=(pVq)l\r. A HALF-CENTURY OF QUANTUM LOGIC 5 weakened version of the modular law, now called the orlhomodular identit" . From 1937 until 1955, all research on quantum logic ceased as scientists turned their attention to military applications of physics. In 1955, L. Loomis [53] and S. Maeda [57] independently rediscovered Husimi's orthomodular identity in connection with their efforts to extend von Neumann's dimension theory for rings of operators to more general structures. The structures studied by Husimi, Loomis, and Maeda are now called orthomodular lattices9 . In 1957, G. Mackey wrote an expository article on quantum mechanics [55] based on lectures he was giving at Harvard. In 1963, he published an expanded version of these lectures in the form of an influential monograph [56] in which he referred to propositions affiliated with a physical system as questions. Under fairly reasonable hypotheses, it is easy to show that Mackey's questions form an orthomodular lattice. The simplicity and elegance of Mackey's formulation and the natural and compelling way in which it gave rise to a system of experimental propositions inspired a renewed interest in the study of quantum logic, now identified with the study of orthomodular lattices. Could it be that these lattices provide a basis for Leibniz's long awaited calculus ratiocinator? Thus motivated, a small but devoted group of researchersCatlin, Finch, Foulis, Greechie, Gudder, Holland, Janowitz, Jauch, Kotas, MacLaren, Maeda, Piron, Pool, Ramsay, Randall, Schreiner, Suppes, Varadarajan, et al-began in the early 1960's the task of working out a general mathematical theory of orthomodular lattices. A comprehensive account of the resulting theory and an extensive bibliography up to about 1983 can be found in [44]. In 1964, C. Piron introduced an alternative to Mackey's approach in which questions again band together to form an orthomodular lattice, but this time possessing more of the special features of the lattice of projection operators on a Hilbert space [62]. In fact, Piron was able to show that his questions could be represented as actual projection operators on a so-called generalized Hilbert space. Piron's work raised the issue of how to characterize the standard Hilbert spaces among the class of generalized Hilbert spaces. A list of more or less "natural conditions" on generalized Hilbert spaces was soon proposed in the hopes of singling out the "true" Hilbert spaces. 8 A lattice L with an orthocomplementation p >--+ p' satisfies the orthomodular identity iff, for p, q E L, p ~ q implies that q = p V (q 1\ p'). 9 The terminology "orthomodular lattice" was suggested by 1. Kaplansky because, in such a lattice, orthogonal pairs are modular pairs. 6 DAVID J. FOULIS In 1980, H. Keller dashed these hopes by constructing an example of a generalized Hilbert space satisfying all of the proposed natural conditions, but that is not a standard Hilbert space [46]. In 1995 M. Soler showed that Keller's counterexample could be bypassed by adding just one more natural condition to the previous list [72]. Thanks to Soler's work, we are now in possession of a satisfactory axiomatic approach to Hilbert-space based quantum mechanics [40]. As early as 1962 [27], it was noticed by some of the aforementioned researchers that, even without the imposition of additional hypotheses, Mackey's questions form an intriguing structure called an orthomodular poset. For this reason, orthomodular posets were also considered as possible candidates for quantum logics and were studied in parallel with orthomodular lattices, especially by S. Gudder [34, 35] and his students. A comprehensive account of orthomodular lattices and posets as models for quantum logics can be found in [65]. In orthodox quantum mechanics, when systems are combined or coupled to form composite systems [3, 28, 43], the combined system is represented mathematically by a so-called tensor product of Hilbert spaces. Even in the early 1960's, researchers realized that the entire quantum logic program would falter unless a suitable version of tensor product could be found for the more general logical structures then under consideration. After many unsuccessful attempts to formulate a suitable tensor product for orthomodular lattices and posets, it was discovered in 1979 that all such attempts were doomed to failure owing to the fact that the category of orthomodular posets is too small to admit a tensor product [67]. C. Randall and D. Foulis showed that, to accommodate the construction of tensor products, a larger category of mathematical structures called orthoalgebras has to be employed [8,31,49,66,68]. For a while, it seemed that orthoalgebras were the true quantum logics [8, 20, 26, 30, 36, 71]. Composite physical systems were studied from the perspective of quantum logic in an important and influential sequence of papers by D. Aerts [1, 2, 3]. In these studies, Aerts introduced the crucial notion of an entity which, roughly speaking, consists of a quantum logic of questions or propositions affiliated with a physical system together with a related system of properties, or attributes, of the system. Among other things, Aerts showed convincingly that a proper representation of a composite system requires consideration of the way in which properties of a total system depend on the properties of its constituents. In parallel with the development of quantum logic, and starting as early as 1970 [19, 38, 39], Davies, Lewis, Holevo, Ludwig, Prugovecki, Ali, Busch, Lahti, Mittelstaedt, Schroeck, Bujagski, Beltrametti, et al worked out a theory of A HALF-CENTURY OF QUANTUM LOGIC 7 quantum statistics and quantum measurement based on so-called effect operatorslO on a Hilbert space [54]. Every projection operator is an effect operator, but not conversely, and the effect operatorsdo not even form a lattice, let alone an orthomodular lattice, or even an orthoalgebra. In 1989, R. Guintini and H. Greuling introduced axioms for a generalized orthoalgebra and argued that effect operators on a Hilbert space form such a structure [32]. The generalized orthoalgebras of Guintini and Greuling, which turned out to be mathematically equivalent to the Dposets of Kopka and Chovanec [52], have come to be called effect algebras [29]. It can be argued that fuzzy or unsharp propositions are properly represented as elements of an effect algebra [13, 14, 15, 16, 52]. In 1994, Bennett and Foulis [29] discovered a connection between effect algebras and partially ordered abelian groups. In subsequent papers, they went on to show that virtually every structure previously proposed for a quantum logic, and indeed every Boolean algebra, can be represented as an interval in a such a group. An interval in a partially ordered abelian group, organized in a natural way into an effect algebra, is called an interval effect algebra. As a mathematical theory, quantum logic is thus subsumed by the theory of partially ordered abelian groups. Because there was no serious work on quantum logic per se during the years 1938-1957, I consider that quantum logic has been under development for roughly half a century. The history of quantum logic has been a story of more and more general mathematical structures-Boolean algebras, orthomodular lattices, orthomodular posets, orthoalgebras, and effect algebras-being proposed as basic models for the logics affiliated with physical systems. Whether effect algebras are the end of the line remains to be seen. Those wishing to read more about quantum logic and its connections with quantum physics are encouraged to consult the following standard references [6, 12, 21, 35, 42, 43, 54, 55, 56, 59, 62, 65]. 4. THE FIR E FLY BOX AND ITS EVE N T LOG I C Now I invite you to contemplate with me some "thought experiments" involving a firefly in a box (Figure 1). The box is to have two translucent (but not transparent) windows, one on the front and one on the side. The remaining four sides of the box are opaque. At any given moment, the firefly might or might not have its light on. If the light is on, it can be seen as a blip by looking at either of the two windows. 10 An effect operator is a self-adjoint operator A such that 0 :::; A :::; 1 8 DAVID J. FOULIS b 1 r f Figure 1. The Firefly Box Looking directly at the front window of the box when the light is on, one can tell by the position of the blip whether the firefly is in the left (1) or right (r) half of the box. Likewise, looking directly at the side window when the light is on, one can tell whether the firefly is in the front (f) or back (b) half of the box. Because the windows are not transparent, one cannot rely on depth perception to determine from the front window whether the firefly is in the front or back half of the box, nor from the side window, whether the firefly is in the left or right half of the box. Now consider two experimental procedures F and S. Procedure F is conducted by looking directly at the front window and recording I, r, or nF according to whether the blip is on the left, on the right, or there is no blip, respectively. Procedure S is conducted by looking directly at the side window and recording f, b, or ns according to whether the blip is in the front, the back, or there is no blip, respectively. One cannot conduct both procedures F and S at the same time because of the necessity of looking directly at one window or the other. Indeed, if one stands in a position to see both windows, parallax could spoil the accuracy of the observationl l . 11 The situation is quite analogous to the fact that attempts to make simultaneous measurements of noncommuting quantum-mechanical observables lead to interference effects that spoil the accuracy of the measurements. This does not mean that such simultaneous measurements cannot or should not be made. It simply means that, when they are made, one has to deal with a certain amount of fuzzyness or unsharpness. A HALF-CENTURY OF QUANTUM LOGIC 9 Imagine that we plan to conduct an experimental study of the firefly's habits using the only means available to us, namely the two experimental procedures F and S. Our work will be guided by an emerging "firefly box theory" (FBT) that may have to be amended as we collect more and more experimental data or change our mind about what is going on inside the box. To begin with, let us provisionally incorporate into our FBT the simplifying assumption that there are no baffles within the box behind which the firefly might hide from either window. If this is so, then a blip would be seen on the front window if and only if it would be seen on the side window. (Note the use of the subjunctive here-we can not meaningfully perform both F and S simultaneously!) This assumption is implemented simply by identifying outcome nF of F with outcome ns of S. Thus, we set n := nF = ns. (The notation := means equals by definition.) Let EF := {l, r, n} and Es := {f, b, n} be the mutually exclusive and exhaustive outcome sets for the experimental procedures F and S, respectively. Execution of F will yield one and only one outcome l, r, or n; execution of S will yield one and only one outcome f, b, or n. By an event for F we will mean a subset A of EF. Including the empty set and EF itself, there are 8 such events, namely 0, {l}, {r}, {n}, {l,r}, {l, n}, {r, n}, {l, r, n}. If F is executed and the resulting outcome is e E EF, say that an event A ~ EF occurs if e E A and that it nonoccurs if e tt. A. Likewise, an event for S is understood to be one of the 8 subsets 0, {J}, {b}, {n}, {J,b}, {J,n}, {b,n}, {J,b,n} of Es. If S is executed, then an event A for S occurs or nonoccurs according to whether the outcome belongs or does not belong to A, respectively. An event A can only occur or nonoccur when tested by the execution of either F (if A ~ EF) or S (if A ~ Es). The null event 0, the event {n}, and only these two events, are tested by both F and S. Let E:= {0,{l},{r},{l,r},{l,n},{r,n},{l,r,n}, {n}, {J}, {b}, {J, b}, {J, n}, {b, n}, {J, b, n}} (1) be the collection of all events and let E:= EF uEs = {l,r,n,f,b} (2) be the set of all outcomes of the available experimental procedures. There are 32 subsets of E, but only 14 events in E. For example, {r, f} is a subset of E, but it is not an event since there is no conclusive way to test it. For instance, if S is executed and b is the outcome, we would hardly say that {r, J} failed to occur since F was not executed and it is meaningless to ask whether or not the outcome r was secured. 10 DAVID J. FOVLIS The set £ of all events can be organized into a rudimentary logical structure as follows: Let A, B, C E £. Say that A and B are compatible iff they are simultaneously testable in the sense that A, B ~ EF or A, B ~ Es. (We abbreviate if and only if as iff.) Call A and B orthogonal iff they are compatible and disjoint. (Two sets are disjoint iff they have no elements in common.) Say that A and C are local complements iff they are orthogonal and their union is either EF or Es. If A and B share a common local complement C, say that A and B are perspective with axis C. For instance, the events {l, r} and {r, n} are compatible, since they are both tested by F, but they are not orthogonal since they have a common outcome r. If two events are orthogonal, they can be tested simultaneously and, when so tested, at most one of them can occur. The events {l, r} and {n} are local complements, since they are disjoint and their union is E F. When tested by F, one and only one of them will occur. The events {I, b} and {n} are also local complements, and both are tested by S. Therefore, {l, r} and {j, b} are perspective with {n} as an axis. If A E £, then A has at least one local complement C in £. Indeed, if A ~ EF, then the complement C := EF\A of A in EF is a local complement of A. Likewise, if A ~ Es, then D = Es\A is a local complement of A. Therefore, every event A is perspective to itself (with any local complement of A as an axis). Note that the two events EF and Es are perspective, with 0 as an axis. There is an obvious sense in which perspective events are "logically equivalent." For instance {l, r} occurs iff the firefly's light is on, and likewise for {j, b}. Also, EF = {l, r, n} always occurs (when tested, of course), and so does Es = {j,b,n}. The collection £ is partially ordered by set containment ~. If A and B are events and A ~ B, there is an obvious sense in which A "implies" B. For instance, {n} ~ {r, n} and, if {n} occurs, the firefly's light is out, and presumably {r, n} would have occurred too, had it been tested. (Again, note the use of the subjunctive. Indeed, if S is executed and {n} occurs, then {r, n} was not tested, so it neither occurred nor nonoccurred.) Figure 2 shows a diagram (called a Hasse diagram) of the event logic for the firefly box. The 14 events in £ are shown as nodes in this diagram, and event A is a subset of an event B iff either A = B or it is possible to go upward from A to B along a sequence of connecting line segments. The perspective events {l, r} and {j, b}, as well as {l, r, n} and {j, b, n} are enclosed in shaded ellipses on the diagram. A HALF-CENTURY OF QUANTUM LOGIC ( ~ ~- ------------ -~ {f, b, n} ") {l, r, n} ------------ {r, n} {I } 11 ~ {f, n} {r} {n} {f} {b} Figure 2. Logic of events 5. THE LOG l e o F EX PER I MEN TAL PRO P 0 SIT ION S As we have seen, in the event logic £ of the firefly box, two perspective events are, in some sense, logically equivalent. The temptation to identify logically equivalent events is irresistible, and we do so now by collapsing the diagram in Figure 2 as indicated by the shaded ellipses. Call the elements of the collapsed diagram experimental propositions, denote the experimental proposition corresponding to an event A E £ by 7r(A) , and let II be the set of all7r(A) as A runs through £. For simplicity, if A = {e} is an event with only one outcome, we write 7r( e) rather than 7r( {e} ). Denote the proposition 7r(0) by 0 := 7r(0) E II. Define the proposition 1 E II by 1 := 7r({l,r,n}) = 7r({J,b,n}). The proposition 7r({l,r}) = 7r( {J, b}), which can be regarded as asserting that the firefly's light is on, is denoted by 7r(n'). Likewise, the proposition 7r( {l, n}) can be regarded as asserting, "it is false that the firefly is in the right half of the box with its light on," so we denote it by 7r(r'), etc ... The resulting Hasse diagram for the logic II of experimental propositions is shown in Figure 3. 12 DAVID J. FOVLIS 1 1t (I ') 1t (l) 1t (r') n (r) 1t (n') n (n) 1t (f) n(f) 1t (b') n (b) Figure 3. Logic of experimental propositions The partial order on II, depicted in Figure 3, is denoted by ::; and called implication, or entailment. Note that, if A,B E £, then 7r(A) ::; 7r(B) iff there is an event BI such that A ~ BI and BI is perspective to B. Evidently, 0::; 7r(A) ::; 1 for all 7r(A) ElI. To test an experimental proposition 7r(A) E II, we select any event Al (including A itself) such that 7r(A) = 7r(A I ), we choose a test for AI, and we carry out the test. If Al occurs, we say that the proposition 7r(A) is confirmed, otherwise, we say that it is refuted. Thus, confirmation and refutation of experimental propositions is linked to occurrence and nonoccurrence of events. For instance, to test whether 7r( n') is confirmed (i.e., whether the light is on), we can execute either F or S and conclude that the light is on iff the outcome n is not secured. In reporting that a proposition 7r(A) is confirmed or refuted it is not necessary to specify which test was executed. There is a natural notion of "logical negation" for the experimental propositions in the logic II. Indeed, if A E £, define 7r(A)' := 7r(C), where C E £ is any local complement of A. The proposition 7r(A)', which is A HALF-CENTURY OF QUANTUM LOGIC 13 easily seen to be well defined, is regarded as a logical negation, or denial, of 7f(A). Evidently, 0' = 1 and l' = O. In Figure 3, the logical negations of each proposition in the first row above 0 are located directly above that proposition in the first row below 1. For instance, 7f( l)' = 7f( l'). The partially ordered set II depicted in Figure 3 is actually a lattice; that is, any pair of propositions 7f(A) and 7f(B) have a least upper bound, or join, 7r(A)V7r(B) and a greatest lower bound, or meet, 7r(A)I\7f(B) with respect to the implication relation ::;. For instance, 7r(l) V7r(r) = 7r(n') and 7f(r) 1\ 7f(J) = O. The mapping 7f(A) f--+ 7f(A)' is an orthocomplementation on the lattice II in that it has the following properties for all experimental propositions p, q E II: (i) p I\p' = (ii) p V p' (iii) pI! ° =1 =P (iv) p::; q => q' ::; p'. As a consequence, II satisfies the De Morgan Laws: (v) (p /\ q)' = p' V q' and (vi) (p V q)' = p' /\ q'. Furthermore, the following orthomodular identity holds in II: (vii) p::; q => q = p V (q /\p'). Therefore, II forms a so-called orthomodular lattice [27, 44]. Say that propositions 7f(A) and 7f(B) in II are compatible iff there are compatible events Al and BI with 7f(A) = 7f(A I ) and 7f(B) = 7f(BI)' Note that a common test for the events Al and BI is then a common test for the propositions 7f(A) and 7f(B). For instance, 7f(l) and 7f(n) are compatible, 7f(n) and 7f(b) are compatible, but 7f(l) and 7f(b) are incompatible. Say that propositions 7f(A) and 7f(0) in II are orthogonal iff there are orthogonal events Al and BI with 7f(A) = 7f(A I ) and 7f(B) = 7f(BI)' Note that orthogonal propositions are necessarily compatible and that 7f(A) is orthogonal to 7f(C) iff 7f(A) ::; 7f(C)'. If 7f(A) ::; 7f(B) and if 7f(A) is confirmed, it is understood that 7f(B) is also confirmed and that every proposition 7f( 0) that is orthogonal to 7f(A) is refuted. Note that 7f(A) is confirmed iff 7f(A)' is refuted. Evidently, 1 is always confirmed, and is always refuted. In classical (Boolean) logic, the meet p/\ q of two propositions p and q is effective as their logical conjunction p&q. This is certainly not the case in the logic II; for instance, 7f( l) /\ 7f(J) = 0, whereas 7f( l) & 7f(J) would be the proposition asserting that the firefly is in the left front quadrant ° 14 DAVID J. FOULIS of the box with its light on. What is happening here is perfectly clearthe conjunction 7f( l) & 7f(f) is not in the logic II because there is no way to test it! How does one account for the nonclassical nature of the firefly box logic II, given that its source, a box with windows and a firefly, is utterly classical? The answer, as we shall see in Section 10 below, is that there are pairs of nonclassical quantum-mechanical experiments that yield the same event logic [;, and therefore the same experimental logic II, as the firefly box. The event logic [; does not "know" the difference between the firefly box and the quantum-mechanical system, so it produces an experimental logic II compatible with both. 6. THE LOGIC OF ATTRIBUTES Usually there are certain properties, or attributes, associated with a physical system S, such as "S is green," or "S carries an electric charge of 1.60217733 x 10- 19 coulomb," or "S has a spin component +~ in the z direction." An attribute can be either actual or potential. Those attributes that are always actual, such as the color of a raven or the charge of an electron, are said to be intrinsic. Attributes that can be either actual or potential, such as the color of a chameleon or the spin component of an electron, are called accidental. An actual attribute ex of a physical system S can manifest itself experimentally only in terms of outcomes of experimental procedures. In fact, ex induces a division of the set E of all outcomes of experimental procedures into two disjoint parts: P = those outcomes that are possible when ex is actual, and E\P = those outcomes that are impossible when ex is actual. What are the attributes associated with our firefly box? Attributes such as "the temperature in the box is 18° C" or "the box weighs 15 kg" do not concern us here since they are unrelated to the only experimental procedures at our disposal, namely F and S. However, consider the attribute ex = "either the firefly's light is off, or else it is on and the firefly is in the left front quadrant of the box." If ex is actual, outcomes rand b are impossible and the set of possible outcomes is P = {l, j, n}. Conversely, given that the possible outcomes when ex is actual are l, j, and n, one can easily identify the original attribute ex. More generally, those attributes ex of the firefly box that can manifest themselves by way of the experimental procedures F and S can always be recaptured as soon as we know the subset P of E consisting of outcomes that are possible when the attribute is actual. Therefore, the set P S;;; E A HALF-CENTURY OF QU ANTUM LOGIC 15 provides a perspicuous mathematical representation of the attribute a, and in what follows, we shall simply identify a with P. Suppose that P ~ E is an attribute of our firefly box and let A E [; be an event. If An P = 0, then A consists entirely of outcomes that are impossible when P is actual. Thus, if An P = 0 and P is actual, then A is impossible in the sense that it must nonoccur when tested. Recall that the two events {l, r} and {f, b} are logically equivalent; hence if {l, r} is impossible, so is {J, b} and vice versa. Consequently, P n {l, r} = 0 {=:} P n {f, b} = 0. (3) Of the 32 subsets of E = {l, r, n, f, b}, only 20 satisfy condition (1), and each of these can be interpreted as a meaningful attribute of the firefly box. Even the empty set 0 can be interpreted as an attribute, albeit one that is never actual. On the other hand, the set E of all outcomes satisfies (1) and represents an attribute that is always actual12 . Let us denote by A the collection of all attributes P ~ E, so that PEA iff P ~ E and P satisfies (1). The set A is partially ordered by the relation ~ of set containment. Furthermore, if P, Q E A, then P ~ Q iff Q is actual whenever P is actual. Therefore, ~ can be regarded as a kind of implication relation on A and, in this sense, A becomes a logical system called the attribute logic of the firefly box. It is easy to see that, if P, Q ~ E and both P and Q satisfy condition (1), then so does P U Q. Therefore, A is closed under the formation of unions. Thus, if P, Q E A, then P and Q have a join P V Q = P U Q in A. Also, if P, Q E A, then P and Q have a meet P A Q in A; in fact P A Q is the union of all attributes in A that are contained in both P and Q. Although A is closed under unions, it is not closed under intersections. For instance, P := {l, f, n} E A and Q := {l, b, n} E A, but P n Q = {l,n} r:J. A. In fact, P A Q = {n} -=f. P n Q. In general, if P, Q E A, then the attribute P A Q is a subset of the set P n Q, but P A Q -=f. P n Q unless it happens that P n Q E A. Suppose PEA and P is actual. If the experimental procedure F is executed, one of the outcomes l, r, or n in EF must be secured and, since the outcomes in EF \P are impossible, it follows that one of the outcomes in P n EF must be secured. In other words, the event P n EF necessarily occurs when P is actual. Likewise, if P is actual, the event pnEs necessarily occurs when tested. Furthermore, if P is actual, A E t:, and one of the conditions P n EF ~ A or P n Es ~ A holds, then A necessarily occurs when tested. Let us say that P guarantees A iff one 12 All of the intrinsic attributes are thus identified with E. 16 DAVID J. FOULIS of the conditions P n Ep ~ A or P n Es ~ A holds. The fact that the empty attribute 0 guarantees every event is harmless since the empty attribute is never actual. If A E E, let [A] denote the union of all attributes PEA that guarantee A. In other words, [A] is the largest attribute that, when actual, necessitates the occurrence of A when tested. It is easy to check that, if A,B E E, then 1I"(A) ::; 1I"(B) ~ [A] ~ [B], (4) so we can and do define [1I"(A)] := [A] for all events A E E. For simplicity, if e E E, we write [e] rather than [{ e}] and we write [e'] rather than [11" (e )']. An attribute of the form [A] is called a principal attribute. In view of (2), the mapping 1I"(A) I---> [A] embeds the experimental logic II in the attribute logic A, whence 12 of the 20 attributes in A are principal, and 8 are nonprincipal. Although the embedding 1I"(A) I---> [A] preserves joins, it fails to preserve meets. For instance, 11"( l) /\ 11"(1) = 0 in II, but [l] = {l,J, b}, [f] = {l, r, J} and [l] /\ [f] = {l, J} i- 0 in A. Each of the 8 non principal attributes in A can be written as a meet of principal attributes; for instance {l, j, n} is a nonprincipal attribute in A and {l,J, n} = [r'] /\ [b']. For simplicity, we write [r'b'] rather than [r'] /\ [b']. Similarly, we write [l f] rather than [l] /\ [f]' and so on. With this notation, the Hasse diagram for the attribute logic is shown in Figure 4. 7. ST A TES AND IRREDU eIB LE A TTRIB UTES Nearly every scientific theory utilizes, explicitly or implicitly, the notion of the state of a physical system. The usual understanding is that, at any given moment, the system is in a particular state 'l/;. All information about outcomes of experimental procedures executed on the system in state'l/; are supposed to be encoded into 'l/;. The state of the system can change in time under a deterministic or stochastic dynamical law , it can change because an experimental procedure is executed, or it can change spontaneously. Until now, our firefly box theory (FBT) has recognized only one explicit principle, namely n = np = ns. (However, one could argue that much of the discussion in Section 6 regarding the attributes of the firefly box constitutes a further evolution of the FBT). Now we have to face the issue of incorporating into our FBT a suitable mathematical representation for the set \)i of all possible states of the firefly box. 17 A HALF-CENTURY OF QUANTUM LOGIC 1 [r'] [l ] [b'] [r' b'] [l f] [n'] [f] [r' f] [l b] [f] [I'] [l ' b'] [b] [l' f] [n] [r f] [r] [r b] Figure 4. Logic of attributes We cannot see inside the box, but we are formulating our FBT under the supposition that the blips of light on the windows are caused by a firefly. The firefly could be located in anyone of the four quadrants of the box, and its light could be on or off, so there seem to be eight different possible states of the firefly box. However, when the light is off our available experimental procedures F and S provide no information about the location of the firefly. In view of this experimental limitation, it seems more reasonable to restrict our state space W to five possible states, namely (in Dirac's "ket" notation) \II = {Il!), Ilb), Ir!), Irb), In)}. (5) 18 DAVID J. FOULIS The first four states correspond to the location of the firefly in the leftfront, left-back, right-front, and right-back quadrant with its light on. In the fifth state In), the light is off. Notice in Figure 4 that there are exactly five minimal nonempty attributes in A, namely [If]' [lb], [r f], [rb], and [n]. These are the attributes that are irreducible in the sense that they cannot be decomposed into more elementary attributes, and they are in obvious one-to-one correspondence with the five states in \]I. Two new principles suggest themselves, and we now incorporate them into our FBT: Principle of Irreducible Attributes To each state 'lj; E \]I there corresponds a uniquely determined irreducible attribute P,p E A. The attribute P,p is actual iff the system is in state 'lj;. Principle of Actuality for Attributes An attribute PEA is actual iff the system is in a state 'lj; for which P,p ~ P. Thanks to the principle of irreducible attributes, one and only one of the irreducible attributes is actual at any given moment. As a consequence of both principles, this unique irreducible attribute is the meet of all the attributes that are actual at that moment. Suppose P, Q E A. In spite of the fact that P /\ Q is not necessarily P n Q, it turns out (and is not difficult to verify) that P /\ Q is actual iff both P and Q are actual. Thus, P /\ Q is effective as a true logical conjunction of P and Q in the attribute logic A. That the join P V Q = P U Q of two attributes is not necessarily a logical disjunction of P and Q is a profound observation first made by D. Aerts [1]. For instance, a glance at Figure 4 shows that [If] V [rb] = [n'l; yet the attribute [n'] can be actual (i.e., the light is on) in a state (e.g., Irf)) for which neither [If] nor [rb] is actual. The fact that a join of attributes need not be a logical disjunction of the attributes can and should be regarded as the true basis for the notion of "superposition of states." Say that a state 'lj; E \]I is a proper superposition of states ct, {3 E \]I iff P,p ~ Pa V P{3 but 'lj; =I- ct, (3. For instance, Ir f) is a proper superposition of Ilf) and Irb). Since the publication in 1930 of Dirac's seminal monograph on the mathematical foundations of quantum mechanics [21], it has been an article of faith among physicists that a fundamental distinction-if not the fundamental distinction-between quantum and classical mechanics is that there are proper superpositions of states in the former, but not in the latter. If this is so (and I am not entirely convinced that it is [7]), then our firefly box is already exhibiting quantal behavior! A HALF-CENTURY OF QU AN TUM LOGIC 19 8. PROBABILITY MODELS The system of real numbers is denoted by the symbollR, and the closed interval of real numbers between 0 and 1 is written as [0, 1]. By a probability model for the firefly box, we mean a function w : E ----t [0, 1] ~ JR mapping each outcome e E E into a real number w(e) between 0 and 1 in such a way that w(l) + w(r) + w(n) = 1 and w(j) + w(b) + w(n) = 1. (6) If e E E, then w(e) is to be interpreted as the probability, according to the model w, that the outcome e will be secured when an experimental procedure (F or S) is conducted for which e is a possible outcome. Denote by n the set of all probability models w for the firefly box. If A E £ is an element of the event logic and wEn is a probability model, we define w(A) := L w(e) (7) eEA and interpret w(A) as the probability, according to the model w, that the event A will occur if tested. In this way, probability models wEn can be "lifted" to the logic £ of events. If A, B E £ with A ~ B, it is clear that w(A) :S w(B). If A, C E £ and A is orthogonal to C, then w(A U C) = w(A) + w(C). Therefore, (6) implies that w(A) + w(C) = 1 for local complements A, C E £. If wEn and A and B are perspective events with axis C, then w(A) + w(C) = 1 = w(B) + w(C), and it follows that w(A) = w(B). Hence, for an experimental proposition 7r(A) we can and do define w(7r(A)) := w(A). In this way, probability models wEn can be lifted to the logic II of experimental propositions. Naturally, w(7r(A)) is interpreted as the probability, according to the model w, that 7r(A) will be confirmed if tested. If wEn, then, regarded as a function w : II ----t [0, 1] ~ JR, w is a probability measure in the sense that w(l) = 1 and, for orthogonal propositions 7r(A) and 7r(C), w(7r(A) V 7r(C)) = w(7r(A)) + w(7r(C)). For the firefly box, n provides a so-called full, or order determining, set of probability measures in the sense that, if w(7r(A)) :S w(7r(B)) for all wEn, then 7r(A) :S 7r(B). If WI, W2, ... , wn E nand tl, t2, ... tn are positive real numbers such that tl + t2 + ... + tn = 1, the function w : E ----t JR defined for all e E E by (8) 20 DAVID J. FOULIS is called a convex combination, or mixture, of WI,W2, ... ,Wn with mixing coefficients tl, t2, ... , tn. It is not difficult to see that such a mixture takes on values between 0 and 1 and satisfies 0, so it is again a probability model wEn. In other words, n is a convex set, i.e., it is closed under the formation of convex combinations. Each wEn is completely determined by the three real numbers (9) x := w(l), y := w(f), and z := w(n). Indeed, as a consequence of (6), w (r) = 1- x - z and w (b) = 1- y - z. (10) Of course, the numbers x, y, z are subject to the conditions o ::; x, y, z ::; 1, x + z ::; 1, and y +z ::; 1. (11) The set of all points (x, y, z) in coordinate 3-space IR3 that satisfy (11) is a pyramid with a square base (Figure 5). A point (x, y, z) in the pyramid may be identified with the corresponding state w by (9) and (10), so the pyramid provides a geometric representation of the space n of probability models for the firefly box. The five vertices WII!) := (1,1,0), Wllb) := (1, 0, 0), Wlrf) := (0, 1, 0), Wlrb) := (0, 0, 0), and win) := (0, 0, 1) of the pyramid correspond in an obvious way to the five states in W. For instance, x = Wllf)(l) = 1, y = WII!) (f) = 1, and z = WII!)(n) = 0 for the probability model Wllf). In the geometric representation of n as a pyramid (Figure 5), the five vertices correspond to extreme points of the convex set n, that is, points W that cannot be written in the form (3) unless WI = W2 = ... = w. For a polytope, such as n, there are only finitely many extreme points, and every point is a convex combination of extreme points. If wEn, we define the support of w, in symbols supp(w) by supp(W) := {e E EIO < w(e)}, (12) noting that supp(w) is the set of all outcomes e E E that are possible according to the model w. In view of our discussion in Section 7, it should come as no surprise (and it is easy to check) that supp(w) E A. Furthermore, the supports of extreme points produce irreducible attributes corresponding to states, just as one would expect. For instance, supp( WI If) ) = {l, J} = [l fJ which corresponds to the state Ilf) . Note that the support of a convex combination (8) is the union of the supports of WI, W2, ... , Wn E n from which it is formed. The five vertices, eight edges, four triangular faces, and the square base of the pyramid in Figure 5 are called faces of n. In addition it is A HALF-CENTURY OF QU ANTUM LOGIC 21 z 1 y x Figure 5. Geometric representation of n convenient to include the empty set 0 and fl itself as (improper) faces, making a total of twenty faces in all. The four triangular faces and the square base-that is, the maximal proper faces-are called facets. The five vertices are the minimal proper faces. The intersection of two faces is again a face, and, given any two faces there is a unique smallest face containing both. Therefore, partially ordered by inclusion ~, the faces form a lattice, called the face lattice of fl, and denoted by F. The fact that both the attribute logic A and the face lattice F have twenty elements is no accident. In fact there is a natural one-to-one correspondence P f---t <I> between attributes PEA and faces <I> E F given by <I> := {w E fl\supp(w) ~ P} and P := UWEi[>SUpp(w). Furthermore, 22 DAVID J. FOULIS the correspondence P f-----t <P is a lattice isomorphism in that it preserves meets and joins. Thus, the face lattice :F of [2 provides an alternative representation for the attributes of the firefly box. 9. TESTING AND INFERENCE For the firefly box we now have three related logical structures, namely £, II, and A, as well as the state space W, the convex set [2 of probability models, and the face lattice :F of fl, which is isomorphic to A. For A E £ we have three truth values: occur, nonoccur, and not tested. For 7r(A) E II we again have three truth values: confirmed, refuted, and not tested. By their very definitions, these truth values can be determined by executing an appropriate experimental procedure, either F or S. For an attribute PEA we have two truth values: actual and potential. Unlike the truth values for events and experimental propositions, it might not be possible to determine the truth value of an attribute P by conducting a single experiment. If P = [7r(A)] is a principal attribute, we can test P by testing 7r(A). If 7r(A) is refuted, then P cannot have been actual since its actuality guarantees 7r(A). If 7r(A) is confirmed, we have evidence that P might have been actual, but it may not be conclusive. If PEA is not principal, it can be written as a conjunction of principal attributes, one of which can be selected and tested, again supplying (usually inconclusive) evidence that P was either actual or only potential [63]. Apparently, inferences about which properties of a physical system are actual and which are only potential will have to depend on evidence gathered from repeated testing, either under circumstances in which one has reason to believe that the state of the system remains unchanged, or on a sequence of replicas of the underlying system all of which are presumed to be in the same state. As of now, however, there seems to have been no serious attempt to develop a mathematical theory of formal scientific inference regarding the attributes of a physical system. For each state 't/J E W, we have two truth values, in and not in. To test the state 't/J, we can test the corresponding irreducible attribute P'Ij; as indicated above. But then state testing will be as inconclusive as attribute testing and it will be hindered by the same lack of a theory of inference. For certain physical systems (if not for our firefly) it is possible to prepare a preassigned state 't/J, that is, to bring the system into the state 't/J by carrying out suitable procedures. When state preparation is possible, it may render moot the question of how to test states (and perhaps attributes as well). A HAL F - C E N T U R Y 0 F QUA N TUM LOG I C 23 Testing probability models is quite another matter-indeed this is what statistical inference is all about! The usual idea is that there exists a "true probability model" w* E n representing the habits of the firefly. Although we might not know which probability model is w*, we might be able to make some (perhaps tentative) conclusions about w* by repeatedly executing our procedures F and S and processing the experimental data thus obtained. It is often assumed that the repeated trials of F and S are "independent" in the sense that the firefly's habits are unaffected by our experiments. It is easy to challenge this assumption, but not so easy to design reliable strategies of statistical inference to take into account observation-induced changes in the firefly's behavior patterns. Two useful mathematical tools conventionally employed in statistical investigations are statistical hypotheses and parameters. By a statistical hypothesis is meant a subset A of n, usually subject to a condition that it be measurable in some appropriate sense (e.g., Borel or Lebesgue measurable). Let I denote the set of all statistical hypotheses. By a statistical parameter is meant a real valued function), : n ---+ JR satisfying the condition that, for every interval I ~ JR, the set ),-1(1) := {w E n I ),(w) E I} (13) is a statistical hypothesis in I. A statistical hypothesis of the form)' -1 (I) is called a parametric hypothesis. A statistical hypothesis A E I is understood to represent the proposition w* E A asserting that the true probability model belongs to A. Partially ordered by ~, I forms a logical system called the inductive logic, and in I the meet A 1\ r = A n r and join A V r = A u rare effective as the conjunction and disjunction, respectively, of statistical propositions A, rEI. Under these operations, I forms a Boolean algebra. The branch of statistics known as hypothesis testing is concerned with the problem of deciding whether to (tentatively) accept or reject a statistical hypothesis in the face of experimental data, or to hold it in abeyance. Thus, statistical hypotheses acquire three truth values: accepted, rejected, and held in abeyance. The "true value" of a statistical parameter)' is of course ),* := ),(w*) and parameter estimation is the branch of statistics devoted to the problem of estimating ), * on the basis of experimental data. A point estimation of ),* produces a real number Xthat one has reason to believe is a good approximation to ), *. An interval estimation of ), * yields a confidence interval I ~ JR with the understanding that the statistical hypothesis), -1 (1) is to be accepted. For our firefly box, the components x, y, z of the geometric point (x, y, z) representing the probability model 24 DAVID J. FOULIS wEn as in Figure 5 form a complete set of statistical parameters in the sense that knowledge of x*, y*, and z* would determine w*. Suppose the experimental procedure F is executed TF times and that the outcomes l, r, and n are secured N(l), N(r), and NF(n) times, respectively, during these trials. Likewise, suppose S is executed Ts times and that the outcomes f, b, and n are secured N(f), N(b), and Ns(n) times, respectively, during these trials. Thus, TF = N(l) + N(r) + NF(n) (14) and + N(b) + Ns(n) := TF + Ts trials. If we Ts = N(f) (15) for a total of T assume that the habits of the firefly are unaffected by our observations, then the sequential order in which F and S are executed is presumably irrelevant. We could carry out the TF trials of F first, then perform the Ts trials of S-or vice versa. We could alternate trials of F and S. We could even flip a coin after each trial to see whether to perform F or S on the next trial. In any case, all pertinent information derived from the T = TF + Ts trials is encoded in the observed frequency vector rJ := (N(l), N(r), NF(n), Ns(n), N(f), N(b)). (16) If ,\ is a statistical parameter, an estimator for ,\ is a function X (rJ) that provides a numerical estimate ,\* ~ ,\1rJ) of ,\* based on the experimentally observed frequencies. Statisticians have developed several techniques and conditions to assess and compare various proposed estimators. For instance, an estimator X is said to be unbiased iff, whenever the observed frequency vector rJ conforms exactly to a probability model w in the sense that N(l) = w(l)TF, N(r) = w(r)TF, NF(n) = w(n)TF, Ns(n) = w(n)Ts, N(f) = w(f)Ts, and N(b) = w(b)Ts, then ,\1rJ) = '\(w). A weaker, and perhaps more realistic condition is that the estimator be asymptotically unbiased in the sense that '\1w) approaches '\(w) as a limit when TF and Ts become larger and larger. 9.1 Example Let N(l, r) := N(l) + N(r), N(f, b) := N(f) + N(b), N n := NF(n)+Ns(n), and T:= TF+TN. Then the maximum likelihood estimators [48] for the statistical parameters x, y, z are given by x ~= y ~= N(l) N(l,r) [1 _N(n)] T' (17) N(f) N(f,b) [1 _N(n)] T' (18) A HAL F - C E N T U R Y 0 F QUA N TUM LOG I C ZA= ~. 25 (19) As is easily checked, the estimators in Example 9.1 are unbiased. 10. THE QUA N TUM FIR E FLY BOX Associated with a quantum-mechanical system S is a vector-like quantity called spin. However, it turns out that measurements of the spin component in a fixed direction can produce only finitely many different numerical outcomes rather than the continuum of possible components that would be expected for an ordinary vector quantity. In other words, the spin components of S in a given direction are "quantized." The behavior of S in regard to its spin is characterized by a number j which can be 0, a positive integer, or half of a positive integer. The spin component of a spin-j system S, measured in a fixed spatial direction d, can take on only 2j + 1 different values: -j, -j + 1, ... ,j -1, or j. For instance, the spin component of a spin-! system, measured in a given direction d, can only be -! or !. For a spin-! system, the outcomes -! and ! are called spin down and spin up in the direction d. An electron, for example, is a spin-! particle. Suppose S is a spin-! system and we have a spin detecting apparatus that will measure the spin component of S in the direction of a unit vector d = (d 1 , d2, d3), di + d§ + d§ = 1. It turns out that the possible states of the system S are represented by vectors 'l/J = ('l/Jl, 'l/J2, 'l/J3) with 'l/Jr + 'l/J~ + 'l/J~ ~ 1. Therefore, the state space \[1 of S can be visualized as a solid sphere of radius 1. According to the rules of quantum mechanics, the probability of spin-up in direction d when S is in state 'l/J E \[1 is given by Probv,,(spin-up in dir. d) = For the special case in which 'l/Jr + 'l/J~ the unit vectors d and \[1, ~(1 + 'l/Jldl + 'l/J2d2 + 'l/J3d3)' (20) + 'l/J§ = 1 and 'Y is the angle between Probv,,(spin-up in dir. d) = cos2(~). (21) experimental apparatus is rarely 100 percent efficient, and the probabilities given by (20) and (21) have to be regarded as conditional probabilities, given that the detector actually produces a response. Imagine now that our firefly is really a spin-! system, that a spin detector is inside the box, and that it signals spin-up or spin-down by 26 DAVID J. POVLIS producing a blip of light on the front or side window. For the front window, with the same symbols used in Section 4, suppose the detector is set so that spin-up in direction dp produces outcome l, spin-down in direction dp produces outcome r, and outcome np (no blip on the front window) simply means that the detector failed to respond. Likewise, for the side window, spin-up in direction ds produces outcome j, spindown in direction ds produces outcome b, and outcome ns means that the detector failed to respond. As before, we must look directly at one window or the other 13 . Suppose the spin detector has the same detection efficiency, say 100E%, o :::; E :::; 1, in anyone direction dp as in any other direction ds. Then we are (almost literally) "in the dark" about the spin of S when we look at the front window about as often as when we look at the side window. In this case, there seems to be no harm in setting n := np = ns as we did in Section 4. Then, for either window, the probability of outcome n is 1 - E. Let us choose our coordinate system so that dp = (1, 0, 0) and ds = (cos a, sin a, 0) with 0 < a :::; 7r /2. The state vector 'ljJ can then be written in terms of spherical coordinates 0 :::; p :::; 1, 0 :::; 0 :::; 27r, and 0:::; <1>:::;7r as o 'ljJ = (p cos 0 sin <1>, psin sin <1>, pcos <1». (22) Then, with the same notation as in Section 8, x = Prob?fJ(l) = ~E(l y = Prob?fJ(r) = ~E(l + pcosOsin<1», (23) + pcos(a - (24) 0) sin <1», and z = 1- E. (25) For the quantum firefly box, not every probability model w in the pyramid n of Section 8 corresponds to a possible state 'ljJ E \Ii. For instance, the four vertices on the bottom square are unattainable from (23), (24) and (25), even if E = 1. In fact, for E = 1 and 0 < a :::; 7r /2, the set r a of all points in n that correspond to possible states 'ljJ of the quantum firefly box is a convex region in the square base of n bounded by an ellipse with 13 Although there is no Heisenberg principle of uncertainty for spin measurements in different directions, the Hilbert-space operators for such measurements fail to commute. A HAL F - C EN T U R Y 0 F QUA N TUM LOG I C center (~, ~), 27 major axis along the line y = x, and having semi-major 1 1 and semi-minor axes of lengths 2-2 cos(a/2) and 2-2 sin(a/2), respectively. If a = 7r /2, then r a is a circular disk tangent to the boundary of the square base of n at the four midpoints (~, 0), (1, ~), (~, 1), and (0, ~ ). For our original firefly box, r a is a statistical hypothesis asserting that the firefly is behaving like a quantum firefly. 11. CONCLUSION---WHAT HAVE WE LEARNED? Our thought experiments with the firefly box have provided illustrations of some of the more important ideas and tools employed in the scientific study of physical systems: outcomes, events, experimentally testable propositions, states, attributes, probability models, tests, statistical hypotheses, and statistical parameters. Of course, this list is far from complete-what about observables, dynamics, symmetries, invariants, conservation laws, amplitudes, coupled systems, relativistic physics, causality, and so on? Although most of these ideas can also be illustrated and studied in the context of the firefly box (or firefly boxes), it is already possible to discuss what I consider to be the main lessons of quantum logic, and I propose to do that now. Here then are my personal candidates for the answers to the question posed in the title of this article: Logical Connectives. In dealing with propositions associated with a physical system, one must question the meaning and even the existence of the basic connectives of classical logic-and, or, denial, and implication. In our firefly example: (1) The event logic £ is not even a lattice, a fact which warns us not to try forming the logical disjunction-let alone the join-of event propositions such as {l} and {j} that cannot be tested simultaneously. There is no meaningful denial connective on the event logic £. For instance, what would be the denial of the event {n}? Is it {l, r}? Or is it {j, b}? (2) The logic II of experimental propositions is a lattice, but the meet (respectively, join) of experimental propositions is not their logical conjunction (respectively, disjunction) unless the propositions are simultaneously testable For instance 7r( l) 1\ 7r(J) = 0, which in no way corresponds to a logical conjunction 7r(l) & 7r(J) of 7r(l) and 7r(J). However, the logic II does carry a rather perspicuous logical negation p 1--7 pi (which in fact is an orthocomplementation). (3) The logic A of attributes is again a lattice, and the meet P 1\ Q of attributes P and Q is effective as their logical conjunction. But, if there are irreducible attributes contained in P and Q that admit proper 28 DAVID J. FOULIS superpositions, then the join P V Q cannot be construed as a logical disjunction of P and Q. Also, the logic A, does not admit any reasonable denial connective. In particular, there is no orthocomplementation on A. (4) The implication connective (p, q) f---7 P :J q is even more conspicuously absent in quantum logic than the conjunction and disjunction connectives. The material implication connective p :J q := p' V q of classical (Boolean) logic has been the subject of considerable philosophical criticism and debate; in quantum logics modelled by orthomodular lattices, one has to forfeit even this suspect connective and make do, if at all, with severely attenuated versions thereof [37]. Note, however, that all of the logical systems £, II, A, and I admit perspicuous implication relations, namely their respective partial order relations <;::;, :s;, <;::;, and <;::;. (5) It is only when one reaches the level of the inductive logic I of statistical hypotheses (a Boolean algebra) that one encounters a logical system with a secure and well-understood meaning of conjunction, disjunction, and denial as well as a (material) implication connective. (6) Even at the level of the inductive logic, a conditional hypothesis qA (i.e., r given A) cannot be construed as a material implication A :J r, and the logic I has to be enlarged to a Heyting algebra I I I to accommodate this important alternative notion of implication [73]. Conditional events, conditional propositions, and conditional hypotheses are currently under intense study by electronic engineers and computer scientists because of the necessity of codifying conditional information in expert systems. A Hierarchy of Logical Systems. There is a hierarchy of related but distinct logical structures affiliated with a physical system. These include, but are not limited to, the event logic £, the logic II of experimental propositions, the attribute logic A, the face lattice F, the inductive logicI, and the logic I I I of conditional hypotheses. Propositions in the various logics are different in kind, are tested in different ways, and have their own distinct modalities. I have a coin. (i) I can toss the coin, observe the outcome and determine whether or not the event "heads" occurred. (ii) I can make a prediction that the coin will fall "heads" on the very next toss. (iii) I can assert that the coin is fair and that I am willing to bet at odds of 1:1 on either "heads" or "tails" on the next toss. (iv) I can claim that, in a sufficiently long sequence of independent tosses, the proportion of "heads" will be very close to 0.5. It is patently obvious that the observation (i), the prediction (ii), the assignment (iii) of betting odds, and the claim (iv) regarding long run frequency are four propositions of completely different characters. The folly of attempting to formulate a single "unified logic" comprising all propositions affiliated with a physical system is manifest. A HAL F - C EN T U R Y 0 F QUA N TUM LOG I C 29 Perhaps the source of all such unfortunate attempts is the use of the definite article the in the title of the seminal paper of Birkhoff and von Neumann [10]. Events vs. Experimental Propositions. One cannot necessarily formulate quantum logic purely on the basis of the logic II of experimental propositions and higher-level logics built upon II. Indeed, some of the information implicit in the observation that a certain outcome was obtained (or that a certain event occurred) may be lost in the passage from events to experimental propositions. For the firefly box, the loss of information in the passage from E to IT is of little concern. Neither is it particularly serious in Hilbert-space based quantum mechanics, provided that one is dealing with a single isolated observation, e.g., a measurement of a spin-component with a Stern-Gerlach apparatus. In such a case, one can safely use elements of the standard quantum logic of Hilbert-space projection operators to carry the pertinent experimental information. However, if one has to deal with sequential or compound observations, e.g., iterated Stern-Gerlach measurements [75], the phase and amplitude information encoded in the complex wavefunction becomes critical. In the passage from an orthonormal set ('lj;i) of Hilbert-space state vectors to the corresponding projection operator P onto the closed linear subspace spanned by these vectors, all phase and amplitude information is wiped out! Experimental Propositions vs. Attributes. experimentally testable propositions about a physical system are one thing; attributes or properties of that system are quite another. By universal agreement, the genesis of what is now called quantum logic is von Neumann's Grundlagen der Quantenmechanik [59]. Nowhere in the Grundlagen does von Neumann refer to propositions about a physical system; he refers only to properties (i.e., what we have been calling attributes) of that system. However, four years after the publication of the Grundlagen, von Neumann (in collaboration with Birkhoff), writes only of experimental propositions and propositional calculi-there is no further mention of properties. I do not know how to account for von Neumann's abrupt transition from a logic of properties to a logic of experimental propositions. I do know that, since then, many (but not all [1, 2, 3, 58, 62, 63, 69]!) quantum logicians have routinely identified experimental propositions about a physical system and attributes of that system. This is a mistake, and a serious one! For our firefly box, we have seen that the logic II of experimental propositions and the logic A of attributes are separate, distinct, and 30 DAVID J. FOULIS nonisomorphic logical systems, albeit linked by the mapping 7r(A) 1-+ [AJ. I know of no more compelling illustration of the error of confusing experimental propositions and attributes. States and Probability Models. Whereas it may be useful to assume, as is often done, that there is a probability model w'IjJ E n corresponding to each physical state 't/J E \]i, there is no a priori reason that the mapping 't/J 1-+ w'IjJ has to be either injective or surjective. In the literature of pure mathematics, certain linear functionals, measures, or homomorphisms are referred to as states because they or their analogues do in fact represent physical states in some conventional theory of mathematical physics. Quantum logicians need to be more careful! Even in conventional Hilbert-spaced based quantum theory, where (pure) states are represented by vectors 't/J in the unit sphere, there is a distinction between a state 'ljJ and the corresponding probability measure w'IjJ on the logic II of projection operators. Indeed, the probability measure w'IjJ, defined for P E II by w'IjJ(P) = (P't/J, 't/J), determines 't/J only up to a phase factor, and the identification of 't/J with w'IjJ would wipe out all phase information incorporated in the state vector. Thus, in conventional quantum mechanics, the mapping 't/J 1-+ w'IjJ is surjective (by a celebrated theorem of Gleason [23]), but not injective. For our quantum firefly in Section 10, the mapping 't/J 1-+ w'IjJ from state vectors 't/J to probability models wEn is injective, but not surjective. For the quantum firefly, the lack of surjectivity of't/J 1-+ w'IjJ can be ascribed to the fact that we can only measure the spin component in two different directions. If we append spin component measurements in additional directions to our list of available experimental procedures, we find that n gets smaller and smaller until, finally, the mapping 't/J 1-+ w'IjJ becomes surjective. An assumption that the mapping 't/J 1-+ w'IjJ from the state space 't/J of a physical system to the convex set n of all probability models for the system is injective, surjective, or both constitutes a significant physical assumption about the system and the available experimental procedures for its study. Hidden Variables. Quantum logic enables the construction of simple finite models that can help us understand the so-called problem of hidden variables. The question of whether apparent quantal behavior can be explained by classical experimental procedures that are currently unknown or unavailable is called the problem of hidden variables. For instance, our (non quantal) firefly box certainly admits hidden variables in the sense that a third window on top of the box would remove all apparent quantal behavior! A HALF-CENTURY OF QU ANTUM LOGIC 31 A mathematical proof showing that the quantal behavior of a particular physical system cannot be accounted for by hidden variables is called a no go proof. The first convincing no go proof was given by von Neumann himself in the Grundlagen [59]. An excellent survey of no go proofs up until about 1973 can be found in [5]. Complementarity. Quantum logic also enables the construction of simple models that can help us appreciate the so-called principle of complementarity. In the writings of a number of philosophers and scientists, not the least of whom was Bohr himself, Bohr's principle of complementarity has often been burdened with confusing metaphysical embellishments. Stripped of these encumbrances, the principle seems to affirm that there may be different experimental procedures, each of which can reveal aspects of a physical system necessary for a complete determination of its state, but whose conditions of execution are mutually exclusive. Our firefly box with the two experimental procedures F and S provides a perfect example of this situation, and also exposes the strong connection between complementarity and the problem of hidden variables. At roughly the same time that scientists were confronted by a breakdown in the Newtonian mechanical view of the physical world, artists were discovering a "principle of complementarity" in their own world. I doubt that many artists had any direct understanding of the physical principle of complementarity in quantum mechanics. Nevertheless, there was a sympathetic resonance between the two worlds, which I leave it to the reader to contemplate after perusing the following words of the art critic Marco Valsecchi [64]. "The idea was to arrange the forms in a plane so that an object or figure could be recognized not through perspective illusion, but through an analysis of its form, and also so that it could be seen from several points of view. These multiple analyses of total vision were put into a single image, thus giving an immediate unity to what has been seen, deduced and imagined. . .. to bring together all the multiple aspects of an object and to reduce them to the plane of the painting, like a summation all at the same time of all the different instances of poetic and rational perception." 32 DAVID J. FOULIS AFFILIATION David J. Foulis Department of Mathematics and Statistics University of Massachusetts Amherst, MA 01003, USA [email protected]. edu REFERENCES [1] Aerts, D., The One and the Many, Doctoral Dissertation, Brussels Free University, 1981. [2] Aerts, D., "Description of many physical entities without the paradoxes encountered in quantum mechanics", Found. Phys., 12, 1982, pp. 1131-1170. [3] Aerts, D. and Daubechies, I., "Physical justification for using tensor product to describe two quantum systems as one joint system", Helv. Phys. Acta, 51, 1978, pp. 661-675. [4] Alfsen, E.M. and Shultz, F.W., Non-commutative Spectral Theory for Affine Function Spaces on Convex Sets, Memoirs of the Amer. Math. Soc., 172, 1976. [5] Belinfante, F.J., A Survey of hidden variables Theories, Pergamon Press, Oxford, 1973. [6] Beltrametti, E. and Cassinelli, G., The Logic of quantum mechanics, Addison-Wesley, Reading, MA, 1981. [7] Bennett, M.K. and Foulis, D.J., "Superposition in quantum and classical mechanics", Found. Physics, 20, No.6, 1990, pp. 733-744. [8] Bennett, M.K. and Foulis, D.J., "Tensor products of orthoalgebras", Order, 10, No.3, 1993, pp. 271-282. [9] Birkhoff, G., lattice Theory, Third Edition, American Mathematical Society Colloquium Publications, XXV, Providence, RI, 1967. [10] Birkhoff, G. and von Neumann, J., "The logic of quantum mechanics", Ann. Math., 37, 1936, pp. 823-843. [11] Bunce, L. and Wright, J.D.M., "The Mackey-Gleason problem", Bull. Amer. Math. Soc., 26, No.2, 1992, pp. 288-293. [12] Busch, P., Lahti, P.J., and Mittelstaedt, P., The Quantum Theory of measurement, Lecture Notes in Physics, m2, Springer-Verlag, Berlin/Heidelberg/New York, 1991. A HAL F - C EN T U R Y 0 F QUA N TUM LOG I C 33 [13J Cattaneo, G. and Nistico, G., "Brouwer-Zadeh posets and threevalued Lukasiewicz posets", Int. J. Fuzzy Sets Syst., 33, 1989, pp. 165190. [14J Dalla Chiara, M.L., "Unsharp quantum logics", International 1. Theor. Phys., 34, No.8, 1995, pp. 1331-1336. [15] Dalla Chiara, M.L., Cattaneo, G., and Giuntini, R., "Fuzzyintuitionistic quantum logic", Studia Logica, 52, 1993, pp. 1-24. [16J Dalla Chiara, M.L. and Giuntini, R., "Paraconsistent quantum logics", Found. Physics, 19, No.7, 1989, pp. 891-904. [17J D'Andrea, A.B. and De Lucia, P., "The Brooks-Jewett theorem on an orthomodular lattice", J. Math. Anal. Appl., 154, 1991, pp. 507552. [18] D'Andrea, A.B., De Lucia, P., and Morales, P., "The Lebesgue decomposition theorem and the Nikodym convergence theorem on an orthomodular poset", Atti. Sem. Mat. Fis. Univ. Modena, 34, 1991, pp. 137-158. [19] Davies, E.B. and Lewis, J.T., "An operational approach to quantum probability", Commun. Math. Phys., 17, 1970, pp. 239-260. [20] DeLucia, P. and Dvurecenskij, A., "Yoshida-Hewitt decompositions of Riesz-space valued measures on orthoalgebras", Tatra Mountains Math. Publications, 3, 1993, pp. 101-110. [21] Dirac, P.A.M., The Principles of quantum mechanics, Clarendon Press, Oxford, 1980. [22] Dvurecenskij, A., "Quantum logics and completeness criteria of inner product spaces", International J. Theor. Phys., 31, 1992, pp. 18991907. [23] Dvurecenskij, A., Gleason's Theorem and its Applications, Kluwer, Dordrecht/Boston/London, 1993. [24] Eckert, A., "Quantum computation: Theory and experiments", in: Hellwig, K.-E. et al. (eds.), Book of Abstracts, quantum structures '96, Berlin, Technische Universitat, Berlin, 1996, pp. 35-36. [25] Edwards, C.M. and Riittimann, G.T., "On the facial structure of the unit balls in a GL-space and its dual", Math. Proc. Camb. Phil. Soc., 98, 1985, pp. 305-322. [26] Feldman, D. and WHce, A., "a -Additivity in manuals and orthoalgebras", Order, 10, 1993, pp. 383-392. [27] Foulis, D., "A note on orthomodular lattices", Portugaliae Math. 21, 1962, pp. 65-72. 34 DAVID J. FOULIS [28] Foulis, D., "Coupled physical systems", Found. Phys., 7, 1989, pp. 905-922. [29] Foulis, D. and Bennett, M.K., "Effect algebras and unsharp quantum logics", Found. Physics, 24, No. 10, 1994, pp. 1331-1352. [30] Foulis, D., Greechie, R., and Riittimann, G., "Filters and supports in orthoalgebras", International J. Theor. Phys., 31, No.5, 1992, pp. 789-807. [31] Foulis, D. and Randall, C., "Empirical logic and tensor products", in: Neumann, H. (ed.) Interpretations and Foundations of Quantum Theory, 5, Bibliographisches Institut Mannheim, Wien, 1981. [32] Giuntini, R. and Greuling, H., "Toward a formal language for unsharp properties", Found. Physics, 19, No.7, 1989, pp. 931-945. [33] Greechie, R.J. and Foulis, D.J., "The transition to effect algebras", International J. Theor. Phys., 34, No.8, 1995, pp. 1369-1382. [34] Gudder, S.P., "Spectral methods for a generalized probability theory", Trans. Amer. Math. Soc., 119, 1965, pp. 428-442. [35] Gudder, S.P., Quantum Probability, Academic Press, San Diego, 1988. [36] Habil, E., Orthoalgebras and Noncommutative Measure Theory, Ph.D. Dissertation, Kansas state University, 1993. [37] Herman, L., Marsden, E., and Piziak, R., "Implication connectives in orthomodular lattices", Notre Dame J. Formal Logic, XVI, 1975, pp. 305-328. [38] Holevo, A.S., "Statistical decision theory for quantum systems", J. of Multivariate Analysis, 3, 1973, pp. 337-394. [39] Holevo, A.S., "Probabilistic and Statistical Aspects of Quantum Theory", in: Krishnaiah, P.R. and Rao, C.R. (eds.), North-Holland Series in Statistics and Probability, Vol. 1, North Holland, Amsterdam/New York/Oxford, 1982. [40] Holland, S.S., "Orthomodularity in infinite dimensions; a theorem of M. Soler", Bull. Amer. Math. Soc., 32, 1995, pp. 205-232. [41] Husimi, K., "Studies on the foundations of quantum mechanics I", Proc. Physico-Mathematical Soc. Japan, 19, 1937, pp. 766-78. [42] Jammer, Max, The Philosophy of quantum mechanics, Wiley, New York, 1974. [43] Jauch, J.M., Foundations of quantum mechanics, Addison-Wesley, Reading, Mass., 1968. [44] Kalmbach, G., Orthomodular lattices, Academic Press, N.Y., 1983. A HAL F - C E N T U R Y 0 F QUA N TUM LOG I C 35 [45] Kalmbach, G., Measures on Hilbert lattices, World Scientific, Singapore, 1986. [46] Keller, H., "Ein Nicht-klassischer Hilbertscher Raum", Math. Zeit., 172, 1980, pp. 41-49. [47] Klay, M.P., "Einstein-Podolsky-Rosen experiments: The structure of the probability space. I" , Found. Phys. Letters, 1, No.3, 1988, pp. 205244. [48] KUiy, M.P. and Foulis, D.J., "Maximum likelihood estimation on generalized sample spaces: An alternative resolution of Simpson's paradox", Found. Physics, 20, No.7, 1990, pp. 777-799. [49] Klay, M.P., Randall, C.H., and Foulis, D.J., "Tensor products and probability weights", International Jour. of Theor. Phys., 26, No. 3, 1987, pp. 199-219. [50] Kochen, S. and Specker, E.P., "The problem of hidden variables in quantum mechanics", J. Math. Mech., 17, 1967, pp. 59-87. [51] Kolmogorov, A.N., Grundbegriffe der Wahrscheinlichkeitsrechnung, 1933, English Translation, Foundations of the Theory of Probability, Chelsea, New York, 1950. [52] Kopka, F., "D-posets of fuzzy sets", Tatra Mountains Mathematical Publications, 1, 1992, pp. 83-87. [53] Loomis, L., The lattice Theoretic Background of the Dimension Theory of operator Algebras, Memoirs of the Amer. Math. Soc., 18, 1955. [54] Ludwig, G., Foundations of quantum mechanics Vols. I and II, Springer, New York, 1983/85. [55] Mackey, G., "Quantum mechanics and Hilbert space", Amer. Math. Monthly, 64, 1957, pp. 45-57. [56] Mackey, G., The Mathematical Foundations of quantum mechanics, Benjamin, N.Y., 1963. [57] Maeda, S., "Dimension functions on certain general lattices", J. Sci. Hiroshima Univ., A19, 1955, pp. 211-237. [58] Mielnik, B., "Quantum logic: Is it necessarily orthocomplemented?", in: Flato, M. et al (eds.), Quantum mechanics, Determinism, Causality, and Particles, Reidel, Dordrecht, 1976. [59] von Neumann, J., Grundlagen der Quantenmechanik, Springer Verlag, Berlin, Heidelberg, New York, 1932; English translation: Mathematical Foundations of quantum mechanics, Princeton University Press, Princeton, NJ, 1955. [60] von Neumann, J., Continuous Geometry, Princeton Univ. Press, Princeton, NJ, 1960. 36 DAVID J. FOULIS [61] von Neumann, J. and Murray, F.J., "On rings of operators III", Ann. of Math., 41, 1940, pp. 94-161, in: J. von Neumann Collected Works, III, Pergamon Press, Oxford, 1961. [62] Piron, C., Foundations of Quantum Physics, Mathematical Physics Monograph Series, W.A. Benjamin, Reading, MA, 1976. [63] Piron, C., "Ideal measurement and probability in quantum mechanics", Erkenntnis, 16, 1981, pp. 397-401. [64] Porzio, D. and Valseechi, M., Pablo Picasso, Man and His Work, Chartwell Books, Inc., Secaucus, NJ, 1974. [65] Ptak, P. and Pulmannova, S., Orthomodular Structures as Quantum Logics, Kluwer, Dordrecht/Boston/London, 1991. [66] Randall, C. and Foulis, D., New Definitions and Theorems, University of Massachusetts Mimeographed Notes, Amherst, MA, 1979. [67] Randall, C. and Foulis, D., "Tensor products of quantum logics do not exist", Notices Amer. Math. Soc., 26, No.6, 1979, A-p. 557. [68] Randall, C.H., and Foulis, D.J., "operational statistics and tensor products", in: Neumann, H. (ed.), Interpretations and Foundations of Quantum Theory, Band 5, Wissenschaftsverlag, Bibliographisches Institut, Mannheim/Wien/Ziirich, 1981, pp. 21-28. [69] Randall, C.H. and Foulis, D.J., "Properties and operational propositions in quantum mechanics", Found. Physics, 13, No.8, 1983, pp. 843-863. [70] Riittimann, G.T., Non-commutative Measure Theory, Habilitationsschrift, Universitat Bern, 1980. [71] Riittimann, G.T., "The approximate Jordan-Hahn decomposition", Canadian J. of Math., 41, No.6, 1989, pp. 1124-1146. [72] Soler, M.P., "Characterization of Hilbert space by orthomodular spaces", Comm. Algebra, 23, 1995, pp. 219-243. [73] Walker, E.A., "Stone algebras, conditional events, and three valued logic", IEEE Transactions on Systems, Man, and Cybernetics, 24, No. 12, 1994, pp. 1699-1707. [74] Wilee, A., "Tensor products in generalized measure theory", International J. Theor. Phys., 31, No. 11, 1992, pp. 1915-1928. [75] Wright, R., "Spin manuals", in: Marlow, A.R. (ed.), Mathematical Foundations of Quantum Theory, Academic Press, N.Y., 1978. [76] Younce, M.B., Random Variables on Non-Boolean Structures, Ph.D. dissertation, University of Massachusetts, 1987. STANLEY GUDDER QUANTUM MECHANICAL MEASUREMENTS 1. INTRODUCTION One of the main purposes of physics is to gain an understanding of the natural universe. Such an understanding cannot be achieved all at once so the physicist concentrates on a limited physical system S (sayan electron). The experimentalist performs experiments or measurements on S in an attempt to discover various properties. The theoretician uses these properties to try to find regularities and patterns in the behavior of S under prescribed conditions. In this way, a theory may be developed that predicts the behavior of S and this predicted behavior may be checked by the experimentalist in the laboratory. If the predictions are experimentally verified, the theory stands for the time being and if they are refuted the theory must be altered or abandoned. Since the main enterprise of physics (and practically all the basic natural sciences) is performing and predicting the results of measurements, it is important to understand the structure of a set of measurements. For example, can measurements be decomposed into simpler ones? Conversely, how can measurements be combined to form new measurements and what are the properties of these combinations? Investigators who have studied measurement theory have developed mathematical models for the description of measurements. In this article, we shall discuss one of these models in very simple terms. Although we shall primarily be concerned with quantum mechanical measurements, what we say can be applied to practically any natural science. Moreover, there is the intriguing possibility that similar models may be employed in the arts, humanities and social sciences. Although it has evolved from more restrictive models over a period of many years, the model that we shall present has been developed only recently. Quantum measurement theory is not a closed subject but is still intensively studied and discussed. One can usually reduce the study of measurements or experiments to a consideration of simple two-valued measurements. Such simple experiments are frequently called yes-no measurements. For example, suppose M is a real-valued measurement such as the determination of the energy of a particle and A is a subset of the real numbers R If M is executed under certain prescribed experimental conditions, we can ask whether M results in a value in A. If this experiment gives a value for M in A, 37 © 1999 Kluwer Academic Publishers. 38 STANLEY GUDDER then the answer to our question is yes, if not, the answer is no. In this way, M can be replaced by a set of simpler yes-no measurements, each of which corresponds to a subset of JR.. Following G. Ludwig, we shall call a yes-no measurement an effect [8]. An effect can be thought of as corresponding to a physical phenomenon that either occurs or does not occur. There are two main types of effects, those that are sharp and those that are unsharp (fuzzy). A sharp effect corresponds to a perfectly accurate measurement. These are impossible to attain in practice, although experimentalists attempt to improve their techniques and equipment to approach such ideal measurements as closely as possible. Sharp effects are theoretical idealizations that have a simpler mathematical structure than unsharp effects and orthodox quantum mechanics usually only considers such effects. However, since unsharp effects give more realistic models for experimental practice, they have recently drawn considerable attention. Moreover, their more general nature has important mathematical consequences [1, 2, 6, 7, 8]. 2. PARTICLE DETECTORS An important example of an effect is a measurement utilizing a particle detector such as a Geiger or photon counter. Let us denote such a detector by d. Now d will have a certain domain of sensitivity given by a subset D of three-dimension space JR.3. If a particle (sayan electron) enters the domain D, then the particle triggers a response in d. We shall then say that d clicks. For each run of an experiment involving such a particle, either d clicks or it does not, so d is an effect. What we have just described is a sharp detector, but in practice, no detector is perfectly accurate. For example, the particle may enter D and on rare occasions d may not click, or d may click on rare occasions when the particle does not enter D. Moreover, D will never have a perfectly sharp boundary and d may not be as sensitive to a particle near the edge of D. In order to describe the behavior of d, we need a method for measuring its sensitivity at points in three-dimensional space. One technique for measuring the accuracy of d is given by a confidence function k We use the standard notation fd: JR.3 --+ [0, 1] which means that fd is a function that takes points in JR.3 to a number between and 1. Assuming that d is at a fixed location, D will encompass a fixed set within JR.3. Then for any point x in ]R3, fd(X) gives the confidence, on a scale between and 1, of an experimenter that d will click when the particle is located at x. The function fd is an intrinsic property of the ° ° qu ANTUM MECHANICAL MEASUREMENTS 1"":'" • • " . " ·-,....---0"""""·,, " " 0: u 39 "0------41",,, v r Figure 1. particle detector d and can be determined by independent, preliminary calibration experiments. In general, different detectors will have different confidence functions. Presumably, fd(X) is close to 1 when x is in D and x is not very close to the boundary aD of D. This indicates that in this case the confidence is high that d will click. However, for x in D and close to aD, fd(x) may not be close to 1 and the confidence of a click is lower. Similarly, for x outside of D but close to aD, fd(x) may be positive so there is a nonzero confidence of a click. The characteristic function of a set D is the function ID: IR3 - 7 [0,1] that has the value 1 on D and the value 0 on the complement DC of D. Of course, DC is the set of all points in IR3 that are not in D. If d is sharp, there is complete confidence that d will click when the particle is in D and that d will not click otherwise. Hence, for a sharp detector d, fd will coincide with ID and the closer fd is to ID, the more accurate d will be. To illustrate this discussion in a simple way, let us pretend that the detector experiment is carried out on an interval [0, r] of the real line R Suppose the experiment consists of a particle producing apparatus that is located at the origin O. The apparatus injects a particle to the right at time 0 and we attempt to detect the particle's location at time 1. We imagine that a is a one-dimensional detector which is placed so that its sensitivity domain is the interval [u, v] ~ [0, r]. If a is sharp, then fa = I[u,vl as shown in Figure 1. However, in practice, a will be unsharp and fa takes a form as illustrated in Figure 2. In this case, there is high confidence that a will click if the particle is near the center of [u, v] but low confidence if it is near the boundary u or v. We think of [u, v] as the physical shape of detector a and this is the domain in which a would faithfully respond to the presence of a particle if it were perfectly accurate. N ow we can assume the existence of an imaginary detector denoted by 1 whose confidence function is f1 = I[O,rl. We do not have to actually construct this detector, we just record a click no matter what happens. Similarly, we can assume there is a detector 0 whose confidence function fo has the constant value o. Again, we need not construct this detector, 40 STANLEY GUDDER u v r Figure 2. 1...;----0 0: •u 0>__----- .' v +, r Figure 3. we just record a nonclick no matter what happens. Moreover, for any detector d, we can imagine another "detector" d' whose confidence function is fd = fl - fd, That is, for any x in [0, r], we have fd(x) = 1 - fd(X). Notice that we can also write this as fd + fd = kNow d' may not be an actual detector but its equivalent is provided by d itself. Once d is in place, we just register a click if d does not click and a nonclick if d clicks. We call d' the orthosupplement of d. The confidence functions for the orthosupplements of Figure 1 and Figure 2 are shown in Figures 3 and 4. As far as this discussion is concerned, an actual detector d is determined by its confidence function fd. This function describes the accuracy of d and its domain of sensitivity and thus where d is placed. Moreover, fd determines the orthosupplement d' whose confidence function is fd =!1-fd, We now discuss various ways that detectors can be combined to form new detectors or new effects. Since for our purposes, a detector is com- r Figure 4. 41 QUANTUM MECHANICAL MEASUREMENTS u v w x r Figure 5. pletely described by its confidence function, we can accomplish this by discussing ways in which confidence functions can be combined to form new confidence functions. In our one-dimensional situation a confidence function will be any function f: [0, r] --+ [0,1]. Let a and b be detectors with corresponding confidence functions fa, fb respectively. It is clear that fa + fb is still a confidence function if and only if fa + fb ::; iI (that is, fa(x) + fb(X) ::; 1 for all x E [0, rD. If fa + fb ::; iI, we say that a EEl b exists and consider c = a EEl b to be an effect with confidence function fe = fa + fb. In this case, we also write fe = fa EEl!b- Roughly speaking, we may think of c as the effect (two-valued measurement) that clicks if a or b (or both) click and does not click otherwise. Now c may not correspond to an actual detector that can be constructed in practice. For example, if the sensitivity domains for a and b overlap, then such a detector c might be difficult or impossible to construct. However, we can still think of c as an idealized combination of a and b that has the confidence function fe = fa EEl !bIt is possible that a EEl b may not exist. For example, there may be a point Xo E [0, r] at which fa(xo) + fb(XO) > 1. In this case, fa + fb would not be a confidence function. On the other hand, if fa and fb are not simultaneously positive, then a EEl b does exist and c = a EEl b corresponds to a constructible detector. We merely place a and b in their designated positions and consider c = a EEl b to be the single detector obtained by combining the two detectors a and b. For example, let a be the detector illustrated in Figure 2 and let b be a similar detector that is placed so that its sensitivity domain is the interval [w, x] (Figure 5). Then c = aEElb can be considered to be a detector with confidence function fe as illustrated in Figure 5. However, if a and b are right next to each other (that is, v = w), then we have the situation shown in Figure 6. This combination of a and b may not give a EEl b because a particle located at v may very well not result in a click in either a or b which contradicts the fact that fa(v) + fb(V) = 1. But it is quite possible that a detector c = aEElb can be constructed. It would have a configuration similar to that of a and b but with a larger sensitivity domain. In this case fe appears as in Figure 7. 42 STANLEY GUDDER 0 u v w x r x r Figure 6. ILL 0 u ~ .. I v w Figure 7. Notice that for any detector d, the "detector" d' always exists (if only in our imagination) and dEB d' = 1. Also, note that if 1 EB d exists, then d = O. Indeed, we would then have !I + fd ::; !I so that fd = fo· In certain situations, we can even combine overlapping detectors. As long as fa + fb ::; !I, the effect c = a EB b exists and corresponds to the confidence function fe = fa EB fh. Carrying this to the extreme, we might be able to combine detector a with itself. If fa(x) ::; 1/2 for all x E [0, rJ, then the effect 2a = a EB a exists with confidence function ha = fa EB fa = 2fa. If 2a exists, then a is called an isotropic detector. This method of combining detectors can be continued to three or more detectors. For example, if a, band c are detectors (or more generally, effects) whose confidence functions satisfy fa +fb+ fe ::; !I, then d = aEBbEB c exists and is an effect whose confidence function satisfies fd = fa + fb +fc. To be more precise, if a EB band (a EB b) EB c exist, then b EB c and a EB (b EB c) exist and a EB (b EB c) = (a EB b) EB c. This last equation is called the associative law and is interpreted as meaning that the two sides of the equation are effects with the same confidence function. We conclude that parentheses are not needed when we consider EB combinations of three or more effects. Another important property of EB is that if a EB b exists, then bEBa exists and bEBa = aEBb. We thus say that EB is commutative. We call EB a partial binary operation on the set of effects (or confidence function). This is because EB is only defined for certain pairs of effects, namely those that satisfy fa + fb ::; !I. This is in contrast to a total binary operation such as the ordinary addition + for numbers or functions which is defined for all pairs of elements. In the type of mathematical model QU ANTUM MECHANICAL MEAS UREMENTS 43 that we are considering here, E9 is taken as the basic partial operation and all properties of the system are derived from the properties of EEl. To make a systematic study of the properties of EEl in our one-dimensional detector example, let C be a collection of functions from [0, r] into [0,1]. We say that C is a confidence algebra if C satisfies the following conditions. (1) Io,iI E C. I' = h - IE C. (3) If I, 9 E C and I + 9 ~ iI, then I + 9 E C. (2) If IE C, If f then f, 9 E C and f + 9 ::; EEl 9 is defined or that iI, we define f EEl 9 = f + 9 and say that f EEl 9 exists. Thus, a confidence algebra is contains fo, iI, is closed under I and is closed a set of functions that under EEl whenever f EEl 9 is defined. Naturally, we call the elements of C confidence functions. Confidence algebras exist because the set of all functions from [0, rJ into [0,1 J is a confidence algebra. A simpler example is the confidence algebra consisting of only fo and k It is easy to check that the intersection of any collection of confidence algebras is again a confidence algebra. Now let D be a set of detectors on [0, rJ which we identify with their corresponding confidence functions. As we have noted previously, if fa,fb E D with fa + fb ~ iI, then fa EEl fb need not be in D so D may not be a confidence algebra. However, the intersection C(D) of all the confidence algebras containing D is a confidence algebra and C(D) is the smallest confidence algebra containing D. We call C(D) the confidence algebra generated by D. It can be shown that any element of C(D) can be obtained from elements of D in a finite number of steps by performing EEl combinations and orthosupplements '. Thus, the elements of D correspond to detectors and the other elements of C(D) correspond to effects that can be derived from basic combinations of detectors. When we speak of effects in the remainder of this section and in Section 3, we are referring to elements of C(D). We can employ the partial operation EEl to define a natural order for effects. For effects f and g, we write f ~ 9 if there exists an effect h such that fEEl h = g. In particular, if a and b are detectors, we write a ~ b if fa ~ fb. It is interesting that this order f ::; 9 is the same as the usual order for functions given by f(x) ::; g(x) for all x E [0, rJ. However, we shall not use this fact here because we are adhering to the philosophy that the properties of effects should be given in terms of the basic partial operation EEl. Moreover, there are more general examples of effects that cannot be represented as functions so the function order is not applicable there. 44 STANLEY GUDDER What are the properties of the order S:.? Since f EB fo = f, we have f S:. f for any effect f. This is called reflexivity of S:.. Suppose that f S:. 9 and 9 S:. f· Then there exist effects hand i such that f EEl h = 9 and gEEli = f. Hence, fEElhEEli = f and it follows that hEEli = fo. Since h and i have nonnegative values, we conclude that h = i = fo. Therefore, f = g. In particular, a S:. band b S:. a imply that a = b. This is called the antisymmetry property of S:.. Finally, suppose that f S:. hand h S:. g. Then there exist effects i, j such that fEEl i = hand hEEl j = g. We then have fEEl (i EEl j) = (J EEl i) EEl j = hEEl j = 9 so that f S:. g. In particular, if a S:. band b S:. c, then a S:. c. This property is called the transitivity of S:.. These three properties show that S:. is a partial order relation for the set of effects. The relation S:. has many of the properties of the usual order for numbers. However, the dichotomy property of numbers does not hold for effects. Dichotomy states that for any a and b, either a S:. b or b S:. a. A partial order relation that satisfies dichotomy is called a total order relation. The orthosupplementation operation ' is order inverting which means that f S:. 9 implies that g' S:. f'. To show this, suppose that f S:. g. Then there exists an effect h such that fEElh = g. Hence, 11- fEElh = l1-g and we have g'+h=fl-g+h=l1-f=j' Thus g' EEl h = f' and g' S:. f'· Also, since 11 - (11 - J) = f, we conclude that f" = f for any effect f. We then say that ' is an involution. Moreover, we can show that f EEl 9 exists if and only if f S:. g'. Indeed, if f S:. g', then there exists an effect h such that f EEl h = g'. But then f + 9 = 11 - h = h' Hence, f EEl 9 exists and f EEl 9 = h'. Conversely, suppose that h exists. Then h' = fl - f - 9 so that f + h' = It follows that fEEl h' = g' so that 11 - = f EEl 9 9 = g' f S:. g'. 3. OTHER WAYS OF COMBINING EFFECTS In the previous section we began with a set of detectors V and formed the confidence algebra C(V) generated by V. The elements of C(V) were interpreted as effects that can be derived from detectors in V using QU ANTUM MECHANICAL MEASUREMENTS 0: u w v x 45 r Figure 8. l~ 0: u 0 v w "------11-------11··· . x r Figure 9. the "plus" partial operation EB and the orthosupplement operation '. By employing this EB partial operation, we can also define a "minus" partial operation. For f, 9 E C(V), if f :S g, then there exists an effect h such that f EB h = 9 and we write h = 9 e f. In this way, we have fEB(geJ) = g. Of course, h(x) = g(x)- f(x) for all x E [0, r] so gef is the usual difference of functions. However, e is a partial operation because it is only defined when f :S g. Just as EB has some of the properties of the usual plus + for numbers, e has some of the properties of the usual minus - for numbers. We shall pursue these properties in the next section, but now we consider other ways of combining effects. For effects f, 9 E C(V), we define the function h = max(J, g) by h(x) = max(J(x),g(x)) for every x E [O,r]. In other words, h(x) is the larger of f(x) and g(x) for every x E [0, r]. (If f(x) = g(x), then either one will do.) Although h: [0, r] ~ [0,1]' h need not be an effect even when fEB 9 exists. If f :S g, then max(J, g) = 9 which, of course, is an effect. For the detectors in Figure 5, max(fa,fb) = fa EB fb is an effect. However, these are very special cases. For the detectors in Figure 6, max(fa,fb) has the form illustrated in Figure 8. In general, this would not correspond to a detector or even an effect. In a similar way, we define i = min(J, g) by i(x) = min(J(x) , g(x)) for every x E [0, r]. Again, min(J, g) need not be an effect even when fEB 9 exists. For the detectors in Figure 6, min(fa,fb) has the form illustrated in Figure 9. Since C(V) need not be closed under max and min, these give partial binary operation on C(V). For effects f, 9 E C(V), the least upper bound f V 9 of f and 9 is the smallest effect that is greater than or equal to f and 9 in terms 46 STANLEY GUDDER of the order :S. In general, f V 9 need not exist so again V is a partial binary operation on C(V). In symbols, if h = f V 9 exists, then f :S h, 9 :S h and if i E C(V) satisfies f :S i, 9 :S i, then h :S i. There is a relationship between f V 9 and max(f, g). In fact, it is easy to check that if max(f, g) E C(V), then max(f, g) = fV g. However, if fV 9 exists, then f V 9 need not equal max(f, g). In a similar way, we define the greatest lower bound f !\ 9 of f and 9 as the largest effect that is less than or equal to f and 9 relative to the order :S. Again, f !\ 9 need not exist and !\ is a partial binary operation on C(V). As before, if min(f, g) E C(V), then min(f, g) = f !\ 9 but if f !\ 9 exists, then f !\ 9 need not equal min(f, g). A simple argument shows that if f V 9 exists, then f' !\ g' exists and f' !\ g' = (f V g)'. Moreover, if f !\ 9 exists, then f' V g' exists and f' V g' = (f !\ g)'. These equations are called De Morgan's laws. De Morgan's laws also hold for max(f, g) and min(f,g). Recall that a detector d is sharp if fd is the characteristic function of a set or equivalently if fd has only the values 0 or 1. In a similar way, we say that an effect f is sharp if f has only the values 0 or 1. It is clear that f is sharp if and only if f = f2. (This means that f(x) = (f(x))2 for every x E [0, rJ.) However, for our later work, a more useful characterization of sharpness is given in the following theorem. Theorem. For an effect f, the following statements are equivalent. (i) f is sharp. (ii) min(f, f') = fa. (iii) max(f, f') = fI. Proof. Suppose that f is sharp. Then f(x) = 0 or 1 for all x E [0, rJ. Moreover, f(x) = 1 if and only if f'(x) = 1 - f(x) = o. It follows that min(f, f') = fa. Conversely, suppose that min(f, f') = fa. If f(x) =f. 0, then 1 - f(x) = f'(x) = 0 so that f(x) = 1. Hence, f(x) = 0 or 1 for all x E [0, rJ so f is sharp. This shows that (i) and (ii) are equivalent. The equivalence of (iii) follows from De Morgan's laws. 0 4. EFFECT ALGEBRAS In Section 2, we discussed a method for combining two detectors in terms of a partial binary operation EB. In order to obtain a mathematical model with well behaved properties, we were forced to extend the definition of EB to effects that were derived from combinations of detectors. We also noticed that EB possesses various properties that the usual + operation of numbers possess. Now a detector corresponds to a position measurement for a particle, but there are many other measurements that an experimentalist may want to perform. For example, there could be measurements of energy, momentum, angular momentum, spin, charge, etc. QUANTUM MECHANICAL MEASUREMENTS 47 Moreover, the experimentalist may want to perform more complicated measurements involving the scattering or decay of many particles. For this reason, investigators have studied general sets of measurements and ways of combining them. We now present a mathematical framework or model for such studies [3, 4, 5]. An effect algebra is a system (P, 0,1, EB), where P is a set of elements called effects, 0 and 1 are distinct elements of P, and EB is a partial binary operation on P that has the following properties. (1) If a EB b is defined, then b EB a is defined and b EB a = a EB b. (Commutativity) (2) If a EB b and (a EB b) EB c are defined, then b EB c and a EB (b EB c) are defined and a EB (b EB c) = (a EB b) EB c. (Associativity) (3) For every a in P there exists a unique a' in P such that a EB a' is defined and a EB a' = 1. (Orthosupplementation) (4) If 1 EB a is defined, then a = O. (0 - 1 Law) For brevity, we frequently denote an effect algebra by the single letter P. We have seen in Section 2 that a confidence algebra is an effect algebra. For a and b in P, we write a ~ b if there exists a c in P such that b = a EB c. If a ~ b', we write a 1- b and say that a and b are orthogonal. We denote the least upper bound and greatest lower bound of a and b, if they exist, by a V b and a 1\ b, respectively. The next theorem summarizes some of the basic properties of effect algebras and its proof may be found in [4]. Theorem. Let P be an effect algebra. (i) ~ is a partial order relation on P such that 0 ~ a ~ 1 for all a in P. (ii) 0' = 1, l' = 0 and aEBO = a for all a in P. (iii) a EB b is defined if and only if a 1- b. (iv) a" = a for all a in P and a ~ b implies that b' ~ a'. (v) If a V b exists, then a' 1\ b' exists and a' 1\ b' = (a Vb)'. If a 1\ b exists, then a' V b' exists and a' V b' = (a 1\ b)'. When we discussed detectors, it was clear how sharp detectors should be defined. However, in the context of a general effect algebra P, such a definition is not at all obvious. We cannot say that a E P is sharp if a is a characteristic function because the elements of P may not be real-valued functions. For the same reason, the partial operations max and min do not make sense for a general effect algebra so the characterization given in the theorem in Section 3 cannot be used to define sharpness. In fact, there is a considerable controversy among researchers concerning the best definition of sharpness in an effect algebra. We shall consider one of the several definitions that have been proposed. One way of motivating this definition is to consider the theorem in Section 3 and 48 STANLEY GUDDER replace max by V and min by /\. This is legitimate because V and /\ make sense in an arbitrary effect algebra. To differentiate this definition from our previous definition of sharpness, we shall use the term crisp instead of sharp. Unfortunately, crispness and sharpness are not equivalent for confidence algebras. Indeed, our discussion in Section 3 shows that if an element f of a confidence algebra is sharp, then it is crisp, but f may be crisp without being sharp. In the most important mathematical model for quantum mechanics, namely the Hilbert space model mentioned in Section 5, crispness is strong enough to describe perfectly accurate measurements. An element a in P is crisp if a V a' exists and equals 1. This is equivalent to a /\ a' exists and equals o. Does an effect algebra have a stronger structure if all of its elements are crisp? To answer this question we introduce a definition and prove a theorem. An orthoalgebra is a system (P, 0,1, EEl) that satisfies conditions (1), (2), (3) and (4') if aEEla is defined, then a = O. Condition (4') states that 0 is the only isotropic element of P. Theorem (i) An orthoalgebra is an effect algebra. (ii) An effect algebra is an orthoalgebra if and only if all of its elements are crisp. Proof (i) Suppose P is an orthoalgebra. To show that property (4) holds, assume that 1 EEl a is defined. Since a' EEl a = 1, (a' EEl a) EEl a is defined. By property (2) a EEl a is defined so by (4'), a = O. (ii) Suppose P is an orthoalgebra and a is in P. Assume that b 2: a, a'. Then b' ::; a' ::; b = b" Hence, b' -1 b' so b' EEl b' is defined. Therefore, b' = 0 and b = 1. Thus, a V a' = 1 and a is crisp. Conversely, suppose P is an effect algebra in which every element is crisp. To show that P is an orthoalgebra, suppose that a EEl a is defined. Then a ::; a' so 1 = a V a' = a'. Hence, a = 0 so P is an orthoalgebra. 0 We can also define a "minus" operation on an effect algebra. If a ::; b, then there exists a c such that a EEl c = b. We then write c = be a. It is not hard to show that e satisfies the following three conditions for all a,b,c in P. (5) bea::; b (6)be(bea)=a (7) a ::; b ::; c implies c e b ::; c e a and (c e a) e (c e b) = be a A partially ordered set P with a greatest element 1 and a partial binary operation e such that a e b is defined if and only if a ::; b and that satisfies (5), (6) and (7) is called a difference poset [3]. It can be shown QU ANTUM MECHANICAL MEASUREMENTS 49 that an effect algebra and a difference poset are equivalent mathematical structures. In this way, we can use either EB or e to define a mathematical model for measurements. 5. EXAMPLES An effect algebra P is said to be a lattice if a V b and a 1\ b exist for all a, bin P. We call P distributive if P is a lattice and the distributive law al\(bVc) = (al\b)V(al\c) holds for all a, band c in P. We first present some examples of distributive effect algebras. Example 1. Let P be the unit interval [0,1] in R. For a, b in [0,1]' aEBb is defined if a + b :::; 1 and in this case a EB b = a + b. Then ([0,1]' 0,1, EB) is a distributive effect algebra with a' = 1 - a. In this example, :::; is the usual order of numbers so :::; is a total order relation. Notice that a is crisp if and only if a = or 1. Hence, the crisp elements of [0,1] form a suborthoalgebra of [0,1]. That is, if a and b are crisp and a EB b exists, then a EB b is crisp. 0 ° Example 2. Let X be a nonempty set and let P be the collection of all subsets of X. Let be the empty set, 1 = X, and for a, b in P, a EB b is defined if their intersection a n b = and in this case a EB b is their union aU b. Then (P, 0,1, EB) is a distributive effect algebra with a' equal to the complement a C • In this example, :::; is the same as set-theoretic inclusion <;;:;. Every element a in P is crisp because a V a' = a U a' = 1. Hence, P is an orthoalgebra (in fact, P is a Boolean algebra). 0 ° ° Example 3. Let X be a nonempty set and let P be the set of all functions f: X ---t [0,1] <;;:; R. Let o(x) = 0, l(x) = 1 be the constant and 1 functions, respectively, and for f, 9 in P, f EB 9 is defined if f(x) + g(x) :::; 1 for all x in X and in this case fEB 9 = f + g. Then (P, 0,1, EB) is a distributive effect algebra with l' = I-f. In this example, :::; is the usual order f :::; 9 if and only if f(x) :::; g(x) for all x in X. It follows that f V 9 = max(f, g) and f 1\ 9 = min(f, g). Hence, f is crisp if and only if f is the characteristic function of a subset of X. Thus, the crisp elements form a suborthoalgebra (even a sub-Boolean algebra) of P. The crisp elements may be identified with the collection of all subsets of X which reduces to Example 2. The structure of P corresponds to a fuzzy set theory or fuzzy logic and is closely related to our discussion of detectors in Sections 2 and 3. Moreover, P may be thought of as a ° STANLEY G UDDER 50 confidence algebra on the set X and in this case crispness and sharpness are equivalent. 0 In these last three examples, the crisp elements of an effect algebra formed a suborthoalgebra. This also applies to the Hilbert space effect algebras that we shall mention later. Is this always true? That is, in an effect algebra, if a and b are crisp and a EEl b is defined, is a EEl b necessarily crisp? The answer is no and the following counterexample is due to D. Foulis and R. Greechie. This is also an example of a nondistributive effect algebra. Example 4. Let P be the effect algebra with elements 0,1, a, b, c, a', b', c' and the following EEl table. In this table we do not include the trivial elements 0, 1 and dash indicates that the corresponding EEl is not defined. EEl a a' b b' a 1 b b' c' - 1 - - - c' - - - b' 1 a' - - a - C b' c' - - - c b c' - - - at - - - - - - 1 1 1 - Then a and c are crisp and a EEl c = b'. However, b' is not crisp because b' V b = b' =1= 1. Note that P is not a lattice and hence is not distributive. This is because a V band c V b do not exist. For example, c' and b' are minimal upper bounds for a and b. 0 The most important example of a nondistributive effect algebra is called a Hilbert space effect algebra. Hilbert space effect algebras are frequently employed as mathematical models for quantum mechanics. However, their definition involves some rather sophisticated mathematics and we shall not pursue them here. The interested reader can consult any of the following references. ACKNOWLEDGMENTS The author would like to thank J. Pykacz for a very careful reading of the preliminary version of this paper and for some useful suggestions for improving it. These suggestions were incorporated in the final version. QUANTUM MECHANICAL MEASUREMENTS 51 AFFILIATION Stanley Gudder Department of Mathematics and Computer Science University of Denver U.S.A. [email protected] REFERENCES [1] Busch, P., Lahti, P. and Mittelstaedt, P., The Quantum Theory of measurements, Springer Verlag, Berlin, 1991. [2] Davies, E.B., Quantum Theory of Open Systems, Academic Press, London, 1976. [3J Dvurecenskij, A. and Pulmannova, S., "Difference posets, effects and quantum measurements", Intern. J. Theor. Phys., 33, 1994, pp. 819-850. [4] Foulis, D. and Bennett, M.K., "Effect algebras and unsharp quantum logics", Found. Phys., 24, 1994, pp. 1331-1352. [5] Greechie, R. and D. Foulis, D. "The transition to effect algebras" , Intern. 1. Theor. Phys., 34, 1995, pp. 1-14. [6] Holevo, A.S., Probabilistic and Statistical Aspects of Quantum Theory, North-Holland, Amsterdam, 1982. [7] Kraus, K., States, Effects, and Operations, Springer-Verlag, Berlin, 1983. [8] Ludwig, G., Foundations of quantum mechanics Vol. I and II, Springer-Verlag, Berlin, 1983 and 1985. G. CATTANEO AND F. LAUDISA FROM LOGIC TO PHYSICS: THE LOGICO-ALGEBRAIC FOUNDATIONS OF QUANTUM THEORY 1. INTRODUCTION The relevance of the axiomatic method to the science and philosophy of our century need not be emphasized. The introduction and the development ofaxiomatization did not prove immensely useful only in the field of foundations of mathematics at the turn of the century, but also in the general analysis of the foundations of sciences. With particular reference to physics, axiomatic formulations of physical theories provide a framework in which all assumptions--especially the physical ones-are explicitly stated, a framework which enables one to clearly assess different approaches to particular problems of the theories and-by a specification of a mathematical model-to avoid mathematical inconsistencies: most important by a philosophical viewpoint, the common "universe of discourse" provided by the axiomatic framework enables one to analyse the conceptual foundations of a physical theory. This task appears to be especially important in the case of quantum mechanics, a physical theory full of interpretational difficulties in spite of its spectacular success by the experimental viewpoint. In this paper we will survey the quantum-logical approach to quantum mechanics, both in its historical and conceptual motivations and in some of its more recent formal developments and generalizations. In Section 2, the historical origins of the subject are reviewed, whereas in Section 3 the idea of sharp quantum logic and its formal structure are introduced. Finally, in Section 4, we will outline the main elements of the generalization of sharp quantum logic into the unsharp or fuzzy quantum logic, based on the possibility of defining degrees of membership of a value to a set (typically, a number interval). 2. THE ORIGIN S 0 F QUANTUM LOGIC: A SHORT HISTORY The investigations on the logical nature of quantum-mechanical statements historically arise from the attempt to apply the three-valued logic, 53 © 1999 Kluwer Academic Publishers. 54 G. CATTANEO AND F. LAUDISA introduced by Lukasiewicz [23] in 1920, to the domain of quantum phenomena. The suggestion of employing this new logic in the analysis of the formal structure of quantum theory derived essentially from the assumption that a logic satisfying the general principle of bivalence could not ground a physical theory such as quantum mechanics, that embodies Heisenberg uncertainty relations and therefore indeterminism. Although the first step was taken in 1931 by the Polish logician Zawirski, (cf. [16, p. 344]) the most systematic attempt to construct a suitable three-valued logic of quantum mechanics was to be developed in Reichenbach's book Philosophic Foundations of quantum mechanics [33]1. Relevant it might have been by a historical viewpoint, however, the rejection of a principle of bivalence was not the conceptual source of a possible logic of quantum theory. The successful development of a quantum logic lies in the rejection of the distributivity law, a condition that qualifies the algebraic structure of classical logic. This step, that gives rise officially to the logical (or logico-algebraic) approach to quantum mechanics can be said to develop from the same source as the standard mathematical formulation of quantum mechanics, namely from von Neumann's Die Mathematische Grundlagen der Quantenmechanik [25]. In this book, von Neumann proposes to take into account, in addition to the class of physical quantities relevant to a given physical system S, mathematically represented by self-adjoint operators on the Hilbert space H associated with S, the class of the properties of the states of the system: To each property t: we can assign a quantity which we define as follows: each measurement which distinguished between the presence or absence of t: is considered as a measurement of this quantity, such that its value is 1 if t: is verified, and zero in the opposite case [25, p. 249]. Since each property of this kind is shown to be represented by a projection operator on H, the relation between the properties of a physical system on the one hand, and the projections on the other, makes possible a sort of logical calculus with these. However, in contrast to the concepts of ordinary logic, this system is extended by the concept of "simultaneous decidability" which is characteristic for quantum mechanics. 2 Again the Heisenberg uncertainty relations force upon the theory a limitation, but this time the lack of overall "simultaneous decidability" will 1 As Max Jammer describes [16, pp. 364-370]' this book was subject to heavy criticisms, notably by Carl Hempel and Ernest Nagel. A defense of Reichenbach in [29] was likewise criticized by Feyerabend [8]. 2 von Neumann [25, p. 253], our emphasis. FROM LOGIC TO PHYSICS 55 yield not the rejection of bivalence but that of distributivity. Opening their seminal paper "The logic of quantum mechanics" [3]' Birkhoff and von Neumann explicitly state: One of the aspects of quantum theory which has attracted the most general attention is the novelty of the logical notions which it presupposes. It asserts that even a complete mathematical description of a physical system S does not in general enable one to predict with certainty the result of an experiment on S, and that in particular one can never predict with certainty both the position and the momentum of S (Heisenberg's Uncertainty Principle). It further asserts that most pairs of observations are incompatible, and cannot be made on S simultaneously (Principle of Non-commutativity of Observations). The object of the present paper is to discover what logical structure one may hope to find in physical theories which, like quantum mechanics, do not conform to classical logic. Our main conclusion, based on admittedly heuristic arguments, is that one can reasonably expect to find a calculus of propositions which is formally indistinguishable from the calculus of linear subspaces with respect to set products, linear sums, and orthogonal complements--and resembles the usual calculus of propositions with respect to and, or and not". Let us briefly outline then the essential lines of the Birkhoff-von Neumann approach. First of all we must stress that in the above quoted paper the authors take into account both the classical situation and the quantum one. They realize that these two cases are characterized by a propositional language based on the set 0 of all possible observations one can measure on a physical system. The concept of a physically observable "physical system" is present in all branches of physics, and we shall assume it. It is clear that an observation of a physical system 6 can be described generally as a writing down of the readings from various compatible measurements. Thus if the measurements are denoted by the symbols a1, ... , an, then an observation of 6 amounts to specifying numbers Xl, ... ,Xn corresponding to the different ak. It follows that the most general form of prediction concerning 6 is that the point (Xl' ... ' xn) determined by actually measuring (a1, ... , an) will lie in a subset t:. of (Xl' ... ' xn)-space. Hence, if we call the (Xl' ... ' xn)-spaces associated with 6, its "observationspaces, " we may call the subsets of the observation-spaces associated with any physical system 6, the "experimental propositions" concerning 6. In a footnote, Birkhoff and von Neumann point out that: One may regard a set of compatible measurements [(a1, ... , an)) as a single composite measurement [(which we denote by A))-and also admit non-numerical readings-without interfering with subsequent arguments. 3 [3, p. 1] of the reprinted version in [15] 56 G. CATT ANEO AND F. LA UDISA Among conspicuous observables in quantum theory are position, momentum, energy, and (non-numerical) symmetry. Therefore, in Birkhoff-von Neumann point of view the most general statement, or experimental proposition, which one can formulate about a physical system is that the value ofa physical magnitude (shortlyobservable) A which can be observed on it lies in a subset ~ of the set of possible values (shortly observation space) associated with A. A statement of this kind is represented by a pair (A,~) and corresponds to the elementary statement: val(A) E ~ = "the value of the observable (physical magnitude) A lies in the subset ~ of the observation space of A." This agrees with the following quotation of Varadarajan [34J: "The center of the stage of the present discussion is occupied by a physical system and the experimental propositions that are associated with it. ( ... ) if A is an observable, to each Borel set ~ of the real line lR is associated the proposition the value of A lies in ~." In the literature the "experimental proposition" of Birkhoff and von Neumann [3] and Varadarajan [34] is also called the "physical statement", "question", "theoretical sentence" , "elementary statement". experimental propositions (A, ~l)' (B, ~2)" .. which can be stated for a physical system are for instance "the particle has passed through the slit 1 of the screen", "the spin of the particle along the z direction is up", "the position of the particle is between 100Km and 250Km from a reference point of its trajectory", "the energy of the photon is between lOMe V and lOOMe V" and so on. Making use of the connectives "and" (&), "or" (Q), "not" ("""), it is possible to construct on the basis of elementary sentences r = (A, ~l)' s = (B, ~2), ... the complex sentences "r & s" , "rQs" , ".....,r" of the sentential language of the involved physical theory. For instance the sentence "the particle has passed through the slit 1 or through the slit 2", is a complex sentence of the form "( Q, 1) Q (Q, 2)"; or the sentence "the particle has position in the interval [Xl, X2] and velocity in the interval [VI, V2J" , is a complex sentence of the form "(Q, [XI,X2]) & (P, [VI.V2])". As another example involving a system consisting of two separated particle (1 + 2), the statement "the spin of particle 1 along the z direction is "up" in region ~1 and the spin of particle 2 along the x direction is "down" in region ~2", is described by a complex sentence of the form "(8}1), ~r) & (812 ), ~)". In the case of a classical physical system (J"c with n degrees of freedom it is possible to associate a set 8 (called the phase space of (J"c): this set is representable as the Cartesian product M x lRn (~ lR 2n ), where M (~ lRn) is the set of all the possible configurations of the system and lRn is the set of momentum vectors. The points in 8 are in one-to-one cor- FROM LOGIC TO PHYSICS 57 respondence with the mechanical states of (jc, since this is characterized in each state by a particular configuration and a particular momentum vector. Let us now denote by £(S) a (j-algebra of subsets of S, whose elements are interpreted as classical events. The set-theoretical operations of intersection, union, and complementation on the Boolean (j-algebra £(S) resembles the usual calculus of proposition with respect to and, Of, and not. The physical quantities A are mathematically represented, first of all, by real valued functions fA on S associating with any classical state XES the real number fA (x) E JR.; this real number represents the value possessed by A in the classical state x. For instance, the energy of a classical system is a function associating with any state of the system a real number: the energy of the particle in this state. But, in order to be a representation of an observable A, for any Borel set f::. of the real line the collection f"Al(f::.) of all states xES in which the observable A takes values fA (x) in ~ must be a collection of classical events: formally for any ~, the collection of classical states fA I (~) = {x E S : fA (x) E ~} must be an event from £ (S); for example, the set of all classical states in which a particle possesses an energy between 10 and 25 Joule must be an event from £(S); similarly, both the set of all classical states in which it possesses a position in the interval [Xl, X2] and in which it possesses a velocity in the interval [VI, V2] must be events. In this way the map fAI associating with any subset of possible values ~ E 8(JR) the collection E £(S) is an event-valued measure, realizing the obof states fAI(~) servable A. Since the physical quantity A is represented by a real-valued function fA on Sand fAI is an event-valued measure, it is possible to set up a correspondence between certain physically relevant statements about (jc, e.g., (A, ~), and certain subsets of the phase space S, e.g., (classical events) fAI(~). In this way the propositional language ofpossible statements about (j c (closed with respect the connectives "and", "or", "not") is realized by the logic of (j c, namely, the set of possible properties (events) of the system (closed with respect to set theoretic operations of "intersection", "union", "complement"). When a quantum system (jq is taken into account, the "quantum phase space" is a complex Hilbert space H, whose non-zero vectors represent states of (jq. The relevant physical quantities are represented by selfadjoint operators on H, whereas the quantum events are represented by orthogonal projectors on H. The role of the algebra of subsets of the phase space-as structure of the possible properties of a quantum system-is played by a more general structure than a Boolean algebra, i.e., an orthomodular complemented lattice. The key point is that this structure is more general exactly in that it fails to satisfy the distribu- 58 G. CATTANEO AND F. LAUDISA tivity law. 4 The Birkhoff and von Neumann paper was reviewed in 1937 by Alonzo Church, who described the authors' position as founded on the assumption that "the advance of an experimental science may some day require revision of the system of logic on which the mathematical theory of the science is based". However, such paper remained virtually ignored for twenty-five years, until the physicists of the "Geneva group" (J.M. Jauch, G. Emch, C. Piron and others) and, on the other side of the ocean, G.W. Mackey re-discovered it, giving rise to the area of research properly known today as quantum logic. 5 The possible philosophical import of the quantum-logical ideas was emphasized in some Finkelstein's papers of the sixties and early seventies, according to which the inquiry on the laws governing the physical world can be pursued at three levels of increasing abstraction and generality: the mechanical, the geometrical and finally the logical level. Physics has many layers, and one of the deeper layers, that of world geometry, has undergone a profound upheaval in our times. As a result of this revolution, certain geometric ideas of Riemann have been generally accepted, and it is entirely in order for a physicist to contribute his present conception of the world geometry. [... ] We are presently in the midst of an analogous development of the logic of physical systems, a stratum that conceptually underlies even that of physical geometry. I think that besides mathematical logic there is now also a physical or world logic, different in principle and describing at a very deep and general level the way inanimate physical systems interact. To the extent that there are physical systems, let them be men or machines, that behave the way symbolic systems are supposed to behave, much of mathematical logic recurs as a special or limiting case of physical logic, but more general physical systems may in principle and do in fact obey more general laws. [... ] It is therefore appropriate to indicate how it is at all possible. let alone necessary, to discover laws of physical logic from experience [10, pp. 48-49] (See also [9]). The analogy with non-Euclidean geometry actually was the starting point of the widely discussed Putnam's paper "Is Logic Empirical?" [30] where, with reference to the quantum-logical approach to quantum mechanics, he argued that some "necessary truths" of logic might turn out to be false for empirical reasons: empirical data or results might force us to deny some logical laws and reinforce the idea that logic too is an empirical science. 6 4 5 [3], pp. 9-10 of the reprinted version in [15]. Twenty-five years of silence explain also why Popper [28] could argue as late as 1968 for no less than the logical inconsistency of the Birkhoff and von Neumann paper. For a recent, thorough analysis of Popper's misinterpretation of Birkhoff and von Neumann paper, cf. [6]. 6 For a review of the debate engendered by this Putnam's claim, see [32]. FROM LOGIC TO PHYSICS 59 3. THE FORMAL STRUCTURE OF SHARP QUANTUM LOGIC After more than two decades of silence about the quantum-logical approach to the foundations of quantum mechanics, G.W. Mackey gave a twist to the field in the early sixties. In his Mathematical Foundations of quantum mechanics [24], Mackey assumed the notions of state and observable for a given physical system (J' as primitive, and defined a probability measure expressing the probability distribution of a given observable in a given state: if 0 is the set of the observables of (J', S the set of the states of (J' and B(lR) is the set of all Borel subsets of the real line, a (probability) function P : S x 0 x B(lR) f---t [0,1] is defined such that the quantity between 0 and 1, for 0: E S and A E 0 fixed, P(o:, A, ~), is the probability that a measurement of the observable A in the state 0: yield a value lying in ~, for any ~ E B(lR). The two sets of states and observables and the function P had to satisfy six preliminary axioms according to which: - for 0: E S and A E 0 fixed, P is a probability measure on the Borel subsets of the real line; - a pair of observables, in order to be distinguishable, have to have a different probability distribution in at least one state (principle of indistinguishability for observables); - a pair of states, in order to be distinguishable, have to ascribe different probability distributions to at least one observable (principle of indistinguishability for states); - any (Borel) function of an observable is still an observable (corresponding to the same measuring apparatus with a change of its reading scale according to the Borel function); - the set of states is convex, namely any convex combination of different states is still a state (corresponding to the possibility of a statistical mixing of states); - for any sequence of orthogonal questions there always exists their "sum". Let us remark that if A is an observable with set of possible real values KA ad f : lR f---t IR. is any Borel real function, then f(A) is the observable whose set of possible real values is {J(k) : k E KA}, as a result of the change of the reading scale of the instrument which measure both A and f(A). For example, we can observe the "deflection angle" (in radians) of a beam of electrons under the influence of a uniform magnetic field; with a change of the reading scale of the instrument we can read the same experimental observation as "charge over mass" (in coulombs/gram). 60 G. CATTANEO AND F. LAUDISA The last axiom of the list refers to the notion of a question, precisely a particular kind of observable introduced by Mackey, whose set of values includes only 0 and 1. The set [; of these particular observables was singled out from the whole set of the observables of (]" via the application of characteristic functions Xl> : JR f-+ [0, 1] of Borel subsets .6. of JR, whose only values can be 1 (when the point x from JR belongs to.6.) and 0 (when the point x does not belong to .6.). To be precise, if A is any observable with set of possible values KA and Xl> is the characteristic function of a Borel set .6., then the question Xl>(A) is an observable whose set of possible values {Xdk) : k E KA} consists only of 0 or 1. From the above axioms it turns out that for any state a E S, P(a, A,.6.) = P(a, Xl>(A), {I}), namely, for any ~ E B(lR) the probability that a measurement of the observable A in the state a yields a value lying in ~ is just the probability that a measurement of the question Xl>(A) in the state a yield the value 1. The set of the questions obtained in this way can be ordered with respect to a suitable order relation based on the probability measures on the questions (intuitively, a question el is "less or equal" to a question e2 if and only if the probability that a measurement of el in any state a yield 1 is less or equal to the probability that a measurement of e2 in the same state a yield 1; formally, for any state a, P(a, el, {I}) :::; P(a, e2, {I} )). The resulting structure is that of a partially ordered set, which can be endowed with several important mathematical properties provided one makes some further assumptions, though very natural by a mathematical viewpoint. 4. DOES MEMBERSHIP TO A SET ADMIT DEGREES? THE UNSHARP GENERALIZATION OF QUANTUM LOGIC The physical idea underlying the notion of a question is simple. An experimental apparatus can be associated with the pair composed by the physical quantity A and the numerical interval .6. such that the reading scale of A is covered leaving open only the particular portion of the scale corresponding to .6.. Given that the physical system is prepared in a state a, in the event of a measurement of A in the state a, the value of Xl> (A) will be 1 if the pointer is found in the portion of the scale left open by the cover, and 0 otherwise [2, Ch. 13]. With a terminology that will be useful later, a macroscopic device of sharp localization can be said to correspond to the characteristic function introduced above. This 61 FROM LOGIC TO PHYSICS device can be represented as a test of the membership of a value to the fixed numerical interval ~. It is now worth pointing out a simple mathematical fact. When we have defined the characteristic function, we have stated that the only possible values were 0 and 1, whereas the proper range was the interval [0,1]. The generalization toward an unsharp (or fuzzy) generalized formulation of quantum logic can be summarized in the idea of "taking seriously" all the numbers in the interval as possible values: the notion of membership would then admit degrees, associated with all the numbers between 0 and 1. The unsharp quantum logic originates then with the application of some suggestions coming from fuzzy logic and fuzzy set theory to the logico-algebraic foundations of quantum mechanics. Fuzzy logic and fuzzy set theory was originally developed in the sixties by Lotfi Zadeh, in order to deal with approximate reasoning by a logical and cognitive viewpoint: according to him "[ ... ] the notion of a fuzzy [device] is a convenient point of departure for the construction of a conceptual framework [... ] which provides a natural way of dealing with problems in which the source of imprecision is the absence of sharply defined criteria of class membership" [36]. As emphasized in a later paper, Traditionally logical systems have aimed at the construction of exact models of exact reasoning-models in which there is no place for imprecision, vagueness or ambiguity. In a sharp break with this deeply entrenched tradition, the model of reasoning embodied in fuzzy logic aims, instead, at an accommodation with the pervasive imprecision of human thinking and cognition. [... ] To provide an appropriate conceptual framework for approximate reasoning, fuzzy logic is based on the premise that human perceptions involve, for the most part, fuzzy sets, that is, classes of objects in which the transition from membership to non-membership is gradual rather than abrupt." [1, p. 105] Let us return then to the set of questions and let us briefly investigate the consequences of its "fuzzification". To this aim, we generalize the notion of macroscopic device of sharp localization into the notion of concrete macroscopic device of localization, i.e., when a ~ E 8(JR) is fixed, any mapping W,6. : JR 1---+ [0,1] whose range consists not only of the two numbers and 1, but of the set of all real numbers in the real unit interval [0,1]. For any point x the real number w,6.(x), between and 1, represents the degree of membership of x in ~ The application of the sharp localization mappings X,6. gives rise to the (partially ordered) set of questions, so that the question naturally arises of what is the set of objects generated by the application of the unsharp localization mappings W,6.. Such objects turn out to be the so-called effects [19] [20] [21] [22], that may be intuitively seen as the mathematical ° ° 62 G. CATTANEO AND F. LAUDISA representatives of approximate properties instantiated by the physical system in the framework of a concretely realizable experiment. In analogy with the Zadeh's quotations concerning logical systems, the spectrum of the properties of physical systems that we can investigate can be much wider if we allow approximation into our commonly idealized descriptions of physical systems. As far as quantum mechanics itself is concerned, the unsharp or fuzzy approach (sometimes called also operationa0 was originally developed in different areas of fundamental quantum physics such as, respectively, the quantum field theory [14J and the quantum theory of open systems [7J. In a more abstract and axiomatic vein, however, an unsharp generalization of the main elements of the standard Hilbert space formulation of orthodox quantum mechanics has been elaborated in more recent times, usually in the context of a realistic and individual interpretation of quantum theory (see, e.g., [4]). If we agree to consider the notions of state and observable as the fundamental ones, the unsharp approach to Hilbert space quantum mechanics has generalized the notion of observable as represented by a PV (projection-valued) measure into the notion of POV (positive operator-valued) measure, since the already available generalization of notion of the state vector into the notion of density matrix has been shown to be the most general mathematical representation of a quantum state. As recalled above, a sharp property of a quantum system is represented in a Hilbert space framework by a projection operator, i.e., an idempotent self-adjoint bounded operator whose spectrum includes only the values 0 and 1. The set of these operators has a lattice structure and this lattice can be considered as the totality of the sharp properties of the system. However, if approximation has to be taken into account, the notion of sharp property of a quantum system has to be relaxed, in order to envisage a class of generalized events that instantiate properties testable on the system in an unsharp and approximate way. Each element of this class of approximate properties is mathematically represented by an effect operator, namely a (non-idempotent) self-adjoint bounded operator whose spectrum includes all values between 0 and 1. The effects represent the events of particular measurement outcomes, and the expectation values of the effects represent the probabilities of these events. Unlike the case with sharp properties, one is no more entitled to state that, after a certain measurement on a quantum system, a property either holds or it does not hold, but rather that such property holds approximately, with the degree of approximation given by the probability of occurrence of the effect that is tested in the experiment. FROM LOGIC TO PHYSICS 63 AFFILIATION Gianpiero Cattaneo Dipartimento di Scienze dell' InJormazione Universita di Milano Italy Frederico Laudisa Dipartimento di Filosofia Universita di Firenze Italy REFERENCES [1J Bellman, R.E., Zadeh, L.A., "Local and fuzzy logics", in: Dunn, M., Epstein, G. (eds.), Modern Uses oj Multiple- Valued Logic, Dordrecht, Reidel, 1977. [2] Beltrametti, E., Cassinelli, G., The Logic oj quantum mechanics, Cambridge, Mass., Addison-Wesley, 1981. [3] Birkhoff, G., von Neumann, J., "The logic of quantum mechanics", Annals oj Mathematics, 37, 1936, pp. 823-843 (reprinted in [15])). [4] Busch, P., Grabowski, M., Lahti, P., Operational quantum mechanics, Berlin, Springer, 1996. [5] Cattaneo, G., Laudisa, F., "Axiomatic unsharp quantum theory", Foundations oj Physics, 24, 1994, pp. 631-681. [6] Dalla Chiara, M.L., Giuntini, R., "Popper and the logic of quantum mechanics", 1995, preprint. [7] Davies, E.B., Quantum Theory oj Open Systems, London, Academic Press, 1976. [8] Feyerabend, P.K., "Reichenbach's interpretation of quantum mechanics", Philosophical Studies, 9, 1958, pp. 49-59. [9] Finkelstein, D., "Matter, space and logic", Boston Studies Philosophy oj Science, vol. 5, 1969. [10] Finkelstein, D., "The physics of logic", in: Colodny, R.G. (ed.), Paradigms and paradoxes, University of Pittsburgh Press, 1972. [11] van Fraassen, B.C., "Semantical analysis of quantum logic", in: Hooker, C.A. (ed.), Contemporary Research in the Foundations oj Quantum Theory, Reidel, Dordrecht, 1973. [12] van Fraassen, B.C., "The labyrinth of quantum logics", in: Cohen, R. S. and Wartofsky, M.W. (eds.), Boston Studies in the Philosophy oj Science, Vol. XIII, Reidel, Dordrecht, 1974. 64 G. CATTANEO AND F. LAUD IS A [13] Gudder, S.P., Quantum Probability, Academic Press, NY, 1988. [14] Haag, R., Kastler, D., "An algebraic approach to quantum field theory", Journal of Mathematical Physics, 5, 1964, pp. 846-86l. [15] Hooker, C.A. (ed.), The Logico Algebraic Approach to quantum mechanics, vol. I, Dordrecht, Reidel, 1975. [16] Jammer, M., The Philosophy of quantum mechanics, New York, Wiley and Sons, 1974. [17] Jauch, J .M., Foundations of quantum mechanics, Addison-Wesley, Reading, Mass., 1968. [18] Jauch, J.M., and Piron, C., "On the structure of quantal proposition systems", Helvetica Physica Acta, 42, 1969, p. 842. [19] Ludwig, G., "The measuring process and an axiomatic foundation of quantum mechanics", in: d'Espagnat, B. (ed.), Foundations of quantum mechanics, New York, Academic Press, 1971. [20] Ludwig, G., "Measuring and preparing processes", in: Hartkamper, A., Neumann, H. (eds.), Foundations of quantum mechanics and Ordered Linear Spaces, Berlin, Springer, 1974. [21] Ludwig, G., "A theoretical description of single microsystem", in: Price, W.C., Chissick, S.S. (eds.), The Uncertainty Principle and Foundations of quantum mechanics, New York, Wiley and Sons, 1977. [22] Ludwig, G., Foundations of quantum mechanics, New York, Springer, 1983. [23] Lukasiewicz, J., "0 logice trojwartosciowej" (On three-valued logic), Ruch FilozoJiczny, 5, 1920, pp. 169-171 (reprinted in: Selected Works, Amsterdam, North-Holland, 1970, pp. 87-88). [24] Mackey, G.W., The Mathematical Foundations of quantum mechanics, New York, Benjamin, 1963. [25] Neumann, von J., Die Mathematische Grundlagen der Quantenmechanik, Berlin, Springer, 1932. (English translation The Mathematical Foundations of quantum mechanics, Princeton, Princeton University Press, 1955). [26] Piron, C., "Axiomatique quantique", Helvetica Physica Acta, 37, 1964, p. 439. [27] Piron, C., Foundations of Quantum Physics, Benjamin, Reading, Mass., 1976. [28] Popper, K.R., "Birkhoff and von Neumann's interpretation of quantum mechanics", Nature, 219, 1968, pp. 682-685. [29] Putnam, H., "Three-valued logic", Philosophical Studies, 8, 1957, pp. 73-80. FROM LOGIC TO PHYSICS 65 [30] Putnam, H., "Is logic empirical?", Boston Studies Philosophy of Science, vol. 5, 1969, pp. 216-24l. [31] Redhead, M.L.G., Incompleteness, Nonlocality and Realism, Oxford, Clarendon Press, 1987. [32] Redhead, M.L.G., "Logic, quanta and the two-slit experiment", in: Hale, R., Clark, P. (eds.), Reading Putnam, Oxford, Blackwell, 1994. [33] Reichenbach, H., Philosophic Foundations of quantum mechanics, Berkeley-Los Angeles, University of California Press, 1944. [34] Varadarajan, V.S., "Probability in physics and a theorem on simultaneous observability", Communications in Pure and Applied Mathematics, 15, 1962, p. 189. [35] Varadarajan, V.S., Geometry of Quantum Theory, Vol. I, Van Nostrand, Princeton, 1968. [36] Zadeh, L.A., "Fuzzy Sets", Information and Control, 8, 1965, pp. 338-353. J AROSLA W PYKACZ NON-CLASSICAL LOGICS, NON-CLASSICAL SETS, AND NON-CLASSICAL PHYSICS 1. INTRODUCTION Quantum physics and many-valued logics were born nearly simultaneously in the third decade of the XX Century. However, the early attempts at identifying logic able to describe quantum systems with some versions of a three-valued logic failed and the opinion that "quantum logic", although non-classical, is a two-valued logic prevailed. The recently observed revival of interest in applying many-valued logics to the description of quantum physical systems is closely connected with a new and rapidly developing branch of mathematics: the fuzzy set theory. Fuzzy sets, which remain in the same relation to the infinite-valued logic as traditional sets to the classical two-valued logic, form a bridge by which one can pass from the "orthodox" quantum logic in the Birkhoffvon Neumann sense to the infinitely-valued Lukasiewicz logic. The paper is organized as follows: A brief introduction to many-valued logics is given in Section 2 while Section 3 gives rudiments of the fuzzy set theory and Section 4 shows its links with infinite-valued Lukasiewicz logic. The aim of Section 5 is to amuse and surprise the reader by some mysteries of the microworld based on the most standard example of Young double-slit experiment and finishes with some indications that many-valued logic could be better-suited for the description of this experiment. Sections 6 and 7 contain short historical survey of already made attempts at applying non-classical logics to the description of quantum phenomena out of which only Birkhoff-von Neumann proposal of using two-valued but non-distributive logic gained wide popularity and is still in use nowadays. The paper is concluded by Section 8 in which further indications for using many-valued logics in quantum physics are given and it is shown that it is possible to represent Birkhoff-von Neumann experimental propositions as propositional functions that belong to the domain of Lukasiewicz infinite-valued logic. After such "translation" Birkhoff-von Neuman quantum logic becomes a (partial) infinitevalued Lukasiewicz logic which solves the long-lasting problem of finding the proper models for disjunction and conjunction, and unifies two competing approaches: many-valued, and two-valued but non-distributive, existing in the quantum logic theory since its very beginning. 67 © 1999 Kluwer Academic Publishers. 68 JAROSLAW PYKACZ 2. NON - C LAS SIC ALL 0 G I C S The classical, two-valued logic deals exclusively with statements which can be unambiguously classified as being either true or false. Other statements simply do not belong to the domain of classical logic. In particular this refers to the statements concerning future events and this problem was noticed already by Aristotle who considered since then numerously quoted statement: "There will be a sea battle tomorrow". It should be mentioned that despite the tradition of naming the classical two-valued logic "Aristotelian logic" there are indications [1] that Aristotle himself classified "future contingents", i.e., statements about future events which are not yet decided, as neither true nor false!. In the Middle Ages the problem of future contingents was also discussed and it seems that such thinkers as Duns Scotus and William of Ockham in the thirteenth and the fourteenth centuries and Peter de Rivo in the fifteenth century considered such statements as indeterminate. Modern attempts at establishing non-classical logical systems, mostly three-valued ones, begun in the end of the nineteenth century. In 1897 Hugh MacColl investigated so called "three-dimensional logic" and in 1909 Charles Peirce considered "triadic logic" as a possible basis for "trichotomic mathematics". In 1919 Nicolai Vasil'ev in Kazan, Russia, built a system of three-valued "imaginary (non-Aristotelian) logic" whose name obviously referred to "imaginary (non-Euclidean) geometry" presented for the first time at the same university 84 years earlier by Nicolai Lobachevskij. Jan Lukasiewicz is generally recognized as a founding father of the modern theory of many-valued logics and his numerous papers on this subject published since 1920 until his death in 1956 are well-known. Contrary to these papers, his booklet "Die logischen Grundlagen der Wahrscheinlichkeitsrechnung" [3] published in 1913 although evaluated as "one of Lukasiewicz's most valuable works" in the foreword to Lukasiewicz's "Selected Works" [2], is relatively less-known. In this booklet he considered statements containing a variable, e.g., "x is an Englishman" and he attributed to them truth-value equal to the ratio of the number of values of a variable for which this statement is true to the total number 1 Lukasiewicz in many of his papers [2] claimed that the law of bivalence is actually due to the Stoics, especially Chrisippus who " ... appears to have been the first logician to consciously set up and stubbornly defend the theorem that every proposition is either true or false" (quotation from [1]). Therefore, Lukasiewicz proposed to call his many-valued logic "non-Chrisippean" rather than "non-Aristotelian". NON-CLASSICAL LOGICS 69 of values of this variable. Since he assumed the total number of values of a variable to be finite, the logic thus obtained is n-valued with n being a natural number depending on the particular situation described by a sentence. Lukasiewicz's principal aim in his 1913 paper [3] was to give the logical background to the notion of probability which at that time was much more alien to the rest of mathematics than it is now. An n-valued non-classical logic, which nowadays can be classified as a probability logic was only a kind of a by-product of these efforts and never gained such popularity as his later versions of many-valued logics investigated since 1920. The year 1920 is generally recognized as the year of the birth of the modern theory of many-valued logics. Actually, in this year two seminal papers on this theory were published independently by Jan Lukasiewicz in Poland [4] and by Emil Post in the USA [5]. Lukasiewicz arrived at his construction of a three-valued logic after the long period of philosophical investigations concerning the problem of determinism (cf. his numerous papers collected in [2], especially [6]), and of modal propositions, i.e., propositions of the form it is possible (impossible, contingent, necessary) that... [1]. He openly declared himself as devoted adherent of indeterdeclared a spiritual war minism who, according to his own words [7], upon all coercion that restricts man's free creative activity". Chrisippean law of bivalence which states that every proposition is necessarily either true or false occurred in this war to be a fortress which had to be blown up since it blocked the way towards indeterminism: Lukasiewicz argued that determinism follows necessarily from the law of bivalence, not from the law of excluded middle which only states that the disjunction of any proposition and its negation, e.g., "there will be a sea battle tomorrow OR there will not be a sea battle tomorrow" is always a true proposition. According to Lukasiewicz, who also claimed that this had been the original position taken by Aristotle, such disjunction remains true even though both its constituents can be neither true nor false. Lukasiewicz usually interpreted the third truth value as "indeterminacy,,2 and in the majority of his papers on three-valued logic he denoted this additional truth-value by the number 1/2. It is possible that in the beginning he did it just because 1/2 lays between 0 and 1- the generally accepted symbols of falsehood and truth, but later on this choice occurred to be very fortunate since it made the generalization of his three-valued logic to an n-valued or infinite-valued logic almost straightforward. (I••• 2 In the beginning [4] he interpreted it as "possibility" but later on he abandoned this idea probably in order not to confuse this name with the name of the possibility functor of modal logic. 70 JAROS LAW PYKACZ Lukasiewicz basic idea was to supplement two-valued logic with a third truth value in such a way that the obtained three-valued logic should deviate least from the ordinary logic. He did it by adopting the following truth table for implication: -+ 0 "2 1 1 o 1 1 1 1 1 1 "2 "2 1 1 1 0 "2 1 Table 1. Truth values of implication p -+ q (p implies q or if p then q) in Lukasiewicz three-valued logic. and by assuming that the following formulas which are in fact tautologies in two-valued logic define, respectively, negation, disjunction, conjunction, and equivalence3 : negation: de! -,p = p -+ (1) 0 (not p means the same as p implies falsehood) disjunction: pVq de! = (p-+q ) -+q (2) (p or q means the same as (p implies q) implies q) conjunction: p 1\ q de! = -,( -,p V -,q) (3) (p and q means the same as not (not p or not q)) equivalence: p == q de! = (p -+ q) 1\ (q -+ p) (4) (p and q are equivalent means the same as p implies q and q implies p) It can be easily checked that these definitions together with Table 1 yield the following truth tables for negation, disjunction, conjunction, and equivalence in Lukasiewicz three-valued logic: 3 Lukasiewicz did not consider three-valued equivalence in [4]. For the first time he did it in the realm of many-valued logics in 1922 [8] but the formula (4) is a standard one in two-valued logic and it was often used in many of Lukasiewicz papers published both before and after 1920. NON-CLASSICAL LOGICS o 1 1 1 "2 1 "2 0 Table 2. Truth values of negation....,p v 0 2 0 0 1 1 "2 1 "2 1 "2 1 71 "2 1 1 1 1 1 1 1 (not p). Table 3. Truth values of disjunction p V q (p or q). A 0 "2 1 1 0 0 0 0 0 0 1 "2 1 1 1 "2 "2 "2 1 1 Table 4. Truth values of conjunction p A q (p and q). o "2 o 1 1 1 "2 1 "2 o 1 1 "2 1 o 1 "2 1 "2 1 1 == q (p is equivalent to q or p if and only if q). Table 5. Truth values of equivalence p When the first break-through was made, further generalization to nvalued logics and infinite-valued logics was not so difficult and Lukasiewicz actually did it soon afterwards [8, 9]. Of course except for the case of n-valued logics with n being relatively small number, truth values of logical functors cannot be presented in the form of tables. Fortunately, Lukasiewicz found algebraic expressions, the same for all systems of many-valued logics (both finite and infinite-valued), which yield truth 72 JAROSLAW PYKACZ values of compound propositions as functions of their constituents, i.e., all Lukasiewicz many-valued logics are truth-functional. The basic logical functor for Lukasiewicz was implication and the following expression allows to calculate truth value of implication t(p -7 q) when the truth values t(p) and t(q) of the antecedent p and the consequent q are known [8,9]: t(p -7 q) = min[1- t(p) + t(q), 1]. (5) It can be easily checked that this formula applied to definitions (1)-(4) yields the following formulas for truth values of the remaining logical connectives: = 1 - t(p) (6) t(p V q) = max[t(p) , t(q)] (7) t(p 1\ q) = min [t(p) , t(q)] (8) t(-.p) t(p == q) = 1-lt(p) - t(q)l. (9) Lukasiewicz assumed that the set of truth values of an n-valued logic with 0 ~ k ~ n - 1. It is consists of all fractions of the form n~l straightforward to check that the formulas (5)-(9) yield Tables 1-5 for n=3. Lukasiewicz was not so much concerned with the interpretation of his subsequently developed n-valued (for n > 3) or infinite-valued logics as with the interpretation of the third truth value of his original threevalued logic, but he mentioned in [8] that "... 0 is interpreted as falsehood, 1 as truth, and the other numbers in the interval 0-1 as the degrees of probability corresponding to various possibilities ... ". Therefore, it is clear that at least in the year 1922 he still maintained his idea, expressed for the first time in the year 1913 [3], of interpreting non-classical truth values as degrees of probability. Contrary to Jan Lukasiewicz, the second founding father of the modern theory of many-valued logics: Emil Leon Post does not seem to be very much concerned with interpretation of non-classical truth values. His investigations were not so much founded on philosophical considerations but were rather of formal algebraic nature. Loosely speaking we can say that he studied algebraic aspects of n-valued logics without bothering to express them linguistically and in this respect his papers [5, 10] are more close in their style to modern treatises on many-valued logics (see, e.g., [11]) than contemporary papers by Lukasiewicz. Post based his n-valued propositional calculi on the linearly ordered set of truth values {tl' t2, ... t n } where the extreme elements express ''full NON-CLASSICAL LOGICS 73 truth" and ''full falsehood" and he followed Whitehead and Russell's Principia Mathematica [12] in choosing negation and disjunction as basic connectives. However, although his disjunction was the same as that of Lukasiewicz, Le., its truth value was the bigger of truth values of its constituents, Post's basic negation, which could be called cyclical was quite different 4 : p -,p Table 6. Truth values of Post's "cyclical" negation. Because of so defined negation if other connectives (conjunction, implication, and equivalence) are defined with the aid of tautologies known from two-valued logic, then they exhibit rather unexpected and counterintuitive features. In spite of this fact in modern times Post logics find their application in studying electronic networks and in Computer Science [13, 14]. It should be also mentioned that due to this particular negation n-valued propositional calculi of Post, contrary to these of Lukasiewicz, are functionally complete: any conceivable connective can be defined by basic connectives of negation and disjunction. Of course no intuitive interpretation can be given to the vast majority of connectives obtained in such a way. After the first break-through made by Lukasiewicz and Post many other systems of three-valued and n-valued propositional calculi were proposed. I can mention here three-valued calculi of Kleene [15]' [16]' Bochvar [17], and Finn [18] motivated by epistemological considerations concerning lack of meaning of some statements, a similarly motivated group of papers dealing with so-called nonsense-logics [19, 20, 21, 22], attempts at describing intuitionistic propositional calculus in terms of many-valued logics [23, 24], or papers motivated by considerations concerning peculiarities of quantum physis [25, 26, 27, 28, 29, 30, 31, 32J. Mathematically experienced reader can find a recent survey of most of the above-mentioned three-valued logics and some n-valued logics [33, 4 Besides of this "cyclical" negation Post considered also the other negation, identical with that of Lukasiewicz. However, it seems that he treated "cyclical" negation as more important. 74 JAROS LAW PYKACZ 34] in Chapters 3 and 4 of a recent book [11] by Bole and Borowik. Some examples of many-valued logics motivated by physical considerations are described in the Chapter 8 of Jammer's book [35]. It should be mentioned that many-valued Lukasiewicz logic endowed with negation (1), (6), disjunction (2), (7), and conjunction (3), (8) was criticized by Gonseth [36] in 1938 since it satisfies neither the law of excluded middle t(p V --,p) = 1 (it is true that p or not p) (10) nor the law of contradiction t(p A --,p) = 0 (it is false that p and not p). (11) Actually, neither of these formulas is satisfied for t(p) =I- 0,1, for example they both assume truth value! for t(p) = !. Most probably Gonseth did not know Polish so he could not have read the paper [37] published by Zawirski already in 19345 in which Zawirski noticed that if we replace the right-hand side of the formula (2) by which Lukasiewicz defined disjunction in [4] by the other (in two-valued logic equivalent) expression: --'p --7 q, then the disjunction obtained in this way: pUq t(p U q) de! = --,p --7 q = min[t(p) + t(q), 1] (12) and the conjunction adjoint to it by De Morgan's Law (3): pnq t(p n q) de! = --,( --,p U --,q) max[t(p) + t(q) - 1,0] (13) satisfy both the law of excluded middle: t(p U --,p) = min[t(p) + 1 - t(p), 1] = 1 (14) and the law of contradiction: t(p n --,p) = max[t(p) + 1 - t(p) - 1,0] = O. 5 The same idea was published in English by Orrin Frink Jr. in 1938 in [38]. (15) NON-CLASSICAL LOGICS 75 Therefore, Gonseth's critique cannot be applied to Lukasiewicz manyvalued logic endowed with his original implication (5), negation (1), (6), and Zawirski disjunction (12) and conjunction (13). As we shall see in the sequel this set of connectives seems also to be better suited for the description of behaviour of quantum physical systems than the set of connectives originally defined and studied by Lukasiewicz. 3. NON - C LAS SIC A L SET S The classical, two-valued logic is a basis of traditional mathematics and, in particular, of the traditional set theory. Although there exist wellelaborated systems of axioms for the classical set theory for all practical purposes it is enough to distinguish a set that we are interested in by a predicate which, according to two-valued logic, allows to divide unambiguously all considered objects into two disjoint classes: objects that belong to a set and objects that do not belong to a set and form its complement. For example, let U be a set consisting of speakers at the "Einstein meets Magritte" Conference. This predicate is precise enough to define this set as soon as the Conference is finished (otherwise we could fall into the Aristotelian trap of future contingents). All propositions of the form: x belongs to the set U where x denotes a name of an individual person are, as soon as the Conference is finished, either true or false, i.e., they belong to the domain of classical two-valued logic. Although every traditional set is defined by a "sharp" predicate, not every predicate is good enough to define a traditional set in an unambiguous way. Let us try to distinguish a subset A of the above-mentioned set of speakers U consisting of speakers whose talks were interesting. Even if we choose only one umpire in order not to deal with various opinions we are likely to get, besides "sharp" judgements of the form: "the talk of Prof. X was not interesting", "the talk of Dr. Y was interesting" also a lot of statements of the form: "the talk of ... was ... a little bit/only pariially/not so much/quite/in most of its paris/nearly almost ... interesting". Therefore, we see that besides the speakers who, like Prof. X surely do not belong to the set A and like Dr. Y surely belong to it, both membership and non-membership of other speakers to the set A is doubtful. It also would not be good to group all these other speakers into one category since from various judgements of our umpire we infer that different talks were intersting to him to different extent. The best solution would be to evaluate numerically degrees to which talks of various speakers were interesting and to say that "degrees of membership" of various speakers to the set A are proportional to these numbers. 76 JAROSLAW PYKACZ This is exactly the idea of a fuzzy set: If a is a fuzzy subset of the universe of discourse U (in our case the set U consists of all speakers), then some elements of U surely belong to A, some surely do not belong to it, but also all intermediate cases of "partial membership" are allowed. Moreover, membership is "graded": according to the original idea of Lotfi A. Zadeh [41], who is generally recognized as a founding father of the fuzzy set theory6, membership of an element x to a fuzzy set A, denoted /-LA(X) or simply A(x) can vary from 0 (full nonmembership) to 1 (full membership), i.e., it can assume all values in the interval [0,1]. Therefore, a membership function /-LA : x I--t /-LA(X) E [0,1] completely characterizes the fuzzy set A and it is an obvious generalization of a characteristic function XA(X) of a traditional set: 0 for x tf. A XA (x) = { 1 for x E A (16) Fuzzy subsets of a plane can be easily visualized as areas which, contrary to traditional sets (usually called crisp sets in the fuzzy set theory) have no sharp boundaries and vanish gradually (Figs. 1 and 2). They are smeared, blurred or simply fuzzy. Our everyday language provides us with numerous examples of "nonsharp" predicates which can define only non-crisp sets, e.g., young (man), ripe (apple), old (painting), fast (car), famous (artist), etc. In all these cases we can easily distinguish elements which surely belong to a set of objects defined by a given predicate, elements which surely do not belong to it, and elements whose membership is more or less doubtful. Actually, I dare say that in everyday communication "sharp" predicates which defne crisp sets are rather an exception than a rule. Of course in some cases it is possible to draw a borderline in a more or less arbitrary way to recover sharp discrimination between members and non-members of a set. For example we could state that a car Xl which can go faster than 150 km/h belongs to the set of fast cars which, according to twovalued logic, implies that a car X2 which can go at most 149, 999 km/h is, by the very definition, not fast so it does not belong to the set of fast cars. However, we feel that the car X2 "almost belongs" to the set of fast cars and should not be treated in the same way as a car X3 which can go at most 50 km/h. It is more natural to state that the grade of membership of the car X2 to the set of fast cars is very close to 1 while the grade of membership of the car X3 to this set is close to o. Thus, the 6 It seems that C. C. Chang [39] and D. Klaua [40] elaborated similar ideas independently of [41] and published them even slightly before [41]. It is a problem for historians of science to explain why their papers, contrary to [41] are almost neglected. 77 NON-CLASSICAL LOGICS I 1 '--_ _ _-'---_----' ____ 1 ___ -'-_ _ _-'- ______ _ Figure 1. Crisp set and its characteristic function along the axis X. Point Xl belongs to A, therefore XA(XI) = 1; point X2 does not belong to A, therefore XA(X2) = o. Each point either entirely belongs or does not belong to A . .: , I··.· .:1 0'0 1 - _____ l_--- e' ",I _____ -~l_ ~- I 1 "2 _ _ _ _ 1_ _ _ _ _ _ _ _ _ _ _ I _ _ _ _ _ _ _ _ _ _ _ ..J _ _ _ _ _ I Figure 2. Fuzzy set and its membership function along axis X. Point Xl belongs to A in 100%, therefore JLA(Xt) = 1; point X2 belongs to A in 0% (does not belong to A in 100%), therefore JLA(X2) = O. Other points belong to A to the extent expressed by respective values of the membership function JLA, e.g., points X3, X4, X5 and Xa half-belong to A, therefore, JLA(x3-a) = ~. idea of representing the collection of fast cars in a form of a fuzzy set is very appealing, although it should be mentioned that in general it is not 78 JAROSLAW PYKACZ at all obvious what a membership function of a specific fuzzy set should precisely look like7 . As soon as membership functions of fuzzy sets are established, these sets are characterized to the full extent and we can define on them all relations and operations known from the traditional set theory8. This is much expected since classical sets are actually special cases of fuzzy sets: they are fuzzy sets whose membership functions assume only two values: 0 and 1, i.e., these membership functions are in fact characteristic functions (16), and because all set-theoretic relations and operations on classical sets can be expressed in terms of these characteristic functions. The basic relations and operations on fuzzy sets were defined already by Zadeh in his historic paper [41] and they are till now the most frequently used in all contributions to and applications of the fuzzy set theory. We shall see in the sequel that Zadeh's intuitive choice was so natural because these operations follow from the connectives of Lukasiewicz many-valued logic exactly in the same way as operations on classical sets follow from the connectives of classical logic. Zadeh basic relations and operations are defined with the aid of membership functions as follows (we assume, as it is usually done in the fuzzy set theory, that all considered fuzzy sets are in fact fuzzy subsets of a fixed universe of discourse U; we shall represent fuzzy sets on pictures by their membership functions as it was done on the lower part of Figure 2): Equality of fuzzy sets (Fig. 3): A = B iff for all elements x in the universe U (17) Inclusion of fuzzy sets (Fig. 4): A universe U ~ B iff for all elements x in the (18) Complement (negation) of a fuzzy set (Fig. 5): A' is a complement of A iff for all elements x in the universe U (19) 7 This observation gave rise to the notion of probabilistic fuzzy sets introduced by Hirota [42] whose membership functions are themselves "fuzzy". Of course this procedure can be continued, but objects obtained in this way are less and less convenient to deal with. 8 There are also operations which can be defined on fuzzy sets but not on crisp sets, e.g., an operation of "sharpening" of a set which makes it "less fuzzy". NON-CLASSICAL LOGICS 79 1 ---------/ c A=B Figure 3. Equality and non-equality of fuzzy sets. 1 ?-~ J.l.B (x) ;;2 J.l.A (x) Figure 4. Inclusion of fuzzy sets. Union (sum) of fuzzy sets (Fig. 6): Au B is a union of A and B iff for all elements x in the universe U (20) 1 ---------, , \ A' \\ A ------------------7--/ / / _ - - / J.l.A' (x) = 1 - J.l.A (x) 1 ---------------:;. :2 / / J.l.A (x) / \ / \ Figure 5. Complement (negation) of fuzzy sets. 80 JAROSLAW PYKACZ Figure 6. Union (sum) of fuzzy sets. 1 Figure 7. Intersection (product) of fuzzy sets. Intersection (product) of fuzzy sets (Fig. 7): A n B is an intersection of A and B iff for all elements x of the universe U (21) The fuzzy set theory is by no means only a mathematical game. Although in the beginning it was treated with a kind of reservation by traditionally oriented "crisp" mathematicians it finds numerous practical applications which vary from earthquake forecasting, computer medical diagnoses, decision making and pattern recognition to the production of control systems for the underground and more efficient vacuum cleaners [43]. Moreover, it seems to be a very natural tool for all "soft" sciences which deal with vagueness or imprecision caused either, like in meteorology, by the excess of data or, like in economics, sociology, psychology, etc., by the human factor. Actually, the "applicational" aspect of fuzzy sets is maybe even better known than their theoretical aspects which still seem to be undervalued by the society of "crisp" mathematicians. NON-CLASSICAL LOGICS 81 4. FU Z ZY SET SAND INFINI TEVALUED LUKASIEWICZ LOGIC In order to explain why obvious links between fuzzy sets and manyvalued Lukasiewicz logics were not studied9 during the whole first decade of vivid development of the fuzzy set theory one should take into account two possible reasons: On the one hand Lotfi Zadeh and his followers seemed to be interested mostly in applications of the newly established theory and were not so much occupied with clarification of its foundations. On the other hand "pure" mathematicians of that time did not bother a lot about a theory which probably was seen by them as too simple in comparison with sophisticated problems emerging on very frontiers of contemporary "crisp" mathematics. Actually, the relation of classical logic to classical set theory, in particular definitions of settheoretic operations in terms of connectives of classical logic are taught in the beginning of the secondary school lO • They remain the same when classical logic is replaced by infinite-valued Lukasiewicz logic and classical sets are replaced by fuzzy sets, but this observation was published by Robin Giles [44]10 years after l l the successful launching of the idea of fuzzy sets by Zadeh in 1965. Let us remind relations between propositions and sets known from school and see how they work when classical logic and classical sets are replaced by Lukasiewicz infinite-valued logic and fuzzy sets. The notion of a set is adopted at school as a primitive notion and it is tacitly assumed that we know a set when we know all its elements. Therefore, any set A can be described as a collection of objects whose names turn a propositional function "x belongs to A" ("x E A") into a true proposition. Symbolically: A = {x : t("x E A") = 1} (22) Because of the equality in the bracket the set A defined by the formula (22) is unavoidably crisp even if we replace classical two-valued logic by infinite-valued logic, However, if we rewrite this formula into the following (in two-valued logic equivalent) form: A = {x : t("x E A") i= O} (23) then, since in infinite-valued logic a truth value of a non-false proposition can assume, besides 1, any value between 0 and 1, the set A occurs to 9 except in almost neglected papers [39] and [40] mentioned in footnote 6 10 at least in Poland 11 but cf. footnotes 9 and 6 82 J AROSLA W PYKACZ be a fuzzy set with membership function defined by truth values of propositions of the form "x belongs to A": f.LA(X) = t("x E A") (24) (the degree of membership of an object x to the fuzzy set A is equal to the truth value of the proposition "x belongs to A"). This equality allows to use well-known school definitions of a complement, union, and intersection of classical sets together with Lukasiewicz formulas for truth values of negation(6), disjunction (7), and conjunction (8) to justify Zadeh's intuitive choice of basic operations on fuzzy sets (19), (20), and (21): Membership function of a complement (negation) of a fuzzy set: f.LAI(X) = t("x E A''') = t("x rf. A") = t(-,"x E A") = 1- t("x E A") = 1 -/-tA(X). (25) Membership function of a union (sum) of fuzzy sets: f.LAUB(X) = t("x E A" V "x E B") max[t("x E A"), t("x E B")J = max[f.LA(x), f.LB)X )J. (26) Membership function of an intersection (product) of fuzzy sets: f.LAnB(X) = t("x E A" /\ "x E B") min[t("x E A"), t("x E B")J = min[f.LA(x) , f.LB)X)J. = (27) However, as it was mentioned at the end of Section 1, the original Lukasiewicz disjunction and conjunction are not the only conceivable connectives of this type. Therefore, Zawirski disjunction (12) and conjunction (13) placed inside (26) and (27) yield other union and intersection of fuzzy sets, usually called bold operations after Giles [44J (other names : Giles, truncated, bounded, arithmetic, Prink, Lukasiewicz operations) 12: t( "x f.LAUB(X) 12 E A" u "x = min[t("x E A") = min[f.LA(x) E B") + t("x E B"), 1J + f.LB(X), 1J (28) It is sure that Giles [44] was not aware of Zawirski 1934 paper [37] where these operations appeared for the first time (in the domain of many-valued logics), and most probably he was also not aware of Frink 1938 paper [38]. It seems also that these operations were rediscovered many times by various authors which explains multiplicity of their names. NON-CLASSICAL LOGICS J.LAnB(X) = = = t("X E A" n "x E B") max[t{"x E A") + t{"x E B") - 1,0] max[ILA(x) + ILB(X) - 1,0] 83 (29) It is obvious that other disjunction-like and conjunction-like connectives of infinite-valued logic13 define in the same way other operations of fuzzy set union and intersection and vice versa: All operations on fuzzy sets interpretable as fuzzy set union and intersection 14 yield disjunctionlike and conjunction-like connectives of infinite-valued Lukasiewicz logic. 5. NON - C LAS SIC ALP H Y SIC S Richard Feynman, the Nobel Prize winner in physics, undoubtedly one of those who knew quantum mechanics best said [47]: "I think I can safely say that nobody understands quantum mechanics". Actually, the behaviour of microobjects is sometimes so peculiar that, although it can be very well described and predicted with the aid of well-elaborated mathematical formalism, it cannot be comprehended to the full extent by our brain experienced by macroscopic world and, I dare say, twovalued logic. Let us consider a standard example of a double-slit experiment (Figs. 8 and 9) which illustrates famous wave-particle duality of quantum objects and which, again according to Richard Feynman [48] u... has in it the heart of quantum mechanics. In reality, it contains the only mystery". The double-slit experiment is an old experiment designed by Thomas Young in the beginning of the 19th century in order to show interference of light. Waves emitted by a source S pass through two narrow slits A and B made in a non-transparent barrier and interfere in the area behind this barrier where they overlap (Fig. 8). The interference is visible either directly (in the case of water waves) or after placing a screen or a system of detectors in the area of interference, as it is usually done in the case of light or acoustic waves. 13 2 There are 2(2 ) = 16 conceivable two-argument connectives in 2-valued logic, 3(3 2 ) = 19.683 two-argument connectives in 3-valued logic, n(n 2 ) two-argument connectives in n-valued logic and obviously infinity of two-argument connectives in infinite-valued logic. Of course not all of them could be in a reasonable way interpreted as disjunction or conjunction. Some of them could be interpreted as implication or equivalence, but the overwhelming majority of them have no counterparts in two-valued logic. 14 Even the whole families of such operations, parametrized by real numbers, have been already studied (see, e.g., [45, 46, 43]) 84 JAROSLAW PYKACZ barrier screen Figure 8. Double-slit experiment in the configuration W ("waves"). Wave-like aspects of light are shown. It should be noticed that interference is a typical wave phenomenon: When two waves of the same amplitude meet their "movements" sum up so the amplitude of the emerging wave is doubled in areas where both constituing waves have the same phase (totally constructive interference), disappears in areas where constituing waves have opposite phase (totally destructive interference) and is intermediate in other places. Specially the case of totally destructive interference shows the crucial difference between waves and particles of any kind: We cannot expect that in a place where two streams of particles (e.g., bullets from two machine guns) meet they could annihilate leaving no trace of their existence at all, like destructively interfering waves do. Interference experiments, relatively easy to be performed with light, convinced for some time the majority of physicists that light is a typical wave. However, the study of photoelectric effect by Albert Einstein led him to the conclusion that energy of light spreads in indivisible packets ("quanta", which gave name to quantum physics). A slight modification of a double-slit experiment (Fig. 9) shows that light quanta, usually called photons, are well-localized: It is enough to place a detector behind each slit and use very weak source of light, in which case photons "leak" one by one, to state that: (i) each of the detectors either registers the "whole" photon or nothing at all, nothing like a "fragment of a photon" is ever registered, (ii) detectors never register a photon simultaneously which implies that each photon passes through one slit only, i.e., photons NON-CLASSICAL LOGICS very weak source 85 no coincidence! Figure 9. Double-slit experiment in the configuration P ("particles"). Particle-like aspects of light are shown. behave in this situation like well-defined particles 15 . The double-slit experiment and its modifications reveal to us, macroscopic creatures, a lot of surprising mysteries of the microworld: MYSTERY 1. Interference pattern in the configuration W is built after enough long time even when the source of light is so weak that photons "leak one by one" 16. This shows that interference is not caused by collective motion or mutual influence of many photons but happens even for a single photon which "interferes with itself' . MYSTERY 2. Interference experiments can be performed not only with light but also with "solid" objects the particle-like character of which seems to be beyond any question. This hypothesis was formulated by Louis de Broglie in his famous doctoral dissertation [51] in 1923 and confirmed experimentally for electrons in 1927 by Davisson and Germer [52]. Nowadays wave-like aspects of electrons or even quite heavy atoms are utilized for practical purposes in every hospital or laboratory where electron or ion microscope is used. MYSTERY 3. If someone, in spite of the results of a double-slit experiment performed in the configuration P maintains that photons, electrons, etc., are "extended particles" which travel simultaneously through both slits, he should take into account that in some already performed versions of this experiment the two allowed paths were separated by several me15 The described experiment is a "gedanken" one but similar experiments were actually performed (see, e.g., [49]) and the results were just as described above. 16 Such experiments were done already in 1909 by G. I. Taylor [50). 86 JAROS LAW PYKACZ possible paths of a photon gra vitational lens made by remote galaxy Figure 10. Cosmic-scale double-slit experiment. ters. Moreover, the ingenious proposal of "cosmic-scale double-slit experiment" (Fig. 10) made by Wheeler [53J would require micro(!)-objects to extend over a distance of millions of light-years. MYSTERY 4. It seems that the very nature of microobjects is so mischievous that they never show us both their faces: wave-like and particlelike at the same time. Actually, any attempt to establish through which slit a photon passes inevitably leads either to catching it in a detector or to such disturbance of its movement (e.g., if it bounces against another microobject which in turn is registered in a detector) that the interference pattern is destroyed anyway. This is the essence of the famous Complementarity Principle: Microobjects always reveal to us only one of their complementary faces, in particular, we can never experience both their wave-like (e.g., interference) and particle-like (e.g., defined trajectory) features in the same experiment. However, the last statement belongs to the original Copenhagen paradigm which nowadays has to be slightly modified. In 1979 Wooters and Zurek [54J demonstrated that mathematical formalism of quantum mechanics allows to obtain in a double-slit experiment both non-perfect knowledge about a path of a quantum object together with non-perfect interference pattern. These predictions were experimentally confirmed in an experiment performed by Mittelstaedt, Prieur, and Schieder [55J in 1987 and actually in the course of this experiment partial interference pattern was obtained together with partial knowledge of a trajectory of a photon which allowed the authors to conclude that: "A photon possesses simultaneously particle properties and wave properties". In this way we encounter for the first time the situation in which many-valued logic seems to be better-suited for the description of quantum phenomena than classical two-valued logic: According to the orthodox Copenhagen formulation of the Complementarity Principle wave-like and particle-like properties of microobjects are mutually exclusive in the NON-CLASSICAL LOGICS 87 "absolute" sense. Therefore, propositions like PA,W (PA,P) = "an experiment A reveals wave-like (resp. particle-like) properties of photons" are supposed to be either entirely false or entirely true, i.e., it is believed that for any conceivable experiment they belong to the domain of classical two-valued logic. However, Mittelstaedt, Prieur, and Schieder (MPS) found in the course of their experiment that: "Even for a very high value of the particle property, there is still a nonvanishing amount of the wave property" and they for example wrote about "... a photon with 98,2% particle property and ... 1,8% wave property", i.e., they were not only able to state simultaneous "partial" or "non-perfect" existence of these properties, but also to express numerically the "degree" to which they hold. These percentages, after dividing by 100%, can be in a straightforward way identified with truth values t(PA,W), t(PA,P) of propositions PA,W andpA,p and in the course of the MPS-type experiment these truth values are usually different from 0 or 1. Of course it is still possible to avoid using non-classical truth values t(PA,W), t(PA,P) i- 0,1 and describe MPS-type experiments entirely in terms of classical two-valued logic using instead of propositions like PA,W or PA,r> propositions of the form "the amount of wave-like properties of photons in an experiment is x%" whose truth-values (at least in the idealized case when we do not allow any imperfections) are again either o or 117. However, there are also other indications for using non-classical logics in the description of quantum systems. We shall study them in a more detailed way in the section that follows. 6. THREE-VAL UED LO GICS IN FOUNDATIONS OF QU ANTUM MECHANICS In the years 1925-1926 the development of quantum physics experienced itself a "quantum jump": Under the influential works by Heisenberg, Schrodinger, Born, Jordan and Dirac [56, 57, 58, 59, 60, 61] physicists abandoned so called "older quantum theory" which was merely an amalgamate of ideas and models taken from classical physics with ad hoc added "quantum conditions" and developed quantum mechanics as internally consistent, although mathematically highly sophisticated theory which, at least in its nonrelativistic part, persists without drastic changes till today. However, some implications of the new theory were so bizarre that there were scientists who claimed that quantum theory cannot be comprehended on the ground of classical two-valued logic. 17 This is a standard procedure allowed by the fact that classical two-valued logic is a metalogic for many-valued logics. 88 JAROSLAW PYKACZ The first one who expressed such claims was a Polish logician Zygmunt Zawirski who was looking for the possible fields of application of Lukasiewicz many-valued logic. In the papers published in 1931 [25] and in 1932 [26] Zawirski argued that the equivalence of "complementary theories", e.g., wave and particle pictures in the description of microobjects is possible only on the ground of (at least) three-valued logic since in two-valued logic a statement like "light is a wave AND light consists of particles" is a statement which is a conjunction of two propositions which cannot be simultaneously true. Therefore, according to the laws of classical two-valued logic such conjunction is necessarily a false proposition. According to Zawirski complementarity, typical to quantum mechanics, can be comprehended only on the ground of (at least) three-valued logic when we ascribe to two mutually exclusive theories a third truth value interpreted as "possibility" or "equal probability". Indeed, if a truth value of two propositions p and q equals ~, then according to formula (8) t(p 1\ q) = min[!, ~J = ~ (30) so the conjunction of two "possible" statements is again "possible". Zawirski's papers: [25J published in Polish in a local journal, [26J published in French but in a journal not popular among physicists, as well as [37J published again in Polish, passed relatively unnoticed 18. More fortunate in spreading out his ideas was Fritz Zwicky [62J but the bestknown attempts at basing quantum mechanics on three-valued logics were elaborated in the fourties and early fifties by Paulette DestouchesFevrier [27, 28J and Hans Reichenbach [29, 30, 31, 32J. Specially the Reichenbach's book [29J in which he consequently tried to explain quantum mysteries on the ground of three-valued logic was, for some time, widely discussed. Reichenbach's ideas were pursued after his death by Hilary Putnam [63J but all attempts at using three-valued logics in order to describe quantum phenomena never gained such popularity as attempts based on quite other type of non-classical logic introduced in 1936 by Birkhoff and von Neumann which are described in the next section. 18 even in Poland: When in 1991 in the Polish National Library in Warsaw I got to my hands a copy of Zawirski's paper [37] it occured that the pages of a booklet were still not cut apart, i.e., most probably no one had read this copy during the whole 60 years! The homage should be paid to Max Jammer who mentions Zawirski papers in his famous book [35] on philosophy of quantum mechanics. The interested reader will find in Chapter 8 of this book more detailed historical survey of applications of many-valued logics in foundations of quantum mechanics up to the early seventies. NON-CLASSICAL LOGICS 89 In the author's opinion the attempts at founding quantum mechanics on any version of three-valued logics were bound to fail also because three-valued logics are not enough "rich in truth values" and, therefore, are not enough "flexible": One should not hope to be able to describe within them the whole variety of quantum phenomena, specially when truth values are supposed to be connected with numerical results of experiments, if there is only one truth value (besides the classical 0 and 1) at hand. 7. STILL OTHER NON-CLASSICAL LOGIC FOR QUANTUM MECHANICS Garrett Birkhoff and John von Neumann published in 1936 a seminal paper [64] whose title "The logic of quantum mechanics" gave name to the vast and vivid field of activity that stemmed from this paper and which was later on called the theory of quantum logics 19 . At that time quantum mechanics already achieved its mature form 2o whose basic ingredients were linear operators acting on some abstract vector space called Hilbert space after great mathematician David Hilbert who was the first to study such spaces in the beginning of the century. Some of these operators (so called projection operators) were regarded by Birkhoff and von Neumann as representing dichotomic experimental propositions, i.e., such propositions, that can be unambiguously classified as either true or false when a suitable experiment is completed. Please note that this resembles very much Aristotle's statement "There will be a sea battle tomorrow" which, when the experiment consisting in placing two navies in close vicinity is completed is also bound to be either true or false, but before it is completed it can be regarded as belonging to the domain of many-valued logic. However, the possibility of interpreting Birkhoffvon Neumann experimental propositions as many-valued propositions was for the long time overlooked and the opinion prevailed that these propositions are two-valued (cf., p. 346 of Jammer's book [35]). The non-classical character of Birkhoff-von Neumann experimental propositions was supposed to lay not in the number of their truth values but in the algebraic structure of the set they form. This structure is obviously the same as the structure of the set of projection opera19 The most recent trend is to speak about quantum structures rather than about quantum logics (vide: International quantum structures Association) since in the last years the mathematical structures studied within this field considerably transgressed the boundaries encircled by Birkhoff and von Neumann's paper (64). 20 von Neumann's own contribution to this form was one of the most influential. 90 JAROSLAW PYKACZ tors acting on an underlying Hilbert space and it is nowadays called an orthomodular lattice. Orthomodular lattices are slightly more general structures than Boolean algebras 21 which in turn are algebraic models (so called Lindenbaum algebras) of families of propositions which obey laws of classical two-valued logic. The law valid in Boolean algebras that is not secured in orthomodular lattices is the distributivity law which, written in logical notation, looks as follows: pl\(qVr) pV(ql\r) = (pl\q) V (pl\r) (pVq) 1\ (pVr). (31) (32) Therefore, the hypothetical quantum logic, i.e., a system of experimentally verifiable propositions that can be legitimately used in the description of properties and behaviour of quantum objects was judged to be non-distributive although two-valued. We shall argue in the next section that it should be rather judged as non-distributive and many-valued. 8. THE UNIFICATION OF TWO APPROACHES TO QUANTUM LOGIC: BIRKHOFF-VON NEUMANN QUANTUM LOGIC AS PARTIAL INFINITE-VALUED LUKASIEWICZ LOGIC As it was already mentioned, a Birkhoff-von Neumann experimental proposition p assumes one of two classical truth values 0 or 1 when suitable experiment is completed. However, before this is done, the mathematical formalism of quantum mechanics allows only to calculate probability that a proposition will be true or false. It is worth stressing that in the case of quantum physical systems, contrary to classical ones, this probability is "ontological", not "epistemological", i.e., it does not result from a lack of knowledge about actual values of some parameters that could have characterized a studied system and which, had they been known, would have allowed to predict the results of experiments without any uncertainty22. 21 The most typical example of a Boolean algebra is a family of (crisp) subsets of a fixed set which is closed with respect to the standard set-theoretic operations of union, intersection, and complement. Actually, due to the famous Stone Theorem [65] any Boolean algebra is isomorphic to a Boolean algebra of this kind. 22 The idea of supplementing quantum theory by such parameters, usually called hidden variables, was considered by various physicists since the very birth of quantum theory. However, some recent experiments [66] indicate that this idea can be maintained only if hidden variables act on quantum objects in a non-local way and, therefore, is rejected by the majority of contemporary physicists. NON-CLASSICAL LOGICS 91 In general the same physical system can be in different states (which "operationally" manifests itself by using different procedures to prepare it for an experiment) and probabilities that an experimental proposition p will turn out to be true are, in general, different for different states of a physical system23 • Therefore, we see that with any experimental proposition p and any state a of a physical system the mathematical formalism of quantum mechanics associates in an unambiguous way a number a(p) E [0, 1] interpreted as a probability of stating experimentally that the proposition p is true. In the mathematical parlance it is said that states of a physical system are represented by probability measures defined on a logic of a system, i.e., on a family of experimentally verifiable propositions pertaining to the studied physical system. This is in a full agreement with mathematical description of quantum systems in Hilbert spaces where actually, due to the famous Gleason Theorem [67], states of physical systems generate probability measures on families of projection operators. However, there is also the other possibility in which a number a(p) E [0,1] can be interpreted. It agrees with an old idea launched by Lukasiewicz already in 1913 [3] and elaborated further by Zawirski [37], according to which numerical value of a probability of a random event c coincides with a truth value of a (many-valued) proposition: "The event c will happen,,24. This possibility was pursued by the present author in a series of papers [68, 69, 70] and consists in assuming that a number a(p) E [0,1] is a truth value of an experimental proposition p(a) which can be linguistically expressed, e.g., in the form: "If we perform a dichotomic (yes-no) experiment designed to check an experimentally verifiable proposition p on a physical system that is in a state a, then p will turn out to be true", or simply: "If a system is in a state a, then p will be true", i.e., that a(p) = t(p(a)). (33) Let us stress once more that the Future Tense used in these linguistic statements clearly shows that at least in the case of quantum physical 23 If they are not, there is no reason to claim that these states of a system are actually different. 24 Actually, neither of these Authors paid much attention to distinguish this statement expressed in the Future Tense from the statement: "The event & happened" . In my opinion such distinction helps to maintain clarity: The first statement belongs to the domain of many-valued logic while the second one to the classical two-valued logic (provided that we exclude "ill-defined" events and "ill-performed" experiments which, after an experiment is completed, leave doubts whether an event actually happened or not). 92 JAROSLAW PYKACZ systems these statements do not, in general, belong to the domain of the classical two-valued logic which only rarely admits statements about future events. Since we cannot attach any truth value to palone (which means that it is not a logical proposition) but only to p(a), where a represents a state of a system, a statement p regarded after Birkhoff and von Neumann as a two-valued proposition actually turns out to be a many-valued propositional function defined on a set of states of a physical system. Of course there can exist also constant propositional functions of this form, i.e., statements which assume the same truth value on each state of a system. Two of them play particular role and obviously can be added to any family of propositional functions that describes any physical system: The always-false propositional function F and the always-true propositional function T which assume, respectively, truth values 0 and 1 on any state a of a physical system25 . It was proved by the author [68] that a set of propositional functions that possesses an algebraic structure of an orthomodular lattice (e.g., that is generated by a set of projection operators acting on a Hilbert space) is "minimal" in the sense that it does not contain any constant propositional function except F and T. However, many-valued models of recently studied more general quantum structures (orthoalgebras, effect algebras) do admit constant propositional functions other than F and T. Let us now consider what logical operations can be performed on a set of many-valued propositional functions associated to a given quantum system. The most obvious of them (and also causing least troubles) is negation. According to the original ideas of Birkhoff and von Neumann it is always possible to negate an experimental proposition p by interchanging meanings attached to the two possible outcomes of a dichotomic experiment designed to check p. The same procedure can be maintained to obtain negation -,p(.) of a propositional function p(.) also in our approach. It follows from our reinterpretation of Birkhoff-von Neumann probability a(p) as a truth value of a many-valued proposition p(a) that t(-,p(·)) = 1- t(p(·)). (34) Actually, whatever is a state a of a physical system, probability a( -,p) of getting the outcome "no" in any dichotomic experiment with random outcomes obviously equals 1 - a(p) where a(p) is the probability of 25 F can be linguistically expressed, for example, as: "If a system is in a state ex, then it does not exist" and T as: "If a system is in a state ex, then it exists", but there are also other linguistic expressions for F and T possible. 93 NON-CLASSICAL LOGICS getting the outcome "yes". Therefore, according to (33) we get for any state a of a physical system t(...,p(a)) = a(...,p) = 1- a(p) = 1- t(p(a)) (35) from which the general formula (34), which is in full accordance with the original Lukasiewicz formula (6), follows. The problem with a proper expressions for conjunctions and disjunctions of our propositional functions is much less trivial. The first difficulty consists in the abundance of possibilities mentioned at the end of Section 3. Secondly, the proposals made originally by Birkhoff and von Neumann [64] according to which conjunctions and disjunctions should be modelled, respectively, by order-theoretic operations of the greatest lower bound (g.l.b.) and the least upper bound (l.u.b.) with respect to the partial order relation introduced in the set of experimentally verifiable propositions, are so far from being satisfactory that they cannot guide us as they did in the case of negation. Actually, one of the features by which quantum systems differ from classical systems the most is the phenomenon expressed in famous Heisenberg Uncertainty Principle which says that there exist physical quantities that cannot be simultaneously measured with arbitrary accuracy. The standard example of such quantities is position x and momentum p of a microobject for which uncertainty relation takes form ~xp> h - 471" (36) where ~x and ~p denote, respectively, unavoidable uncertainties in finding the values of x and p in any conceivable experiment. It follows from (36) that if we manage to measure position of a microobject with great close to zero), then we do not know practically anything accuracy (~x about simultaneously measured value of momentum (~p very big) and vice versa. Therefore, although truth values of two experimentally verifiable propositions a = "the value of position is Xo ± ~x" and b = "the value of momentum is Po ± ~p" can be established in two independently performed experiments whatever and ~p, the conjunction "a AND b" is are the assumed uncertainties ~x not an experimentally verifiable proposition if ~xp < 4~ since there is 94 J AROSLA W PYKACZ no experimental possibility to measure simultaneously the position and the momentum of a microobject with arbitrary accuracy. Birkhoff and von Neumann were well aware that this fact makes the interpretation of g.l.b. and l.u.b. of two experimentally verifiable propositions as conjunction and disjunction questionable for a very simple reason: The family of all projection operators on a Hilbert space, which is an archetype of all "quantum logics" is a lattice, i.e., in this family g.l.b. and l.u.b. of any two elements exist, which should not be the case in view of just studied example. In other words, a "good" quantum logic should be a partial logic in a sense that it should be possible to form conjunctions and disjunctions only within some classes of propositions which are usually called compatible or simultaneously measurable. The other difficulty in attempts at treating g.l.b. and l.u.b. of two Birkhoff-von Neumann propositions as their conjunction and disjunction follows from the fact that, according to the logical parlance, these operations are not truth functional, which means that the knowledge of truth values of two propositions a and b is not sufficient to establish the truth values of their g.l.b. and l.u.b. This is not a very serious obstacle but, anyway, it would be better if we could calculate truth values of the conjunction "a AND b" and the disjunction "a OR b" from the truth values of a and b alone. Thus, we are left with the infinity of possible candidates for manyvalued conjunctions and disjunctions to be used for our propositional functions that describe probabilities of getting various results of future experiments performed on quantum systems. The task of choosing among them the right ones seems to be almost impossible. Of course, we can proceed a little bit by elimination: For example, the original Lukasiewicz disjunction (7) and conjunction (8) do not seem to be the proper ones since, as it was already mentioned in Section 1, they do not satisfy the law of excluded middle (10) and the law of contradiction (11), while algebraic counterparts of these laws hold true in Hilbert-space-based Birkhoff-von Neumann quantum logic. Happily, it occurs that Zawirski disjunction (12) and conjunction (13) do not only satisfy these laws, but it can be also shown [71, 72] that after suitable translation of Birkhoff-von Neumann logic, via families of fuzzy sets, into the language of infinite-valued Lukasiewicz logic [68, 69] these operations satisfy all the previously listed requirements best: They are only partially defined on families of propositional functions associated with a studied quantum system which, in view of previously listed arguments, is rather a virtue than a drawback. Moreover, it can be shown [72] that if they are defined on a given pair of propositional functions p(.), q(.), then Birkhoff-von Neumann counterparts of these NON-CLASSICAL LOGICS 95 propositional functions are compatible (i.e., simultaneously measurable) propositions (as they should be, of course) and that in this case Zawirski operations coincide with order-theoretic operations of g.l.b. and 1. u. b. that can be independently introduced on a given family of propositional functions. This finally explains why order-theoretic operations of g.1.b. and 1.u.b. were for such a long time regarded as proper models of quantum-logical conjunction and disjunction in spite of well-known interpretational difficulties encountered when they are applied to noncompatible propositions which already annoyed the founding fathers of the quantum logic theory. To sum up, according to the author, the present state-of-the-art is the following: It is a misunderstanding to speak about any "global" quantum logic, understood as a new kind of propositional calculus or a new way of reasoning, that could replace the old, good, two-valued Aristotelian logic in its role of being the base of scientific method of comprehending physical phenomena. However, there do exist "local" quantum logics consisting of propositional functions that predict results of not-yet-performed experiments. The "locality" of these logics follows from the fact that each of them consists of propositional functions that pertain to a specific physical system. These propositional functions are defined on a set of states of a studied physical system (which once more stresses the fact that the logic is "local": it consists only of statements about this physical system), and become propositions when a free variable in a propositional function is replaced by a definite state of a system. Since propositions obtained in this way speak about future events (results of experiments that will be performed on a physical system but, in the considered moment of time are not yet performed), these "local logics" are unavoidably manyvalued logics, otherwise all future events would be strictly determined. The meaning of truth values of these many-valued propositions is probabilistic, for example, t(p(a)) = 0,8 means that the probability that any experiment designed to check a proposition p when a physical system is in a state a will show that p is true equals 0,8. The mathematical results obtained in the course of "translating" Birkhoff-von Neumann quantum logics into the language of Lukasiewicz infinite-valued logic show that a family C consisting of all propositional functions of the form p( .) associated with any physical system has to fulfill the following conditions [69, 71]: (i) The always-false propositional function F and the always-true propositional function T belong to C. (ii) If a propositional function pO belongs to C, then also its negation .p(.) belongs to C, and for any state a, t(.p(a)) = 1- t(p(a)). 96 JAROSLAW PYKACZ (iii) For any (finite or countable) sequence of propositional functions {Pi (. )} that are pairwisely exclusive in the Zawirski sense26 there exists in ,c a propositional function that is a Zawirski disjunction (12) of all Pi(')' (iv) The always-false propositional function F is the only one propositional function in ,c that is exclusive27 with itself. It was proved by the author [69] that every quantum logic in the Birkhoff-von Neumann sense with so called ordering family of probability measures (which, in the case of logics associated with physical systems represent states of these systems) can be uniquely expressed in the form of such a family of propositional functions defined on its set of probability measures, and vice versa: Any family of many-valued propositional functions that satisfies conditions (i)-(iv) possess an algebraic structure that characterizes Birkhoff-von Neumann quantum logics. Therefore, Birkhoff-von Neumann quantum logic can be seen as being both non-distributive (which follows from its algebraic structure "inherited" from a Hilbert space) and many-valued logic, which finally unifies these two lines of thought existing in the quantum logic theory since its very beginning. ACKNOWLEDGMENTS Writing of this paper was financially supported by the joint PolishFlemish Project No. 007 (Poland) / VLW 12 (Flanders) and by the University of Gdansk Research Grant BW/5100-5-0276-7. The Author is very grateful to the whole staff of the Center "Leo Apostel" (Brussels) where the main part of this paper was written for the technical assistance and warm hospitality extended to him during his frequent stayings at CLEA, and to Mr. Wojciech Mostowski for producing computer drawings that illustrate the paper. 26 This means that Zawirski conjunctions (13) of all possible pairs of propositions Pi(a), pj(a) obtained from Pi(-) and pj(-) for i =1= j and for all possible states a of a physical system are false propositions, i.e., Vi#-/Vat(pi(a) npj(a» = 0, which can be equivalently denoted as Pi(') n pj(-) = F for all i =1= j. 27 In the Zawirski sense explained in the previous footnote. NON-CLASSICAL LOGICS 97 AFFILIA TION Jaroslaw Pykacz Instytut Matematyki Uniwersytet Gdanski 80-952 Gdansk, Poland pykacz@ksinet. univ. gda. pl REFERENCES [1] Lukasiewicz, J., "Philosophische Bemerkungen zu mehrwertigen Systemen des Aussagenkalkiils", Comptes rendus des seances de ia Societe des Sciences et des Lettres de Varsovie, Cl. III, 23, 1930, pp. 51-77; reprinted as "Philosophical remarks on many-valued systems of propositionallogic" in [2], pp. 153-178. [2] Lukasiewicz, J., Selected Works, North-Holland, Amsterdam, and PWN-Polish Scientific Publishers, Warszawa, 1970. [3] Lukasiewicz, J., Die logischen Grundlagen der Wahrscheinlichkeitsrechnung, Acad. der Wiss. Krakow, 1913; reprinted as "Logical foundations of probability theory" in [2], pp. 16-63. [4] Lukasiewicz, J., "0 logice trojwartosciowej", Ruch Filozoficzny, 5, 1920, pp. 170-171; reprinted as: "On three-valued logic" in [2], pp. 87-88. [5] Post, E. L., "Introduction to a general theory of elementary propositions", Bulletin of the American Mathematical Society, 26, 1920, p. 437. [6] Lukasiewicz, J., An Address Delivered as a Rector of the Warsaw University at the Inauguration of the Academic Year 1922/1933; reprinted as: "On determinism" in [2], pp. 110-128. [7] Lukasiewicz, J., Farewell Lecture by Professor Jan Lukasiewicz, delivered in the Warsaw University Lecture Hall on March 7, 1918; reprinted in: [2], pp. 84-86. [8] Lukasiewicz, J., Lecture delivered at the 232nd Meeting of the Polish Philosophical Society in Lwow on October 14, 1922, published in Ruch Filozoficzny, 7, 1923, pp. 92-93 (in Polish); reprinted as: "A numerical interpretation of the theory of propositions" in [2], pp. 129-130. [9] Lukasiewicz, J. and Tarski, A., "Untersuchungen iiber den Aussagenkalkiil", Comptes rendus des seances de la Societe des Sciences et des Lettres de Varsovie, Cl. III, 23, 1930, pp. 39-50; reprinted as: "Investigations into the sentential calculus" in [2], pp. 131-152. [10] Post, E.L., "Introduction to a general theory of elementary propositions", American Journal of Mathematics, 43, 1921, pp. 163-185. 98 JAROSLAW PYKACZ [11] Bole, L. and Borowik, P., Many- Valued Logics, Springer-Verlag, Berlin, 1992. [12] Whitehead, A.N. and Russell, B., Principia Mathematica, Cambridge University Press, Cambridge, 1910. [13] Epstein, G., Frieder, G., and Rine, D.C., "The development of multiple-valued logic as related to Computer Science", Computer, 7, 1974, pp. 20-32. [14] Rine, D.C. (ed), Computer Science and Multiple- Valued Logic. Theory and Applications, North-Holland, Amsterdam, 1977. [15] Kleene, S.C., "On a notation for ordinal numbers", The Journal of Symbolic Logic, 3, 1938, pp. 150-155. [16] Kleene, S.C., Introduction to Metamathematics, North-Holland, Amsterdam, 1952. [17] Boehvar, D.A., "On a three-valued logical calculus and its application to the analysis of contradictions", Matematiceskij Sbornik, 4, 1939, pp. 287-308 (in Russian). [18] Finn, V.K., An Axiomatization of Some Propositional Calculi and their Algebras, Vsiechsojuznyi Institut Nauceskoi Informacii Akademii Nauk SSSR, Moskva, 1972 (in Russian). [19] Hallden, S., "The logic of nonsense", Uppsala Universitets Arsskrijt, 9, 1949, p. 132. [20] Aqvist, L., "Reflections on the logic of nonsense" Theoria, 28, 1962, pp. 138-158. [21] Segerberg, K., "A contribution to nonsense logic", Theoria, 31, 1965, pp. 199-217. [22] Pirog-Rzepeeka, K., Systemy Nonsense-Logics, PWN-Polish Scientific Publishers, Warszawa, 1977 (in Polish). [23] Heyting, A., Intuitionism. An Introduction, North-Holland, Amsterdam, 1966. [24] Jaskowski, S., "Recherches sur Ie systeme de la logique intuitioniste", in: Actes du Congres International de Philosophie Scientifique, Part 6, Philosophie des matMmatiques. Paris, 1936. Actualites scientifiques et industrielles, 393, 1936, pp. 58-6l. [25] Zawirski, Z., "Jan Lukasiewicz 3-valued logic. On the logic of L. E.J. Brouwer. Attempts at applications of many-valued logic to contemporary natural science", Sprawozdania Poznanskiego Towarzystwa Przyjaci61 Nauk, 2, 1931, pp. 2-8 (in Polish). [26] Zawirski, Z., "Les logiques nouvelles et Ie champ de leur application", Revue de Metaphisique et de Morale, 39, 1932, pp. 503-519. NON-CLASSICAL LOGICS 99 [27J Fevrier, P., "Les relations d'incertitude de Heisenberg et la logi- que", Comptes rendus Acad. Sci. Paris., 204, 1937, pp. 481-483. [28] Destouches-Fevrier, P., "Logiques et theories physiques", in: Congres International de Philosophie des Sciences, Paris 1949, Hermann, Paris, 1951, pp. 45-54. [29] Reichenbach, H., Philosophic Foundations of quantum mechanics, University of California Press, Berkeley, 1944. [30J Reichenbach, H., "The principle of anomaly in quantum mechanics", Dialectica, 2, 1948, pp. 337-350. [31J Reichenbach, H., "Uber die erkenntnistheoretische Problemlage und den Gebrauch einer dreiwertigen Logik in der Quantenmechanik", Zeitschrijt Jur NaturJorschung, 6a, 1951, pp. 569-575. [32J Reichenbach, H., "Les fondements logiques de la mecanique des quanta", Annales de l'Institut Henri Poincare, 13, 1952-1953, pp. 109158. [33J Slupecki, J., "Der volle dreiwertige Aussagenkalkiil", Comptes rendus des seances de la Societe des Sciences et des Lettres de Varsovie, Classe III, 29, 1936, pp. 9-11. [34J Sobocinski, B., "Axiomatization of certain many-valued systems of the theory of deduction" , Roczniki pmc naukowych zrzeszenia asystent6w Uniwersytetu J6zeJa Pilsudskiego w Warszawie, 1, 1936, pp. 399-419. [35J Jammer, M., The Philosophy of quantum mechanics, Wiley-Interscience, New York, 1974. [36J Gonseth, F., Les entretiens de Zurich sur les Jondements et la methode des sciences mathematiques, 6-9 decembre 1938, Zurich, 1941. [37J Zawirski, Z., "Relations between many-valued logic and the calculus of probability", Prace Komisji FilozoJicznej Poznanski ego Towarzystwa Przyjaci61 Nauk, 4, 1934, pp. 155-240 (in Polish). [38J Frink, 0., "New algebras oflogic", American Mathematics Monthly, 45, 1938, pp. 210-219. [39J Chang, C.C., "Infinite valued logics as a basis of set theory", in: Proceedings oj the 1964 International Congress in Logic, Methodology, and Philosophy oj Science, North-Holland, Amsterdam, 1965, pp. 93100. [40J Klaua, D., "Uber einen Einsatz zur mehrwertigen Mengenlehre", Monatsb. Deutsch. Acad. Wiss. Berlin, 7, 1965, pp. 859-867. [41J Zadeh, L.A., "Fuzzy sets", Information and Control, 8, 1965, pp. 338-353. 100 JAROSLAW PYKACZ [42] Hirota, M., "Concepts of probabilistic sets", Proceedings of IEEE Conference on Decision and Control, 1977, pp. 1361-1366; Fuzzy Sets and Systems,S, 1981, pp. 31-46. [43] Proceedings of the Seventh International Fuzzy Systems Association World Congress, Prague, June 25-29, 1997, Academia, Prague, 1997. [44] Giles, R., "Lukasiewicz logic and fuzzy set theory", International Journal of Man-Machine Studies, 67, 1976, pp. 313-327. [45] Frank, M.J., "On the simultaneous associativity of F(x, y) and x + y - F(x, y)", Aequationes Mathematicae, 19, 1979, pp. 194-226. [46] Yager, R.R., "On general class of fuzzy connectives", Fuzzy Sets and Systems, 4, 1986, pp. 235-242. [47] Feynman, R., "Probability and uncertainty-the quantum mechanical view of Nature", in: The Character of Physical Law, MIT Press, Cambridge, MA, 1965, pp. 127-148. [48] Feynman, R.P., Leighton, R.B., and Sands, M., The Feynman Lectures on Physics, Addison-Wesley, Reading, MA., 1965, Vol III, p. 1. [49] Grangier, P., Roger, G., and Aspect, A., "A new light on single photon interferences", Annals of the New York Academy of Sciences, 480, 1986, pp. 98-107. [50] Taylor, G.!., "Interference fringes with feeble light", Proc. Cambridge Phil. Soc., 15, 1909, p. 114. [51] de Broglie, L.V., "Ondes et quanta", Comptes rendus Acad. Sci. Paris, 117, 1923, p. 507; "Recherches sur la theorie des quanta", Annales de Physique, 3, 1925, p. 22. [52] Davisson, C.J. and Germer, L.H., "Diffraction of electrons by a crystal of nickel", Physical Review, 30, 1927, p. 705. [53] Wheeler, J.A., "Law without law", in: Wheeler, J.A. and Zurek, W.H. (eds.), Quantum Theory and measurement, Princeton University Press, Princeton, NJ., 1983, pp. 182-219. [54] Wooters, W.K. and Zurek, W.H., "Complementarity in the double-slit experiment: Quantum nonseparability and a quantitative statement of Bohr's principle", Physical Review D, 19, 1979, pp. 473-484. [55] Mittelstaedt, P., Prieur, A., and Schieder, R., "Unsharp particle-wave duality in a photon split beam experiment", Foundations of Physics, 17, 1987, pp. 891-903. [56] Heisenberg, W., "Uber quantentheoretische Umdeutung kinematischer und mechanischer Beziehungen", Zeitschrijt fur Physik. 33, pp. 879-893. NON-CLASSICAL LOGICS 101 [57] Schrodinger, E., "Quantisierung als Eigenwertproblem", Annalen der Physik, 79, 1926, pp. 361-376; 499-507; 80, 1926, pp. 437-490; 81, 1926, pp. 109-139. [58] Born, M. and Jordan, P., "Zur Quantenmechanik", Zeitschrijt fur Physik, 34, pp. 858-888. [59] Born, M., Heisenberg, W. and Jordan, P., "Zur Quantenmechanik II", Zeitschrijt fUr Physik, 35, pp. 557-615. [60] Born, M., "Zur Quantenmechanik der Stossvorgange", Zeitschrijt fur Physik, 37, pp. 863-867. [61] Dirac, P.A.M., "The fundamental equations of quantum mechanics", Proc. Royal Society of London (AJ, 109, 1925, pp. 642-653. [62] Zwicky, F., "On a new type of reasoning and some of its possible consequences", Physical Review, 43, 1933, pp. 1031-1033. [63] Putnam, H., "Three-valued logic", Philosophical Studies, 8, 1957, pp.73-80. [64] Birkhoff, G. and von Neumann, J., "The logic of quantum mechanics", Annals of Mathematics, 37, 1936, pp. 823-843. [65] Stone, M.H., "The theory of representations for a Boolean algebra", Transactions of the American Mathematical Society, 40, 1936, pp. 37-111. [66] Aspect, A., Grangier, P., and Roger, G., "Experimental realization of Einstein-Podolsky-Rosen-Bohm Gedankenexperiment: A new violation of Bell's inequality", Physical Review Letters, 49, 1982, pp. 9194. [67] Gleason, A., "Measures on the closed subspaces of a Hilbert space", Journal of Mathematics and Mechanics, 6, 1957, pp. 885-894. [68] Pykacz, J., "Fuzzy set ideas in quantum logics", International Journal of Theoretical Physics, 31,1992, pp. 1767-1783. [69] Pykacz, J., "Fuzzy quantum logics and infinite-valued Lukasiewicz logic". International Journal of Theoretical Physics, 33, 1994, pp. 14031416. [70] Pykacz, J., "Fuzzy quantum logics as a basis for quantum probability theory", International Journal of Theoretical Physics, 37, 1998, pp. 281-290. [71] Pykacz, J., "Attempts at the logical explanation of the waveparticle duality", in: Dalla Chiara, M.L., Giuntini, R., and Laudisa, F. (eds.), Language, Quantum, Music, Kluwer, Dordrecht, 1999, pp. 265-282. [72] Pykacz, J., manuscript in preparation. CLAUDIO GAROLA AGAINST "PARADOXES": A NEW QUANTUM PHILOSOPHY FOR QUANTUM MECHANICS l. INTRODUCTION It is a commonplace that XXth century physics has produced powerful new theories, such as Relativity and quantum mechanics, that upset the world view provided by XIXth century physics. But every physicist knows how difficult it may be to explain the basic aspects of these theories to people having a non-physical professional training. The main reason of this is that both Relativity and quantum mechanics are based on fundamental ideas that are not hard to grasp in themselves, but deeply contrast the primary categories on which our everyday thinking it; based, so that it is impossible to place relativistic and quantum results within the framework suggested by ordinary intuition and common sense. Yet, despite this similarity, there are some relevant differences between the difficulties arising in Relativity and in quantum mechanics. In order to understand this point better, let us focus our attention on Special Relativity first (analogous arguments can be forwarded by considering General Relativity). Here, the strange links between space and time following from the even more strange assumption that the velocity of light is independent of the motion of the observer conflict with the very simple conception of space and time implicit in our daily practice (and explicitly stated in classical Physics, think of Newton's "absolute space" and "absolute time"): but this conflict regards geometrical space-time models, not the very roots of our language, hence our thought. Then, let us consider quantum mechanics. Here it is a basic notion that properties of physical systems are nonobjective, in the sense that a property cannot be thought of as existing if a measurement of it is not performed. As Mermin [30] writes, "it is a fundamental quantum doctrine that a measurement does not, in general, reveal a preexisting value of the measured property" . This doctrine is expressed, commented on and elaborated in a huge number of books and papers, and it has been adopted more or less explicitly for seventy years in most arguments in quantum mechanics. Yet, one can easily realize that it challenges the basic procedures of our reasoning within a natural language. Indeed, a primary function of any language is attributing properties to things and deducing, via general 103 © 1999 Kluwer Academic Publishers. 104 CLAUDIO GAROLA laws, further properties. If the above quantum doctrine is accepted, this function must be completely reconsidered. The stronger impact of quantum mechanics on the deep structures of our reasoning is clearly witnessed by the fact that no physicist nowadays seriously challenges Special Relativity, while the debate on the interpretation and on the "paradoxes" of quantum mechanics is still lively. It is apparent that even experienced physicists may be in trouble when facing all consequences of the orthodox quantum philosophy. One could wonder whether it is worth maintaining the aforesaid quantum doctrine if it produces problems and paradoxes. But it is well known that [1] and the Bell-Kochen-Specker theorem [26], [30] that seem to prove some strange features of quantum systems from which this doctrine directly follows: thus, it appears to be imposed by technical results inside quantum mechanics itself rather than being an a priori philosophical choice, which implies that one cannot renounce it without rejecting all the heritage of knowledge provided by quantum physics. Therefore, most scholars concerned with the foundations of quantum mechanics accept the above doctrine, even when propounding an alternative to the orthodox interpretation or a modification of the theory, and some of them try to develop a consistent world view that embodies and "explains" the strange features of quantum systems mentioned above (these are mainly two: nonlocality, which means that a measurement on a particle 1 may instantaneously influence the properties of a particle 2, even if 1 and 2 are far away, if 1 and 2 have interacted in the past; contextuality, which means that a property of a given physical system in a given state may be asserted, or denied, or be meaningless, depending on the measurement context). But up to now no proposal has achieved the general consent of the scientific community, so that the situation is still unsatisfactory from several viewpoints. However, there are still scarcely explored ways of coping with quantum paradoxes. In particular, the belief that nonlocality and contextuality are technical results that cannot be avoided without messing up quantum mechanics can be criticized. This point is crucial, and in order to make it clear a somewhat detailed discussion is needed. Consider first any physical theory. According to a largely accepted epistemological conception (received viewpoint, e.g., Braithwaite, [6], or Hempel, [25]) this theory can be schematically described by saying that it consists of: (i) a mathematical (more generally, formal) apparatus; (ii) a domain of facts (the observative domain); (iii) a partial interpretation, which makes some (not generally all) terms of the mathematical apparatus correspond to observed, or ob- A NEW QU ANTUM PHILOSOPHY FOR QM 105 servable, facts and relations among facts in the observative domain; (iv) a space-time model, which is very important in order to grasp intuitively the content of the theory and to conceive new developments of it (the existence of a model is not retained however a necessary requisite of any physical theory). Among these constitutive elements, the interpretation, in particular, strongly depends on the (implicit or explicit) epistemological choices underlying the theory (for instance, if one retains, with Galileo, that "nature is written in mathematical terms", he will be inclined to interpret all terms of the mathematical apparatus as describing some actually existing, even if sometimes elusive, physical entity, to consider the mathematical relations in the theory as expressions of objective laws of nature and, finally, to retain that the space-time model is a faithful, though possibly approximate, representation of the actual world). Hence, the way in which a mathematical result is interpreted may change when the epistemological premises are changed. If this conclusion is applied to the special case of quantum mechanics, one realizes that the deduction of nonlocality and contextuality, hence of the fundamental quantum doctrine reported above, from the Bell and Bell-Kochen-Specker theorems, is based on an interpretation that follows from the standard philosophy of quantum mechanics. But this philosophy contains, in particular, the fundamental quantum doctrine itself, so that the claim that this follows from the Bell and Bell-Kochen-Specker theorems creates a conceptual loop. Therefore, one can still imagine that some changes in the standard philosophy might lead one to conceive a new interpretation of quantum mechanics which does not imply, on one side, a modification of its formal apparatus, thus saving the core of the quantum description of the world, and which, on the other side, allows one to read the Bell and Bell-Kochen-Specker results in a non standard way, thus avoiding "paradoxes" as nonlocality and contextuality and reconciling quantum physics with the basic procedures of everyday physicist's and mathematician's reasoning (it must be stressed that this does not mean to propound again classical models for physical reality, since even a classically structured language can be used in order to describe an infinity of non-classical models). However, the idea of introducing philosophical changes in order to solve some problems in quantum mechanics would probably be regarded with annoyance by many pragmatically oriented physicists, who consider philosophical reasoning "practically irrelevant", or even harmful, for physicists, and tolerate it only if devoted to elucidate and legitimate the autonomous progress of science. The arguments offered above should already convince the reader of the naIvety of this position. In addition, 106 CLAUDIO GAROLA one can remind that the birth of quantum mechanics was accompanied by a lively epistemological debate on the aims of physics, the basic requirements that a physical theory must fulfil, the kind of sentences that can be accepted as meaningful in its language, and so on. The famous paper written by Einstein, Podolski and Rosen in 1935 [9] (see also Bohm, [3]), in which the authors intended to show that quantum mechanics was not a complete physical theory, and Bohr's answer to it [5], defending the opposite thesis, can be taken as typical examples of this debate. Therefore, it is apparent that the founders of quantum mechanics were well aware of the relevance of the epistemological background on which physical theories are constructed, even from a strictly technical viewpoint (indeed, the way in which physicists collect, select and organize physical data is largely affected by their underlying convictions about physics, methodology and knowledge, which also guide the choice of mathematical tools and theoretical entities). On the other hand their convictions have not to be taken as definitive and eternal truths, even because the links between the epistemological premises and the body of a physical theory are not one-to-one, which makes every theory (hence, in particular, quantum mechanics) liable to different interpretations, or modifiable so as to fit in with different conceptual frameworks. The bias against the philosophical reflection on the roots of quantum mechanics, on the contrary, leads, on the one hand, to underestimation of the relevance of the epistemological premises in this theory, and, on the other hand, to implicit acceptance (sometimes with incongruity and misunderstanding) of the viewpoint inherited from the tradition, making it unquestionable. The above remarks being taken into consideration, the rest of this paper will deal with a recent attempt at developing the idea of coping with the paradoxes of quantum mechanics by analyzing, criticizing and then changing the epistemological premises more than the formal apparatus of this theory. If looked at without prejudices, a solution of this kind should be rather appealing for physicists, since it could lead to avoiding paradoxes without renouncing all the patrimony of acknowledged achievements of quantum mechanics; furthermore, it will be seen that the change of the basic paradigma of this theory provides some interesting suggestions on possible ways of going beyond it that have never been explored before, even by the authors who tried to produce alternative theories or interpretations (see, e.g., Garola, [15], [16], [17]; Garola and Solombrino, [22], [23]). A NEW QUA N TUM PHI LOS 0 P H Y FOR Q M 107 2. QU ANTUM MECHANICS AND TRUTH To follow the program described at the end of the Introduction, it is convenient to begin with some general considerations that will constitute a standpoint for a critical analysis of the standard quantum philosophy. Whenever one starts reasoning (which is one of the basic functions of the language), one adopts, consciously or not, some general procedures, which can be summarily described as follows. Firstly, he explicitly asserts or implicitly accepts some preliminary statements (or premises), part of which are factual statements ("Socrates is a man" , in a famous syllogism of Aristotelian logic), part are general statements ("all men are mortal"). Then, he deduces some consequences from these premises ("Socrates is mortal"). This deduction cannot be arbitrary (a reasoning that deduces from the above premises that "Socrates is Chinese" would not be accepted), hence it follows some rules (deduction rules) that can be analyzed and made explicit. In order to understand the deep nature of these rules, note that when asserting or accepting some preliminary statements we mean that these statements are true; furthermore, our conclusions are also retained to be true if the deduction procedure is correct. Thus, one can maintain that the deduction rules, whatever they may be, are used in order to recognize that some new statements are true, being based on the knowledge that some initial statements (the premises) are true. The explicit form of the deduction rules has been studied for a long time, and this research has generated a huge body of knowledge and theories, classical Logic included. In the context of logical studies, the classical concept of truth supported by classical Logic plays a crucial role, and it is well known that it has been elucidated and formalized by the Tarski theory of truth (see, e.g., Tarski, [37], [38]). It is then important to point out some fundamental features of this classical concept that are relevant to our aims in this paper. (i) The definition of a truth value (true/false) for the statements of a language is given by means of a set theoretical model. Whenever the language is such that this model applies, any statement in it has a truth value, independently of any assumption on the actual existence of the entities mentioned in the statement itself ("a unicorn is a feline" is false in the context of a story, even if the actual existence of unicorns can be seriously questioned). But the fact that a truth value is defined does not mean that it is known, or that it can be obtained by means of some (empirical or logical) procedures: truth and access to truth are different concepts. This is a crucial point that must be carefully pondered, since it frequently occurs in the everyday life that we sometimes separate and sometimes identify these concepts. In order to grasp it better, one can 108 CLAUDIO GAROLA resort to a mathematical analogy: one can well prove that the solution of a given differential equation exists, but he can have no means for actually producing that solution. Furthermore, think of the statement "this match is wooden and completely combustible". If one decides to verify first whether the second statement is true, he will not be able to verify, after burning the match, whether the first statement is true: nevertheless, he is however sure that the first statement has a truth value, that he could easily check if he decided to consider it first. It follows from the above remarks that one can plainly talk in classical Logic of the truth value of a statement without implying that he possesses a procedure for getting to know that value. (ii) Intuitively, the set theoretical model provided by Tarski for defining the concept of truth can be summarily described as follows. The statements of a language can be divided into elementary (or atomic) statements and complex statements. Elementary statements state a property of an individual object (or a relation among some individual objects, but this more general case will not be considered here for the sake of simplicity), e.g., "sun is bright". Complex statements are obtained by using elementary statements and connectives as "not", "and", "or", "implies" , or quantifiers as "for every", "exists". Then, in the set of all individual objects considered in the language, or universe of the language, a property is represented by a subset, and an elementary statement which attributes a property to a given object is true if the object belongs to the subset representing the property, false otherwise. Furthermore, the negation of the same statement, obtained by using the connective "not", is true if the object belongs to the complement in the universe of the set representing the property, false otherwise; the conjunction of two statements, obtained by using the connective "and", each attributing a property to the same individual object, is true if the object belongs to the intersection of the two sets representing the two properties, false otherwise; the disjunction of the same two statements, obtained by using the connective "or", is true if the object belongs to the union of the two sets representing the two properties, false otherwise; a complex statement which states that all individual objects having a given property P also have another property Q is true if the subset representing P is contained in the subset representing Q, false otherwise; and so on. By means of the above definitions one can obtain rules for attributing truth values to complex statements whenever the truth values of all elementary statements that appear in them are known (a simple example of rules of this kind is provided by the famous truth tables of the part of classical Logic called Propositional Logic). A NEW QU ANTUM PHILOSOPHY FOR QM 109 (iii) By combining the rules considered at the end of (ii), one can obtain deduction rules. These can be considered procedures for attaining knowledge of the truth of some given statements, whose truth values are defined but unknown, on the basis of the knowledge of the truth of some other statements (the premises), consistently. with our initial remarks on the deduction process in natural languages. Furthermore, one can introduce a logical order (actually, a pre-order) on the set of all statements, defined as follows: a given statement is smaller than another given statement if, whenever the former is true, then the latter is true. (iv) As it has already been stressed, the premises on which one applies a deduction procedure usually are not homogeneous: for, some describe factual situations, some state general laws. These laws usually are part of a theory, and it is apparent that this theory can be complete, i.e., such that its laws, together with suitable sets of factual premises, are sufficient for determining the unknown truth value of any statement of the language, or incomplete if this does not occur. The above features of classical Logic provide an intuitive insight of the way in which this discipline explains and systematizes the inference procedures of natural languages. However, these procedures are so complex and sometimes seemingly contradictory that the theoretical settlement supplied by classical Logic is not uncontroversial. It is well known, for instance, that the verificationist conceptions of truth deny the possibility itself of defining truth independently of the procedures that lead one to determine the truth values of the statements of a language: in other words, a truth value is defined for a given statement only if a procedure exists which allows one to determine effectively whether the statement itself is true or false. A relevant example of a formal logic that adopts a verificationist concept of truth is provided by Intuitionistic Logic, the position of which will be called logical verificationism in the following, since it collapses the concept of truth into the concept of logical verification (at least in the case of complex statements: elementary statements must have empirical proofs, see, e.g., Dalla Pozza and Garola, [8]). It is then important to note that logical verificationism does not simply reduce the set of statements that can be thought of as having a truth value, but implies deep changes in the logical rules themselves that enter the deductive process. By imitating this illustrious example, the standard interpretation of quantum mechanics adopts a similar attitude requiring an empirical proof in every case. Indeed, the standard philosophy of quantum mechanics regards as "metaphysical" every attempt of introducing physical properties or entities which, in principle, cannot be observed, and deprives of meaning all statements in a theory whose truth or falsity 110 CLAUDIO GAROLA cannot be checked by means of suitable measurements. To be precise, it maintains that no statement which cannot be verified by means of a suitable measurement procedure has a truth value and can be accepted as meaningful in the language of physics. The above quantum philosophy, that will be called empirical verificationism in the following since it collapses the concept of truth into the concept of empirical verification, both for elementary and for complex statements, may seem at first glance to express only a natural physicist's refusal of statements that cannot be justified on an experimental ground. But a deeper insight shows that it has a number of traumatic consequences that make it far more problematical than the logical verificationism of Intuitionistic Logic. Thus, we dedicate the next section to a brief discussion of some of these consequences that are relevant for our purposes. 3. THE SN ARES OF EMPIRICAL VERIFICATIONISM As we have anticipated at the end of Section 2, empirical verificationism has a number of consequences that are epistemologically problematical. Let us discuss some of them. (i) The fact that a statement has or not a truth value depends, according to empirical verificationism, on its verifiableness (or testability), hence it depends on the measurement theory that is adopted, which is a physical theory. But eliminating all statements that are meaningless since they are not verifiable entails radical changes in the structure of our language, hence, in particular, it modifies the deduction rules, which thus turn out to depend on the physical theory that one wants to express by means of the language itself. This entails that one cannot state any a priori rationality criterion which is independent of the theory. One can summarily say that the collapse of truth and empirical access to truth produces the collapse oflogic into physics (see, e.g., Putnam, [33]; Finkelstein, [10], [11]; we insist again on the fact that this goes well beyond the position of Intuitionistic Logic, where the distinction between the logic of a theory and the theory itself can be preserved: this is witnessed by the fact that the "logic of quantum mechanics" , or Quantum Logic, has a mathematical structure which is different from that of Intuitionistic Logic). This collapse can be seen as a primary source of quantum paradoxes: for, many of these follow from the attempt at applying physical rules to logical arguments. (ii) The specific theory that is being considered, i.e., quantum mechanics, produces a disconcerting situation whenever the above verificationist A NEW QUANTUM PHILOSOPHY FOR QM 111 criterion of meaning is applied to elementary statements that attribute properties (equivalently, measurement values) to individual samples of physical systems. Indeed, these statements cannot be neatly divided once and for all into meaningful and meaningless. On the contrary, there are physical contexts (that can be chosen by the observer himself) in which some statements are meaningful, other meaningless, and different physical contexts in which the former statements are meaningless, the latter meaningful. This can be easily understood if one reminds the famous Heisenberg uncertainty principle: indeed, loosely speaking, it follows from this principle that, if the position of a given particle is completely known, then it is meaningless to say that it also has a (even unknown) velocity (to be precise, a momentum), and, conversely, if the velocity of the same particle is completely known, then it is meaningless to say that it also has a (even unknown) position. But in this way subjectivity enters into physics, at least in the sense that, by choosing the physical verification context, one chooses which properties are meaningful and which are meaningless for the inquired object. From this viewpoint, such strange features of quantum mechanics as nonlocality and contextuality (see Section 1) can be guessed even without the aid of the deep theorems of Bell and Bell-Kochen-Specker. (iii) Consider a molecular complex statement, i.e., a statement which is obtained by connecting elementary statements attributing properties to a given individual sample of a physical system by means of connectives as "and", "or", etc .. The general principle of empirical verificationism establishes that this statement is meaningful if and only if it can be verified, that is, if and only if there exists an apparatus (possibly compounded of many sub-apparatuses, suitably linked by means of electronic or mechanical connections) which yields one of the outcomes true andfalse whenever applied to the sample that is being inquired. But, if this occurs, the measuring apparatus can be looked at as testing a physical property: hence, the complex statement that we are considering is logically equivalent to an elementary statement that attributes this property to the given sample. This implies that the complex statement itself is meaningful if and only if it is equivalent to an elementary statement, which constitutes a particular but relevant aspect of the collapse of the logical structure induced by empirical verificationism and discussed in (i). The above consequences of the basic philosophy of quantum mechanics have deeply influenced the epistemological thought of our century, and many speculations can be classified as attempts of coexisting with them. But one can also legitimately wonder whether empirical verificationism is epistemologically sound and, more important, if it is actually essential 112 CLAUDIO GAROLA in physics; in particular, one can wonder whether it is so inherent to quantum mechanics that it could not be renounced without losing all physical knowledge provided by this theory. Now, the answer to all these questions is negative in our opinion. Let us discuss this decisive point in some details. Firstly, consider the problem of philosophical soundness. It is well known that a number of logicians and epistemologists (see, e.g., Russell, [34]; Popper, [32]) have argued against the verificationist concept of truth, observing that the concept of verification itself presupposes that something is verified, which is just the truth of the statement that is being considered: thus, verification and truth cannot be identified. This rather abstract objection (which however is culturally relevant, since collapsing concepts that have been disentangled after centuries of philosophical efforts can hardly be considered a progress) applies to every form of verificationism (in particular, to logical and empirical verificationism). But in the case of empirical verification ism one can add some further criticism: for instance, one can observe that the standard justification for adopting this position, that is, freeing physics from old metaphysical hindrances, actually reveals the incapability of distinguishing between the semantic concept of truth (which can be defined by means of a set-theoretical model, as in classical Logic) and the ontological concept, according to which asserting that a statement is true means to postulate the actual existence of the objects that are mentioned in it. Secondly, consider the possibility of avoiding a verificationist position in quantum mechanics. This appears to be precluded if one accepts the fundamental quantum doctrine of nonobjectivity of physical properties reported in the Introduction, which seems unavoidable since it is supported by some technical results (the Bell and Bell-Kochen-Specker theorems) that are internal to quantum mechanics itself. But it has already been observed that the support to the fundamental quantum doctrine provided by these results depends on an interpretation that contains the quantum doctrine itself. If this loop is broken, a number of possible alternatives can be conceived to the adoption of empirical verificationism as the basic philosophy of quantum mechanics, one of which will be discussed in detail in the next Sections. 4. A L T ERN A T I V EST 0 EM P I RIC A L V E R I F I CAT ION ISM The difficulties inherent in the orthodox interpretation of quantum mechanics and in all interpretations that basically accept empirical verificationism are largely discussed in scientific literature. But, usually, they A NEW QUA N TUM PHIL 0 SOP H Y FOR Q M 113 are not explicitly ascribed to the adoption of empirical verificationism itself. Furthermore, the alternative interpretations that could occur if this doctrine is abandoned are not seriously taken into account, since the identification between the semantic and the ontological concept of truth mentioned at the end of Section 2 leads most scholars to retain that they are necessarily "metaphysical" or ingenuously realistic. These attitudes are maintained even in very sophisticated and advanced studies in quantum mechanics. As an instance, let us consider the recent book on the quantum theory of measurement by Busch, Lahti and Mittelstaedt [7]. In the first Chapter the Authors discuss the difficulties encountered by every interpretation of quantum mechanics which is based on the following assumptions: (i) quantum mechanics deals with properties of individual systems and not only with measurement outcomes; (ii) quantum mechanics is a complete theory (see Section 2, (iv); the notion of completeness can be extended beyond the framework of classical Logic, and intuitively means that quantum mechanics says all what is meaningful to say about the physical systems that it describes); (iii) the properties of physical systems are non-objective in the sense specified by the fundamental quantum doctrine illustrated in the Introduction. The authors point out that these assumptions entail, in particular, that one has to face the objectification problem, that is "the question of how definite measurement outcomes are obtained", since the properties are non-objective. Then, they observe that some interpretations of quantum mechanics, as the many-worlds, witnessing and modal interpretations, as well as those stressing the decisive status of the observer, "can be read as various attempts to live with the insolubility of the objectification problem" , and that it may finally turn out that only a kind of unsharp objectification is possible if the above assumptions are maintained: but they do not recognize explicitly that the objectification problem follows from adopting empirical verificationism, which implies a number of consequences that go beyond the objectification problem itself. In addition, Busch, Lahti and Mittelstaedt take into account the possible alternatives that keep the above-mentioned fundamental assumption (i), while abandoning (ii) and (iii), that is, the objectivity/incompleteness interpretations that renounce the requisite of completeness of quantum mechanics but repropose its "objectivity": yet, the different possible meanings of the latter term are ncit analyzed, and this may lead the reader to interpret it in the ontological sense, hence to reject immediately these interpretations as "metaphysical". 114 CLA UDIO GAROLA It must be also noted that a further bias against objectivity/incompleteness interpretations arises because these interpretations are often identified with hidden variables theories (i.e., theories that attempt to explain quantum mechanics in terms of underlying variables that have a classical behaviour but are impossible to observe), which have been for a long time a taboo for every well-bred physicist. Owing to the above-mentioned reasons most scholars conclude that no proper alternative exists for accepting assumptions (ii) and (iii), even if they lead to still unsolved problems. However, this conclusion can be rejected if the arguments on which it is based are confuted. Let us discuss this confutation in detail. First, in the objectivity/incompleteness interpretations the term "objectivity" may have different meanings. To be precise, it can be intended in an ontological sense, as seen above, which means in particular that the theoretical entities of quantum mechanics represent actually existing entities. But it can also be interpreted in a semantic sense, which means that one is allowed to state meaningful complex statements about the properties of physical systems in quantum mechanics independently of any direct measurement process, and even independently of any ontological assumption. This alternative follows from the possibility of defining the truth value of a given complex statement by means of a settheoretical model or by means of logical procedures, as we have seen in Section 2, but in any case independently of the existence of an empirical procedure that allows one to determine the value itself by means of a measurement. Second, the objectivity/incompleteness interpretations cannot be identified with hidden variables theories. Indeed, Kochen and Specker [26] have proved that a hidden variables model exists even in the case of orthodox quantum mechanics if the conditions imposed on hidden variables for retaining them physically acceptable are sufficiently weak. More generally, one can conceive contextual and/or nonlocal hidden variables theories (see, e.g., Mermin, [30]) whose "objectivity" is problematical (the measuring apparatus testing a physical object which has interacted with other objects in the past may instantaneously modify the properties of these objects, even if they are far away). On the other hand, we shall see that objectivity/incompleteness interpretations exist that have no hidden variables models which satisfy the more restrictive conditions required by Kochen and Specker "for the successful introduction of hidden variables" in the paper quoted above. Our confutation opens the way in which the objectivity /incompleteness interpretations that escape the standard criticism can be introduced. The desirability of an approach of this sort to quantum me- A NEW QU ANTUM PHILOSOPHY FOR QM 115 chanics clearly follows from the paradoxical consequences of empirical verificationism pointed out in Section 3, (i), (ii), (iii), and from the epistemological and logical arguments that can be offered against it (see again Section 3). Therefore, we purpose to present in the next Sections the general lines of a recent approach to physical theories, called Semantic Realism, in which empirical verificationism is given up and the term "objectivity" is intended in a purely semantic and non-ontological sense. We shall see that, within this approach, quantum mechanics turns out to be incomplete, so that the interpretation suggested by Semantic Realism actually belongs to the class of objectivity/incompleteness interpretations. 5. THE BASIC IDEAS OF SEMANTIC REALISM The main aim of Semantic Realism is to provide a general .logical and linguistic framework suitable for expressing a wide set of physical theories (classical and quantum mechanics in particular). Its basic choice is the rejection of empirical verificationism. But, of course, this choice is not sufficient for characterizing the philosophy of Semantic Realism completely, since, in particular, it does not specify the logic that one has to adopt when constructing a language for physics. However, the criticism at the end of Section 3, which applies to every form of verificationism, suggests that one should avoid all logics based on a verificationist concept of truth (hence, in particular, to avoid Intuitionistic Logic). On the other hand, the convenience of making quantum reasoning closer to everyday physicist's and mathematician's reasoning (see Section 1) strongly recommends the adoption of a classical theory of truth, hence of classical Logic. Thus, this adoption is explicitly done by Semantic Realism. The natural subsequent step consists in constructing a language for expressing physical theories that has a classical logical structure (this language must prove to be able, in particular, to provide a new interpretation and to express properly quantum mechanics in order to get over the incorrect but unavoidable objection that adopting classical Logic constitutes a step backward, which reintroduces classical models and/or ingenuous realism). This could be done informally, as usual in physics, by choosing a suitable part (or fragment) of a natural language and introducing into it new technical terms, whose meaning is defined in a precise and accurate way: the mathematical apparatus of a specific physical theory would then be formulated and formalized by means of this enriched part of the natural language, and the interpretation of the old and new 116 CLAUDIO GAROLA terms of the language would provide the core of the partial interpretation of the theory into the observative domain (see the schematic description of physical theories supplied in the Introduction). However, this informal procedure would not prevent all semantic ambiguities inherent in the use of a natural language, which might be decisive when dealing with paradoxes that can be avoided only by introducing subtle distinctions requiring semantic exactness. Thus, one could think it convenient to provide a complete formalization of the language of physics (not only of its mathematical apparatus): but this would be exceedingly complicated, and produce a cumbersome formal structure that would hinder intuitive reasoning. Therefore, Semantic Realism adopts a compromise, accepting the standard language of physics as its general language and formalizing mainly that part of it which is interpreted on the observative domain (the observative language: to be precise, Semantic Realism aims to construct a classical formal language in which the observative languages of a class of physical theories that contains classical and quantum mechanics can be regimented). Let us try to give an intuitive insight into the basic ideas inspiring this formalization and its use for coping with old and new quantum paradoxes. In order to do this, let us begin with some primitive notions that refer to the observative domain on which the formalized language must be interpreted. (i) First, one introduces the notion of laboratory. This simply is a space-time domain in the actual world, i.e., a portion of space (e.g., a given room) associated with an interval of time (e.g., one day). The convenience of introducing this notion is made apparent by some obvious remarks. Indeed, all physical experiments are performed in a certain place at a given time, and they are are often repeated (in the same place, at different times) in order to control their results again and again. They must be intersubjective (i.e., different experimenters in different places and at different times must be able to reproduce them). Therefore, one is naturally led to concentrate his attention on those portions of space and time in which physical experiments are performed, introducing the above notion of laboratory (but it must be noted that this is somewhat different from the non-technical notion used in natural languages, which usually does not make explicit reference to a time interval). (ii) Second, one introduces the notions of preparation and dichotomic registering device. In order to understand them, think of the simplest experiment (yes-no experiment) that can be conceived. Intuitively, this consists of a device (the preparation) that prepares an individual sample of a given physical system (which can be macroscopic, as a tennis ball, or microscopic, as a photon or an electron), and of a second device A NEW QUANTUM PHILOSOPHY FOR QM 117 (the dichotomic registering device) that performs a test on the sample, yielding one of two possible outcomes (yes/no, true/false, 1/0). Thus, the notions of preparation and dichotomic registering device come out in a natural way and their introduction is justified. The above scheme for a yes-no experiment (which is inspired by Ludwig's analysis of physical experiments [28]) needs however some further comments. Therefore, let us note that actual experiments, which usually are far more complicate, can be often thought of as composed of a number of elementary yes-no experiments that are performed at a given time, and then repeated at different times. It follows that the elementary experiment described above does not represent a particular case, but plays a central role in the conceptual construction of a theory. Furthermore, let us stress that the reference to an individual sample of a given physical system can be eliminated if one wants to avoid the slanderous accusation of introducing metaphysics into physics by assuming the actual existence of some microscopic objects. Indeed, it suffices to correlate directly the act of preparing with the outcome provided by the dichotomic registering device, avoiding the reference to an intermediate physical system. This choice would however be very expensive, both with concern to language and to conceptual economy. Therefore, Semantic Realism avoids it, and introduces the specific term physical object in place of the more cumbersome expression "individual sample of a given physical system": the reader may freely decide whether this new term is only a shortcut or actually denotes some existing entity. (iii) By using the basic notions discussed in (i) and (ii), one can introduce a number of derived notions that make the formalization process carried out by Semantic Realism intuitively clear. In order to do this, let us note first that in every physical theory there are preparations that are considered physically equivalent. Thus, preparations can be grouped into equivalence classes, and these classes are called states. Analogously, there are dichotomic registering devices that are considered physically equivalent. Thus, dichotomic registering devices can be grouped into equivalence classes, and these classes are called effects. The derived notions of states and effects are basic in Semantic Realism. Therefore, we note explicitly that they depend, through the concept of physical equivalence, on the theory whose observative language is being formalized, and that, in spite of this, the abstract structure of the language itself will not depend on the theory. Moreover, a general definition of physical equivalence for preparations and dichotomic registering devices will be provided in Section 6, (iii). (iv) Let us look at a given laboratory i and consider all physical objects that are prepared in it by repeating the same preparation at different 118 CLAUDIO GAROLA times or by activating different preparations. The (finite) set of all these objects is then called the domain Di of i. Inside this domain, one can select all objects that have been prepared by preparations belonging to a given state S: the subset of all these objects is called the extension of S in i, and if x is a physical object belonging to this extension, one briefly says that "x is in the state S (in i)". Analogously, one can select every object that would give answer yes if a test should be performed on it by means of a dichotomic registering device belonging to a given effect F: the subset of all these objects is called the extension of F in i, and if x is a physical object belonging to this extension, one briefly says that "x would induce answer yes (or true, or 1) in the effect F (in i)". It is important to observe that a physicist adhering to the standard quantum philosophy would object that the above definition of the extension of an effect F is not acceptable. Indeed, one can decide whether a given physical object x belongs to the extension ofF by performing a test on it by means of a dichotomic registering device that belongs to F. But it is well known that there are in quantum mechanics non-compatible (i.e., mutually exclusive) dichotomic registering devices: hence, if one has performed the above test, he can never know whether x belongs to the extension of another effect G made up by dichotomic registering devices that are non-compatible with those in F. It follows that the extension of G is not defined, for it has no meaning to refer to what "would have happened" if one had performed a test by means of a dichotomic registering device belonging to G. Since, of course, the argument holds even while interchanging F and G, the orthodox quantum physicist concludes that one cannot associate a definite extension to every effect. The above conclusion, however, clearly follows from the fundamental quantum doctrine illustrated in the Introduction, which ensues from adopting empirical verificationism. The contrary assumption that the extensions of effects are defined subtends the rejection of this philosophy, and is based on the fact that the extension of any effect F can be actually exhibited in a laboratory i by performing a test by means of a dichotomic registering device belonging to F on all physical objects in the domain D i . This is considered sufficient, according to Semantic Realism, for maintaining that the extensions of all effects are defined even if they cannot be exhibited conjointly. (v) From the viewpoint of logic, the definitions in (iv) allow one to construct a formal language L that has the set of states and the sets of effects as sets of predicates, standard connectives, quantifiers and formation rules, and which furthermore is endowed with a built in classical logical structure. Indeed, in the laboratory i the domain Di is the "universe" for this language in the sense specified in Section 2, (ii). Fur- A NEW QU ANTUM PHILOSOPHY FOR QM 119 thermore, states and effects are represented by their extensions, that are subsets of the domain Di, just as the "properties" in Section 2, (ii), are represented by subsets of the universe. Thus, one assumes that an elementary statement of the form: S(x): the physical object x is in the state S, or F(x): the physical object x would induce answer yes in the effect F, is true if and only if the object x belongs to the extension in i of S, or F, respectively. The truth of complex statements obtained by using elementary statements, connectives and quantifiers, is then defined by means of standard conventions in classical Logic (see again Section 2, (ii); it is important to remind that these conventions define the truth of complex statements independently of the existence of empirical or logical procedures that allow one to determine effectively the truth values of the statements themselves). The above remarks conclude the illustration of the basic ideas that inspire the attempt, carried out by Semantic Realism, at providing a classical formal language in which the observative languages of classical and quantum mechanics, in particular, can be regimented. However, one must go farther if he wants to use the language L for treating problems in these theories. More specifically, there are properties of states and effects, and relations among them, that hold both in classical and in quantum mechanics and that must be explicitly analyzed and possibly expressed by means of L. We intend to provide in the next Section an intuitive outline of these properties and relations. 6. S TAT E S, E F FEe T S, AND QUA N TUM LOG I C In order to realize the program presented at the end of Section 5, let us discuss some fundamental features of states and effects within Semantic Realism, also introducing a number of derived definitions that are useful in the following. We note that some items in this section are rather technical, especially (vii), (viii) and (ix), even if we try to discuss them in a non-technical way. We apologize for bothering the reader with a number of details: but, unfortunately, they are important, since the roots of some paradoxes in quantum mechanics can be found in some subtle consequences of the standard quantum philosophy criticized in Sections 2 and 3. However, we have tried to make the rest of this paper understandable even if some of the technical points discussed in this section are not completely clear to the reader. 120 CLAUDIO GAROLA (i) It follows from the definition of extension of a state S that the extensions of different states in a laboratory i have empty intersection. Indeed, no actual physical object in the domain Di of a laboratory i can be prepared by more than one preparation, hence no physical object can belong to more than one extension of state. Moreover, the union of all extensions of states in Di coincides with Di (mathematicians would say that the set of all extensions is a partition of D i ; it must be noted that this is a typical feature of Semantic Realism, which does not occur in other axiomatic approaches, as, for instance, Ludwig's). On the contrary, the extensions of two (or more) different effects may have non-empty intersections. Indeed, it may occur that a given physical object would induce answer yes in any dichotomic registering device belonging to one of the effects, and also in any dichotomic registering device belonging to the other. Taking into consideration the above remark, the extensions of effects can be used for introducing an order relation on the set of all effects. Indeed, one says that an effect F is smaller than an effect G if and only if the extension of F is contained in the extension of G in every laboratory. This order is partial, which intuitively means that not every pair of effects is such that one of its members is smaller than the other; thus, one obtains the partially ordered set (briefly poset) of all effects, which is (because of its interpretation and structure) an effect algebra in a sense made familiar to scholars through the literature on the foundations of quantum mechanics. It must be stressed that the order defined on the set of all effects is empirical, in the sense that it depends on the physical relations that occur among different effects. However, whenever F is smaller than G, in every laboratory a physical object that belongs to F also belongs to G: thus, if the statement F(x) is true, then also G(x) is true. A logician would say that F(x) is smaller than G(x) according to the standard definition of logical order in a formalized language (see Section 2, (iii)). This point should be carefully pondered. Indeed, the same logician would also say that the statement F(x) is smaller than F(x) or G(x), since, again, if F(x) is true, then F(x) or G(x) is true. But, now, this conclusion is independent of any empirical relation between F and G, since it depends only on the conventional definition of the connective or. Thus, one must be well aware that the standard "logical order" contains both an empirical and a strictly logical part, which is a relevant distinction in the following. (ii) The yes/no outcomes in every dichotomic registering device can be exchanged, thus obtaining a new dichotomic registering device that is obviously inequivalent to the original one. If the exchange of outcomes is A NEW QU ANTUM PHILOS OPHY FOR QM 121 done in all devices belonging to a given effect F, one obtains a new effect, say FC, that can be called the complement of F. In every laboratory i, the extension of FC obviously is the complement in Di of the extension of F. Moreover, a statement of the form FC(x): the physical object x would induce answer yes in FC is true if and only if the statement F(x): the physical object x would induce answer yes in F is false. This implies that the elementary statement FC(x) is logically equivalent to the complex statement not F(x) (or, symbolically, -,F(x)), which constitutes an important instance of logical equivalence between elementary and complex statements. (iii) It is convenient to select only preparations and dichotomic registering devices that yield definite frequencies when carrying out yes-no experiments. More precisely, let a given preparation be repeated a great number of times in a given laboratory, so as to produce an ensemble of identically prepared physical objects (hence, of physical objects that are in the same state), and let a dichotomic registering device be used in order to perform a test successively on all physical objects in the ensemble. The ratio between the number of yes answer that one has obtained and the number of elements in the ensemble provides the frequency of the yes answer. One can then repeat the whole experiment in another laboratory, obtaining a new frequency. There is no a priori reason for expecting that the two frequencies coincide. But one convenes now that preparations and dichotomic registering devices are chosen in such a way that this actually occurs whenever the ensembles contain "a great number" of physical objects; in other words, we convene that one can define a probability of the yes outcome as the limit (in the statistical sense) to which frequencies approach whenever the number of elements in any considered ensemble becomes larger and larger. The above convention allows one to introduce some assumptions that are physically quite natural. More specifically, let us consider two preparations that produce the same frequencies for every dichotomic registering device whenever an experiment of the kind described above is done: then, we assume that the two preparations are physically equivalent. Analogously, two dichotomic registering devices that produce the same frequencies for every preparation are assumed to be physically equivalent. Thus, one gets two explicit definitions of physical equivalence which allow one to decide whether some preparations or some dichotomic registering devices are physically equivalent. (iv) It may happen that the extension of a state S in a laboratory i is contained in the extension of an effect F. This means that all physical objects in the state S would induce, in i, answer yes in the effect F. 122 CLAUDIO GAROLA More generally, one can consider the set of all effects whose extensions contain the extension of a given state S in every laboratory. This set is called the certainly true domain of S: every effect in the certainly true domain of S is characterized by the fact that it yields answer yes in every laboratory when it is used in order to perform a test on a physical object in the state S (one briefly says that "the state S produces answer yes with certainty if tested by an effect which belongs to the certainly true domain of S itself"). In every laboratory i one can consider the intersection of the extensions of all effects that belong to the certainly true domain of a given state S. This intersection contains all physical objects that would induce answer yes in all effects that belong to the certainly yes domain of S, and has a crucial role in Semantic Realism. Therefore, we introduce the further symbol Pi(S) in order to denote it and briefly mention some of its more relevant uses. First, Pi(S) can be used in order to distinguish pure from mixed states, which is theoretically important from several viewpoints, in particular because the basic notions of many physical theories can be discussed by making reference to pure states only. According to Semantic Realism, a state S is said to be pure if and only if, in every laboratory i, its extension is the only extension of state which is completely included in the intersection Pi(S): intuitively, this means that S is pure if and only if it is the only state which produces answer yes with certainty if tested by an effect which belongs to the certainly true domain of S itself (it can then be proved that this definition coincides in particular theories, as classical and quantum mechanics, with the standard definitions of pure states provided by these theories). Second, Pi(S) can be used in order to introduce a binary preclusivity (nonreflexive and symmetric) relation on the set of pure states (for physicists: in quantum mechanics two pure states are in the preclusivity relation if and only if the two vectors representing them are orthogonal). Indeed, a pure state S can be said to be in this relation with another pure state S' whenever, in every laboratory i, the set Pi(S) has empty intersection with the extension of S'. Intuitively, this means that there is no physical object in the state S' which would activate answer yes on all effects in the certainly true domain of S. (v) The preclusivity relation introduced in (iv) can be used in order to pick out a special subset of the set of all effects: the set of all exact effects. We cannot enter here the technical procedures that allow one to make this selection (they mainly are based on introducing a suitable notion of closure for every subset of states, via the preclusivity relation). But it is important to grasp without ambiguity the intuitive interpretation A NEW QU ANTUM PHILOSOPHY FOR QM 123 of exact effects. An exact effect F is an effect which is associated with a physical property (as "being white", "being in the point P", "having velocity v"), and every dichotomic registering device in F can be used for testing whether this physical property holds for a physical object x. The set of all physical properties associated with exact effects is called the set of all testable properties (following a terminology known in the literature one could also say that exact effects correspond to sharp testable properties while non-exact effects correspond to unsharptestable properties). Since different exact effects obviously correspond to different testable properties, the correspondence between the set of exact effects and the set of testable properties is one-to-one, so that the two sets can be identified. Because of this identification, the statement F(x), with F an exact effect, is briefly interpreted as "the physical object x has the (testable) property F" , or "the (testable) property F is true for the physical object x", and every dichotomic registering device in F is said to test the property F on x. Let us analyze more carefully the concept of testable property. It is apparent from the definition of this concept that its introduction requires an idealization, since only ideal dichotomic registering devices (that can at most be "approached" by actual devices) can be used in order to test exactly whether a given property holds or not for a given physical object. More important, the concept of testable property must not be confused with the ambiguous concept of "physical property that can be tested in some way" that is rather usual in the language of physics. Indeed, a testable property can only be attributed to a physical object (i.e., to an individual sample of a physical system) and tested on it by means of a single act of testing: in logical terms, one would say that it is a first order property. On the other hand, there are in physics properties that refer to ensembles of physical objects (as the frequencies considered in (iii)), or even to ensembles of ensembles (for instance when one says that a given frequency is minimal in a given state): in logical terms, these are second or third order properties, respectively. Even these higher order properties can be tested: but their test actually consists of a (usually great) number of elementary tests of testable properties and of a successive comparison of the sets of results that have been obtained. It must be stressed that confusing physical properties of different logical orders may be a primary source of physical paradoxes. (vi) By making reference to testable properties, one can introduce two binary relations of compatibility (see [21], [22]) that have an intuitive interpretation, as follows. Semantic compatibility: the testable properties F l , F2 are semantically compatible iff they can be simultaneously true for a physical object x. 124 CLAUDIO GAROLA Pragmatic compatibility (or conjoint testability): the testable properties F}, F2 are pragmatically compatible (conjointly testable) if and only if they can be conjointly measured on a physical object x. The relation of semantic compatibility formalizes the intuitive idea that there are properties that can be simultaneously true for a given physical object (for instance, "being white" and "being round" in the case of a "white and round ball"). The relation of pragmatic compatibility formalizes the intuitive idea that there are properties that can be tested conjointly (for instance, "having x coordinate in the interval .6.x" and "having y coordinate in the interval .6.y" in quantum mechanics), and properties that cannot (for instance, "having x coordinate in the interval .6.x" and "having momentum along the x axis in the interval .6.px" if the product of the widths of the two intervals is less than 2~' again in quantum mechanics). These relations are obviously different (for instance, "having x coordinate in the interval .0.x" and "having x coordinate outside the interval .6.x" are not semantically compatible, since they can never be simultaneously true, but they are pragmatically compatible, since a test of the former also is a test of the latter, which is obviously true if and only if the former is false). Furthermore, pragmatic compatibility can be identified with the standard relation of compatibility in orthodox quantum mechanics. But if empirical verificationism is accepted, two testable properties can be semantically compatible (or not) only if they are pragmatically compatible, for it has no meaning to say that properties that cannot be measured conjointly can (or cannot) be "simultaneously true": thus, the relation of semantic compatibility is not defined on the set of all testable properties. On the contrary, according to Semantic Realism, the two compatibility relations are disentangled and both defined on the set of all testable properties: intuitively, this follows from the fact that the extensions of two testable properties are defined in every laboratory independently of the conjoint testability of the properties themselves. In particular, it may occur that two properties can be semantically compatible without being pragmatically compatible (e.g., "having spin along the x axis" and "having spin along the y axis" in quantum mechanics), which is an important novelty introduced by Semantic Realism. (vii) Let S be a pure state, i a laboratory, and let us come back to the intersection PieS) introduced in (iv). It is apparent that there is no a priori reason for maintaining that an effect exists which, whatever i may be, has just Pi (S) as extension. Whenever this occurs, we call testable support of the state S this effect, and denote it by F s. It is then easy to see that, if a testable support of S exists, it is the smallest element in the certainly true domain of S, according to the order defined in (i) A NEW QU ANTUM PHILOSOPHY FOR QM 125 (indeed, its extension in any laboratory is contained in the extensions of all effects that belong to the certainly true domain of S). The existence of a testable support F s for every pure state Scan be easily deduced both in classical and in quantum mechanics if one refers to the standard interpretation of these theories (for physicists: if I tp > is the vector representing the state S in quantum mechanics, Fs is the testable property represented by the projection I tp >< tp I); furthermore, Fs is an exact effect, i.e., a testable property). Therefore, pure states are often identified with the testable properties that are their testable supports in classical and in quantum mechanics (see, e.g., [31]). However, this identification is misleading in the latter theory, and originates some old quantum paradoxes (as Furry's, see [12], [13]' and [23]). Indeed, in classical Mechanics the extension ofF s in any laboratory i coincides with the extension of S, since there is in this theory at least one (possibly ideal) dichotomic registering device that yields answer yes if and only if a physical object is in the state S. But, on the contrary, the extension of F s in quantum mechanics is usually greater than the extension of S, since there are states in this theory that are different from S and induce answer yes in the effect F s with a nonzero frequency in every laboratory (for physicists: these are states that are represented by vectors which are not orthogonal to the vector representing S). This difference between the two theories outlines the probabilistic character of quantum mechanics and shows that, even if the correspondence between pure states and their testable supports is one-to one, it is not possible in this theory to identify pure states with their testable supports with regard to their extensions. Synthetically, one can say that states must not be confused with testable properties in quantum mechanics. (viii) In addition to the remarks in (vii), the assumption itself that a testable support exists for every pure state can be seriously questioned in quantum mechanics whenever compound physical systems are considered (a compound physical system can be intuitively defined as a system formed by various subsystems, which can obviously interact). Indeed, the states of such a system can be divided into two classes, the first type states and the second type (or entangled) states (this partition is based on technical reasons that will not be discussed here). If'S is a second type state, it can be proved that S may be such that one cannot find in quantum mechanics a class of ideal dichotomic registering devices·, i.e., an exact effect, which satisfies the condition of being minimal in the certainly true domain of S. Should this be the case, no exact effect, i.e., (first order) testable property, can be considered as the support of S. Basing on the above criticism, Semantic Realism adopts a new and rather radical position: indeed, it defines a first type state S as a state 126 CLA UDIO GAROLA which has a testable support F s, and assumes that F s then is an exact effect. The second type states (which may exist or not in a given theory: indeed, they do not exist in classical Mechanics) are then defined as those states which have no testable support. It is interesting to note that many quantum physicists would disagree with the above definitions and assumptions, since they would object that some experiments have been contrived in quantum mechanics which test "whether a physical object is in the state S" whenever S is a second type state. Yet, this disagreement is based, in our opinion, on the lack of awareness of the fact that all these experiments actually test only second order properties (see (v)), since they consist in testing some first order properties of the different subsystems that form the compound system, repeating the test a great number of times on different samples of the system and then correlating statistically the obtained results. These second order properties (that can be called correlation properties) cannot therefore be confused with testable supports of states of the second type. The characterization of the first and the second type states provided by Semantic Realism is theoretically relevant. For, the implicit assumption that a testable support exists for every state is the basis on which some arguments stand which aim to prove that compound quantum systems cannot be separated (nonseparability intuitively means that the properties of two or more physical systems that have interacted in the past remain interconnected in such a way that testing a property on one of the systems influences the properties of the other system). If one drops out this assumption, this strange conclusion is avoided. It is then apparent that the aforesaid characterization constitutes a new branching point in which the interpretation of quantum mechanics provided by Semantic Realism differentiates from the standard interpretation (for physicists: note that dropping the assumption that a testable support exists for every state implies renouncing the assumption that every Hermitean operator represents a physical apparatus, at least in the case of compound physical systems). (ix) The breakdown of the one-to-one correspondence between pure states and their testable supports implies in particular that the partially ordered set (or poset) of all exact effects (this set is indeed ordered, since it is embedded in the bigger poset of all effects, see (i)) is not endowed, in general, with the mathematical structure of a lattice. However, one can resort to well known mathematical tools (completionprocedures, see, e.g., [14]) that allow one to enlarge the poset of all exact effects by adjoining to it further elements that transform it into a lattice. These new elements, of course, have not an operational interpretation: but they can be considered as theoretical (first order) properties, which are upper A NEW QUA N TUM PHI LOS 0 P H Y FOR Q M 127 (or lower) bounds of sets of testable (first order) properties. Thus, one obtains an enlarged set of physical properties which is partitioned into a set of testable properties and a set of theoretical properties. In this new enlarged context, a one-to-one correspondence between pure states and minimal properties can be recovered. Indeed, one can associate to every pure state S a support, which is a property that is minimal in the certainly true domain of S (which now contains also theoretical properties): this property is testable (testable support) if S is a first kind state, theoretical ( theoretical support) if S is a second kind state. (x) The completion of the poset of exact effects and the ensuing distinction between testable and theoretical properties is a typical feature of Semantic Realism, which has long ranging consequences. Indeed, on the one hand, it allows one to recover a structure of (complete, orthacomplemented) lattice, which can be identified, in the case of quantum mechanics, with the lattices that appear in a number of axiomatic approaches to this theory (see, e.g., [29], [27], [31]), often collected under the name quantum logic approach to quantum mechanics (see, e.g., [7]). Thus "Quantum Logic" is recovered by Semantic Realism as an empirical structure, since the order on it is empirical in the sense specified in (i). But, on the other hand, this Quantum Logic obtained by completion does not contain, in the case of complex systems, only observative elements (testable properties), but even elements that are non-observative (theoretical properties), at least in the sense that they do not correspond to first order properties that can be directly tested on a physical object. This presence of non-observative elements in the lattice associated to a compound physical system is a further important ingredient for avoiding quantum paradoxes, as we intend to show in the following. 7. MEANING, TESTABILITY AND VALIDITY OF PHYSICAL LAWS The topics discussed in Section 6 may have annoyed the reader who is not interested in a detailed exposition of the concepts and methods of Semantic Realism. But they allow us now to introduce some new general epistemological perspectives that are typical of Semantic Realism and have a deep physical meaning. (i) Basing on the completion procedures introduced in Section 6, (ix) , one can construct a new language, the language of properties Le , which has the set of all states and the set of all testable and theoretical properties as set of predicates, and is endowed with the same (classical) logical structure of L. This language is actually different from L: indeed, the set 128 CLAUDIO GAROLA of its predicates does not contain non-exact effects, while it may contain a set of new predicates that are interpreted as theoretical properties. Furthermore, it contains, as L, elementary and complex statements, and its statements are ordered by the standard logical order (see Section 2, (iii)). (ii) The poset of all elementary statements of Le of the form E(x), with E a property, endowed with the logical order, is obviously isomorphic to the poset of all testable and theoretical properties (this means that a one-to-one mapping exists which maps the former set onto the latter, preserving the order): therefore, it can also be considered a Quantum Logic. It is then interesting to observe that this name is now intuitively appropriate, since the elements of our new poset are statements and its order is logical. However, this order still depends on the interpretation of the predicates, consistently with the remark that logical order also contains an empirical part (see Section 6, (i)), so that even this Quantum Logic must be considered an empirical structure, not a strictly logical one. (iii) The statements of Land Le (all of which have a truth value) can be divided into testable and nontestable statements: intuitively, a statement is testable if its truth value can be tested by means of a suitable measurement procedure, nontestable if it cannot. Let us consider 1. All elementary statements of this language are obviously testable (which is a reasonable operational feature, since L aims to formalize observative languages, see Section 5). On the contrary, there may be complex statements in L that are nontestable: indeed it may occur that there is no way of testing the truth value of a complex statement which contains testable properties that are pragmatically non-compatible. Let us consider Le. This language may contain also elementary statements that are non-testable (indeed, every elementary statement E(x) of L e , where E is a theoretical property, is non-testable). Thus if the set of all theoretical properties is non-void, one cannot assert that Le formalizes an observative language, and Le provides the first instance of how theoretical (i.e., non directly observative) statements naturally enter the language of physics. Furthermore, a complex statement of Le may be nontestable, if it contains theoretical properties and/or testable properties that are pragmatically non-compatible. The existence of meaningful testable and nontestable statements in Land Le shows that the fundamental epistemological distinction between truth and empirical access to truth (see Section 2) is maintained in these languages. Moreover, a general criterion of testability can be introduced both in L and in Le. Indeed, the reasoning used in Section 3 in order to show that empirical verificationism implies that a molecular A NEW QU ANTUM PHILOSOPHY FOR QM 129 complex statement is meaningful if and only if it logically equivalent to a testable elementary statement can now be used in order to show that a molecular complex statement of L or Le is testable if and only if it is logically equivalent to a testable elementary statement. One thus gets a general criterion of testability, which can be used in order to pick out sets of testable statements of L or Le (for instance, all molecular complex statements which contain only pragmatically compatible testable properties are testable). (iv) The distinction between testable and non-testable statements in Land Le does not mean that the latter cannot be legitimately used in physics as theoretical statements: indeed they have a role, in particular, in the inferential procedures. Hence, both testable and nontestable statements of Le can be used in order to state physical laws. Therefore, one must distinguish two classes of physical laws, as follows. Empirical laws: these are expressed by testable statements, hence can be empirically checked. Theoretical laws: these are expressed by nontestable statements, hence cannot be empirically checked (at least directly). The above distinction between empirical and theoretical laws leads one to wonder about the epistemological status of these laws. It is then evident that this question has many possible answers, depending on the philosophical position that one decides to adopt. Therefore, let us make clear here the standpoint of Semantic Realism: this approach adheres to the operational philosopy underlying quantum mechanics and assumes that it must be retained in every physical theory, but it also maintains that this philosophy requires avoiding ontologization of physical laws and knowledge rather than adopting empirical verificationism. By applying the above viewpoint to theoretical laws it follows that these must be considered as formal structures, whose role consists in producing, via logical deduction, empirical physical laws that can be directly tested. The case of empirical physical laws requires a more detailed treatment. Therefore, let us preliminarily observe that they are commonly used, together with some auxiliary assumptions (or premises), in order to obtain physical predictions, i.e., true statements attributing physical properties to given physical objects (as an intuitive instance of this procedure, consider the classical syllogism quoted at the beginning of Section 2: here, "all men are mortal", "Socrates is a man" and "Socrates is mortal" play the role of a physical law, an auxiliary assumption and a prediction, respectively). It is then important to observe that the premises specify the physical situation to which the empirical law is applied, attributing properties (as "being a man") to the physical objects (as "Socrates") 130 CLAUDIO GAROLA that one is considering: we say that they determine a context. Now, the relations of semantic and pragmatic compatibility introduced in Section 6, (vi), suggest that various types of contexts can be distinguished, as follows. Contradictory contexts, in which semantically non-compatible properties are assumed. These contexts are inconsistent and can never occur, hence they are irrelevant for physics. Non-accessible physical contexts, in which semantically compatible but pragmatically non-compatible properties are assumed. These contexts are consistent and can in principle occur, but are not accessible to physical investigation, since one can never identify them empirically (indeed, the pragmatically incompatible properties that define a non-accessible context can never be tested conjointly). Accessible physical contexts, in which semantically and pragmatically compatible properties are assumed. These contexts are consistent and can occur; furthermore, they are accessible to physical investigation, since one can identify them empirically. The appearance of non-accessible physical contexts is a typical feature of Semantic Realism, and occurs because of the existence of physical theories, as quantum mechanics, where a non trivial relation of pragmatic compatibility is defined (it is evident that these contexts do not appear in classical Physics, where all physical properties can be tested conjointly, at least in principle). If one adopts the epistemological position of Semantic Realism specified above, that is, avoiding ontologization of physical laws, one concludes that empirical laws cannot be assumed to be true (in the sense that they may be true as well as false) in nonaccessible contexts: indeed, the theory itself prohibits that an empirical law be verified in a context of this kind. It follows that, if one wants to avoid "metaphysics", the validity of empirical laws must be limited, at least in those theories which introduce a non trivial relation of pragmatic compatibility. This limitation can be introduced by stating a new epistemological principle (Metatheoretical Generalized Principle, or, briefly, MGP), that we express here in an informal way as follows. MGP. A sentence expressing an empirical physical law (deduced or not from a general theoretical law) is true in every physical context in which only semantically and pragmatically compatible pmperties are assumed for each physical object that is considered (accessible physical contexts). The significance of MGP should not be underestimated. Indeed, classical Physics has accustomed physicists to assume implicitly the unrestricted validity of physical laws, which can be expressed by stating a Metatheoretical classical Principle (briefly, MCP), as follows. A NEW QUANTUM PHILOSOPHY FOR QM 131 MCP. A sentence expressing an empirical physical law (deduced or not from a general theoretical law) is true in every (non-contradictory) physical context. This principle is clearly adequate to classical Physics, where it is equivalent to MGP since non-accessible physical contexts do not occur. But it is questionable, according to Semantic Realism, in every physical theory where these contexts occur. By introducing MOP in place of MOP, Semantic Realism proposes to shift the antimetaphysical issue of the operational viewpoint from accepting empirical verificationism to renouncing any extrapolation of our knowledge beyond its empirical limits. We show in the next section that this viewpoint greatly helps in reconciling quantum mechanics with everyday physicist's intuition. Before closing this point, we would like to defend MGP against the natural objection that it produces an idealistic dependence of physical laws on the choices of the observer, as it seems to be witnessed by the term "physical context" that appears in it. Indeed, MGP introduces a kind of pragmatic contextuality which is deeply different from the semantic contextuality introduced by the standard interpretation of quantum mechanics (see Section 1). For, the latter makes the properties themselves of a physical object depend, to some extent, on the observer's decision of performing some possible experiments; the former only restricts the possibility of deducing relations among observation-independent physical properties from physical laws, the restriction depending on the character of partial knowledge attributed to these laws and not on the observer's choices. (v) Whenever the perspective of Semantic Realism is applied to quantum mechanics, this theory turns out to be incomplete with respect to the language Le; this means (see Section 2, (iv)) that the laws of quantum mechanics are not sufficient for determining the truth values of all statements of L e , even if suitable premises are assumed. Intuitively, this result follows from the probabilistic character of quantum physical laws, which usually provide only probabilities of truth values, not the truth values themselves, for statements attributing properties to physical objects in given states [16]. Since all physical properties in the language Le are either true or false for a given physical object, the interpretation of quantum mechanics provided by Semantic Realism belongs to the class of the objectivity/incompleteness interpretations, as anticipated at the end of Section 4. 132 CLAUDIO GAROLA 8. ON THE BELL AND BELL-KOCHEN-SPECKER THEOREMS Let us come now to the Bell and Bell-Kochen-Specker theorems, the relevance of which with respect to our thesis in this paper has been illustrated in the Introduction. Indeed, if the orthodox interpretation of these theorems is accepted, one of the basic assumptions of Semantic Realism (the existence of an extension for every effect in every laboratory, which allows one to attribute properties to physical objects independently of measurement procedures, see Section 5, (v)) would be invalid in quantum mechanics. In order to avoid this criticism, various proofs of the above theorems have been analyzed in a number of papers [18J, [19J, [20]' [23J and it has been shown that all of them are based on assuming the validity of some empirical (correlation) laws deduced from the general theoretical laws of quantum mechanics outside the domain of validity established by MGP: hence, these proofs are not correct according to Semantic Realism. Let us look into the above subjects in more detail. We consider first the Bell theorem. This theorem was stated by Bell in 1964 [lJ. Subsequently, the Bell argument was refined by a number of authors, who supplied accurate analyses of the premises on which it is based, and a number of alternative statements and proofs. Nowadays most authors agree that the Bell theorem shows that quantum mechanics conflicts with one at least of the following assumptions (see Selleri, [36J; note that assumption R, though called reality for conforming to current literature, actually imposes a weak reality condition: indeed, it does not entail that the mathematical objects that appear in the formal apparatus of quantum mechanics correspond to ontologically existing entities, but only requires that the properties of a physical object, or of a preparation procedure, do not depend on the choices of the observer). R (reality). The results of all conceivable measurements are simultaneously prefixed (even in the case of non-compatible observables). LOC (locality). Whenever two subsystems, say 1 and 2, of a compound physical system are sufficiently far apart, a measurement on subsystem 1 (2) does not modify the values of the observables of subsystem 2 (1). However, it can be proved that non-R and quantum mechanics imply non-LOCo Thus, the Bell theorem is usually retained to prove the nonlocality of quantum mechanics. If one considers the more ancient proofs of this theorem, one sees that they essentially show that locality implies inequalities which are not consistent with quantum mechanics. In all these proofs, empirical physical laws are assumed as true for physical objects in non-accessible physical A NEW QU ANTUM PHILOSOPHY FOR QM 133 contexts (see Section 7, (iv)), which implies a violation of MGP. This is apparent, for instance, in the Wigner [39] and Sakurai [35] proofs, where many subsets of physical objects are considered, each set consisting of physical objects having the same properties (for physicists: spin components along different directions), and then an inequality is obtained by applying a quantum law (again for physicists: perfect correlation in the singlet state) to these objects. Indeed, the properties characterizing some sets are semantically compatible but pragmatically non-compatible, which prohibits the use of quantum laws according to MGP in these cases. In other proofs, as the original Bell's one, an inequality is obtained regarding sets of physical objects in non-accessible physical contexts, but the deduction does not introduce quantum laws (we reserve the name Bell inequality in the following to Bell-type inequalities obtained in this way). But quantum laws are then used whenever the Bell inequality is compared with an analogous inequality predicted by quantum mechanics. Thus, one can conclude that none of these proofs can be accepted according to Semantic Realism. More recent proofs do not make resort to inequalities, but directly show that, joining R, LOC and quantum mechanics, one obtains contradictory predictions (e.g., Greenberger et al., [24]). In these proofs, a number of empirical laws are deduced from a general theoretical quantum law, and it is implicitly assumed that they all are simultaneously true. But whenever one of these empirical laws is assumed to hold, the properties predicted by it are easily seen to be pragmatically noncompatible with the properties that one introduces further as premises in order to deduce predictions from another law of the set. Thus, one produces again a non accessible physical context, in which the validity of the latter law cannot be assured, because of MGP. One concludes that neither of these proofs can be accepted according to Semantic Realism. Let us come to the Bell-Kochen-Specker theorem. As we have seen in the Introduction, this theorem is usually retained to prove the contextuality of quantum mechanics, which can now be interpreted in the sense that the truth value of a statement attributing a property that belongs to a set of properties that are measured on a physical object depends on the whole set, not only on the property itself. It is evident that the fundamental quantum doctrine discussed in the Introduction follows at once from contextuality, and we remind that the relevance of the Bell-Kochen-Specker theorem consists, seemingly, in showing that this feature of quantum mechanics is imposed by the internal structure of quantum mechanics itself, so that it needs not be formulated as an a priori epistemological constraint. 134 CLA UDIO GAROLA The original proofs of the Bell-Kochen-Specker theorem [2], [26] were rather complicate, but there are some recent proofs (Mermin, [30]) that are quite simple and immediate. By considering these proofs, it has been shown (Garola and Solombrino, [23]) that they are invalidated, according to Semantic Realism, by the same argument used in order to reject the proofs of the Bell theorem that do not resort to inequalities. Let us discuss this subject more carefully. The core of the Mermin argument consists in stating a set of equations that express theoretical laws connecting quantum observables (i.e., mathematical operators that represent physical quantities that can be actually measured). In each equation the observables are compatible, i.e., they can be measured conjointly: hence, Mermin introduces the "plausible" condition that the measured values of the observables must satisfy the equation itself. If all values are simultaneously prefixed (assumption R) this condition can be applied repeatedly to all equations in the set, concluding that all these equations must be simultaneously satisfied by the measured values of the observables. But, then, Mermin shows that this is impossible. Thus, he deduces that the values of some observabIes may be different if different sets of measurements are performed, hence they cannot be prefixed and quantum mechanics turns out to be a contextual theory (note that this does not introduce contradictions between measurement outcomes and physical laws in standard quantum mechanics, since some of the observables that occur in different laws are non-compatible, which means that only restricted subsets of values can actually be measured). In order to invalidate this reasoning, let us note first that the laws used in the Mermin proofs are so general that they can neither be expressed by means of L nor by means of Le: indeed, they hold for every state of the physical system, so that stating them formally would require quantification on state variables, hence a complete formalization of the language of physics that has been avoided here for the sake of simplicity (see Section 5). However, this does not affect the general remarks on theoretical and empirical laws provided at the end of Section 7 and leading to state MGP. It follows that all empirical laws deduced from the theoretical laws used by Mermin are valid, according to Semantic Realism, within the limits estabilished by MGP, and cannot be applied unrestrictedly. Now, these limits prohibit that all empirical laws that can be deduced from the set of theoretical laws considered in the Mermin proofs be assumed as valid conjointly. Indeed, the fact that there are non-compatible observables in different laws implies that one can deduce empirical laws predicting properties that are not pragmatically compatible, so that assuming some of them as valid creates a non-accessible physical context A NEW QU ANTUM PHILOSOPHY FOR QM 135 in which one cannot maintain that the remaining empirical laws are also valid. Thus, one attains a not conventional conclusion: assuming a set of equations as theoretical laws does not imply, according to Semantic Realism, that the values of the observables that appear in them, which are prefixed, must all satisfy the equations themselves conjointly. Of course, this conclusion contradicts the conclusion obtained by Mermin by using repeatedly his "plausible" condition (which however rephrases a condition imposed by Kochen and Specker themselves in order to prove their theorem [26]), which is sufficient for invalidating the reasoning that leads to the Bell-Kochen-Specker theorem. The main aims of this section are thus attained. One can summarize our results by saying that nonlocality and contextuality of quantum mechanics, that are taken for granted by most physicists concerned with foundational research, do not necessarily hold if Semantic Realism is accepted. This conclusion eliminates, on one hand, a source of paradoxes, which unavoidably appear in the orthodox interpretation and in the alternatives that accept empirical verificationism; on the other hand, it prevents the charge of being inconsistent, that could otherwise be made against the interpretation of quantum mechanics provided by Semantic Realism because of the reasons illustrated at the beginning of this section. 9. C ONCL UDING REMARKS Judging from the results of Section 8, one could believe that the Bell and Bell-Kochen-Specker theorems simply are non-proved statements according to the interpretation of quantum mechanics provided by Semantic Realism. This is certainly true, but rather disconcerting for scholars concerned with the foundations of quantum mechanics, who usually consider these theorems as basic results. However, if one reconsiders the whole subject from an epistemological viewpoint, the aforesaid theorems still play an important role. Indeed, let us consider first the Bell-KochenSpecker theorem. Our invalidation above shows that Semantic Realism rejects, through MGP, the repeated use of Mermin's "plausible" condition. But this use can be seen as a particularization of the MCP principle to the special case under consideration. Therefore, according to Semantic Realism, the Bell-Kochen-Specker theorem can be reformulated as a general statement which asserts that assuming reality (R) and unrestricted validity of physical laws (Mep) together with quantum mechanics leads to inconsistencies. 136 CLA UDIO GAROLA It follows from the above reformulation of the Bell-Kochen-Specker theorem that one has some possible choices if he wants to avoid inconsistencies. For instance, if one considers only the choices that give up one of the three assumptions above, one has three possibilities: (i) modifying quantum mechanics, preserving Rand MCP; (ii) giving up R, preserving quantum mechanics and MCP; (iii) giving up MCP, preserving quantum mechanics and R. Choice (i) is made by those physicists who want to restore a basically classical conception of the world. Choice (ii) is made by the standard interpretation of quantum mechanics (note that this choice does not transgress the operational philosophy of quantum mechanics, since giving up R prohibits producing non-accessible physical contexts, hence the distinction between MCP and MGP becomes irrelevant). Choice (iii) is made by Semantic Realism (with some changes in the interpretation of quantum mechanics, see in particular Section 6, (viii)), which criticizes (i) as an impossible attempt of recovering old intuitive models in physics, and (ii) since abandoning R creates a number of conceptual difficulties and "paradoxes". However, we insist on the fact that the appearance of alternative (iii) is due to the Bell-Kochen-Specker theorem, if restated as above, and in this sense this theorem is still decisive (it is interesting to note that even recent attempts at modifying quantum mechanics in order to recover some forms of realism at the macroscopic level essentially deny R at the microscopic level, and possibility (iii) is ignored). Consider now the Bell theorem. We remind that physicists usually think that the Bell inequalities provide a method for testing experimentally whether quantum mechanics or locality (LOC) is correct. But the invalidation of the Bell theorem implies that quantum mechanics does not necessarily conflict with LOC. Then, a crucial question arises: what would happen if one should perform a suitable test of a given Bell inequality? Would the inequality be violated or not? The answer of Semantic Realism is simple but not trivial. In fact, Semantic Realism maintains that a Bell inequality is a correct theoretical formula which is not epistemically accessible in quantum mechanics. Any physical experiment actually tests something else (correlations among properties of physical objects in accessible contexts) and obviously yields the results predicted by quantum mechanics. No contradiction can actually occur, since the inequalities that can be tested in quantum mechanics could be identified with Bell inequalities only assuming the unrestricted validity of some physical laws, i.e., violating MGP. Thus, it is untrue that a Bell inequality provides a method for testing experimentally whether either quantum mechanics or LOC is correct. But the large number of experimental proofs that confirm the predictions of quantum mechanics proves A NEW QU ANTUM PHILOSOPHY FOR QM 137 indirectly that something must go wrong with quantum laws regarding compound systems within non-accessible physical contexts: otherwise, quantum mechanical inequalities would be reduced to Bell inequalities. The above remarks on the Bell and Bell-Kochen Specker theorems clearly suggest that a physical theory could exist which goes beyond quantum mechanics (which is an incomplete theory according to Semantic Realism, see Section 7, (v)) without contradicting it. Furthermore, Semantic Realism also supplies two interesting suggestions for a theory of this kind. First, the interpretation of the supports of the second kind states as theoretical properties (see Section 6, (ix)) shows that the quantum treatment of compound systems is semantically ambiguous. Indeed, in this treatment the support of any pure state is represented by a mathematical entity (for physicists: a one-dimensional projection) which should then correspond to a (first order) testable property in the case of a first kind state, to a (first order) theoretical property in the case of a second kind state. This suggests that a new theory should represent testable and theoretical properties by means of different mathematical entities, so that one can distinguish the former from the latter (or it should at least provide mathematical rules for distinguishing within the formalism the projections that represent physical apparatuses from the projections which do not). Second, the invalidation of the Bell theorem and the interpretation provided by Semantic Realism of the experimental results which confirm the quantum predictions suggest that, contrary to a widespread belief, local hidden variables models for quantum mechanics may exist (see Section 4): but these should be such to propose new laws that differ from quantum mechanical laws within non-accessible physical contexts, i.e., physical contexts which are only indirectly accessible by means of correlation measurements that essentially test physical properties of the second or third order (see Section 6, (v)). We stress that this does not mean that one must expect violations of quantum predictions whenever correlation measurements are performed: rather, one expects that quantum predictions be fulfilled, and that the new laws explain why quantum inequalities differ from Bell inequalities. To conclude with, it is interesting to note that Semantic Realism also proves to avoid the old Furry [12], [13] and Bohm-Aharonov [4] quantum paradoxes. These results are obtained by using the distinction between states and effects, and the distinction between semantic and pragmatic compatibility, respectively, and no use is required of the MGP principle (see Garola and Solombrino, [23]). 138 CLAUDIO GAROLA AFFILIATION Claudio Carola Dipartimento di Fisica dell' Universita Lecce, Italy [email protected] REFERENCES [1] Bell, J.S., "On the Einstein Podolsky Rosen paradox", Physics, 1, 1964, p. 195. [2] Bell, J .S., "On the Problem of hidden variables in quantum mechanics", Rev. Mod. Phys., 38, 1966, p. 447. [3] Bohm, D., Quantum Theory, Prentice Hall, Englewood Cliffs (N.J.), 1951. [4] Bohrn, D. and Aharonov, Y., "Discussion of experimental Proofs for the paradox of Einstein, Rosen, and Podolsky", Phys. Rev., 108, 1957, p. 1070. [5] Bohr, N., "Can Quantum Mechanical Description of Reality be Considered Complete?", Phys. Rev., 48, 1935, p. 696. [6] Braithwaite, R.B., Scientific Explanation, Cambridge University Press, Cambridge, 1953. [7] Busch, P., Lahti, P.J., and Mittelstaedt, P., The Quantum Theory of measurement, Springer, Berlin, 1991. [8] Dalla Pozza, C. and Garola, C., "A Pragmatic Interpretation of Intuitionistic Propositional logic" , Erkenntnis, 43, 1995, p. 8l. [9] Einstein, A., Podolsky, B., and Rosen, N., "Can quantum mechanical description of reality be considered complete?" , Phys. Rev., 47, 1935, p. 777. [10] Finkelstein, D., "Matter, space and logic", in: Hooker, C. A. (ed.), The Logico-Algebraic Approach to quantum mechanics, Vol II, Reidel, Dordrecht, 1979. [11] Finkelstein, D., "The physics of logic", in: Hooker, C. A. (ed.), The Logico-Algebraic Approach to quantum mechanics, Vol II, Reidel, Dordrecht, 1979. [12]Furry, W.H., "Note on the quantum mechanical theory of measurement", Phys. Rev., 49, 1936, p. 393. [13] Furry, W.H., "Remarks on measurements in quantum theory", Phys. Rev., 49, 1936, p. 476. A NEW QU ANTUM PHILOS OPHY FOR QM 139 [14] Garola, C., "Embedding of posets into lattices in quantum logic", Int. Journ. of Theor. Phys., 24, 1985, p. 423. [15] Garola, C., "classical foundations of quantum logic", Int. Journ. of Theor. Phys., 30, 1991, p. 1. [16] Garola, C., "Semantic incompleteness of quantum physics", Int. Journ. of Theor. Phys., 31, 1992, p. 809. [17] Garola, C., "Truth versus testability in quantum logic", Erkenntnis, 37, 1992, p. 197. [18] Garola, C., "Reconciling local realism and quantum physics: a critique to Bell", Teoreticheskaya i Matematicheskaya Fizika, 99, 1994, p.285. [19] Garola, C., "Criticizing Bell: Local realism and quantum physics reconciled", Int. Journ. of Theor. Phys., 34, 1995, p. 269. [20] Garola, C., "An operational Critique to Bell's Theorem" , in: Garola, C. and Rossi, A. (eds.), The Foundations of quantum mechanics. Historical Analysis and Open Questions, Kluwer Academic Publishers, Dordrecht, 1995. [21] Garola, C., "Pragmatic versus semantic contextuality in quantum physics", Int. Journ. of Theor. Phys., 34, 1995, p. 1383. [22] Garola, C. and Solombrino, L., "The theoretical apparatus of semantic realism: A new language for classical and quantum physics", Found. of Phys., 26, 1996, p. 1121. [23] Garola, C. and Solombrino, L., "Semantic realism versus EPRlike paradoxes: the Furry, Bohm-Aharonov and Bell paradoxes", Found. of Phys., 26, 1996, p. 1329. [24] Greenberger, D.M., Horne, M.A., Shimony A., and Zeilinger, A., "Bell's theorem without Inequalities", Am. Journ. of Phys., 58, 1990, p. 1131. [25] Hempel, C.C., Aspects of Scientific Explanation, Free Press, New York, 1965. [26] Kochen, S. and Specker, E.P., "The problem of hidden variables in quantum mechanics", Journ. of Math. Mech., 17, 1967, p. 59. [27] Jauch, J.M., Foundations of quantum mechanics, Addison-Wesley, Reading (Mass.), 1968. [28] Ludwig, G., Foundations of quantum mechanics I, Springer Verlag, New York, 1983. [29] Mackey, G.W., The Mathematical Foundations of quantum mechanics, Benjamin, New York, 1963. 140 CLAUDIO GAROLA [30] Mermin, N.D., "Hidden variables and the two theorems of John Bell", Reviews of Modern Physics, 65, 1993, p. 803. [31] Piron, C., Foundations of Quantum Physics, Benjamin, Reading, (Mass.), 1976. [32] Popper, K.R., Conjectures and Refutations, Routledge and Kegan Paul, London, 1969. [33] Putnam, H., "Is logic empirical?", in: Hooker, C.A. (ed.), The Logico-Algebraic Approach to quantum mechanics, Vol II, Reidel, Dordrecht, 1979. [34] Russell, B., An Inquiry into Meaning and Truth, Allen & Unwin, London, 1940. [35] Sakurai, J.J., Modern quantum mechanics, W.A. Benjamin, Reading (Mass.), 1985. [36] Selleri, F., "Even local probabilities lead to the paradox", in: Selleri, F. (ed.), quantum mechanics Versus Local Realism, Plenum Press, New York, 1988. [37] Tarski, A., Logic, Semantics, Metamathematics, Oxford University Press, Oxford, 1956. [38] Tarski, A., "The semantic conception of truth and the foundations of semantics", in: Linsky, L. (ed.), Semantics and the Philosophy of Language, University of Illinois Press, Urbana, 1952. [39] Wigner, E.P., "On hidden variables and quantum mechanical probabilities", Am. lourn. of Phys., 38, 1970, p. 1005. DIEDERIK AERTS QUANTUM MECHANICS: STRUCTURES, AXIOMS AND PARADOXES 1. INTRODUCTION In this article we present an analysis of quantum mechanics and its problems and paradoxes taking into account some of the results and insights that have been obtained during the last two decades by investigations that are commonly classified in the field of 'quantum structures research'. We will concentrate on these aspects of quantum mechanics that have been investigated in our group at Brussels Free Universityl. We try to be as clear and self contained as possible: firstly because the article is also aimed at scientists not specialized in quantum mechanics, and secondly because we believe that some of the results and insights that we have obtained present the deep problems of quantum mechanics in a simple way. The study of the structure of quantum mechanics is almost as old as quantum mechanics itself. The fact that the two early versions of quantum mechanics-the matrix mechanics of Werner Heisenberg and the wave mechanics of Erwin Schrodinger-were shown to be structurally equivalent to what is now called standard quantum mechanics, made it already clear in the early days that the study of the structure itself would be very important. The foundations of much of this structure are already present in the book of John von Neumann [1], and if we refer to standard quantum mechanics we mean the formulation of the theory as it was first presented there in a complete way. Standard quantum mechanics makes use of a sophisticated mathematical apparatus, and this is one of the reasons that it is not easy to explain it to a non specialist audience. Upon reflecting how we would resolve this 'presentation' problem for this paper we have chosen the following approach: most, if not all, deep quantum mechanical problems appear already in full, for the case of the 'most simple' of all quantum models, namely the model for the spin of a spin~ quantum particle. Therefore we have chosen to present the technical aspects of this paper as much as possible for the description of this most simple quantum 1 The actual members of our group are: Diederik Aerts, Bob Coecke, Thomas Durt, Sven Aerts, Frank Valckenborgh, Bart D'Hooghe and Bart Van Steirteghem. 141 © 1999 Kluwer Academic Publishers. 142 DIEDERIK AERTS model, and to expose the problems by making use of its quantum mechanical and quantum structural description. The advantage is that the structure needed to explain the spin model is simple and only requires a high school background in mathematics. The study of quantum structures has been motivated mainly by two types of shortcomings of standard quantum mechanics. (1) There is no straightforward physical justification for the use of the abstract mathematical apparatus of quantum mechanics. By introducing an axiomatic approach the mathematical apparatus of standard quantum mechanics can be derived from more general structures that can be based more easily on physical concepts (2) Almost none of the mathematical concepts used in standard quantum mechanics are operationally defined. As a consequence there has also been a great effort to elaborate an operational foundation. 2. QU ANTUM STR UCTURES AND QU ANTUM LOGIC Relativity theory, formulated in great part by one person, Albert Einstein, is founded on the concept of 'event', which is a concept that is physically well defined and understood. Within relativity theory itself, the events are represented by the points of a four dimensional space-time continuum. In this way, relativity theory has a well defined physical and mathematical base. quantum mechanics on the contrary was born in a very obscure way. Matrix mechanics was constructed by Werner Heisenberg in a mainly technical effort to explain and describe the energy spectrum of the atoms. Wave mechanics, elaborated by Erwin Schrodinger, seemed to have a more solid physical base: a general idea of wave-particle duality, in the spirit of Louis de Broglie or Niels Bohr. But then Paul Adrien Maurice Dirac and later John von Neumann proved that the matrix mechanics of Heisenberg and the wave mechanics of Schrodinger are equivalent: they can be constructed as two mathematical representations of one and the same vector space, the Hilbert space. This fundamental result indicated already that the 'de Broglie wave' and the 'Bohr wave' are not physical waves and that the state of a quantum entity is an abstract concept: a vector in an abstract vector space. Referring again to what we mentioned to be the two main reasons for studying quantum structures, we can state now more clearly: the study of quantum structures has as primary goal the elaboration of quantum mechanics with a physical and mathematical base that is as clear as the one that exists in relativity theory. We remark that the initial aim of quantum structures research was not to 'change' the theory-although STRUCTURES AND PARADOXES 143 it ultimately proposes a fundamental change of the standard theory as will be outlined in this paper-but to elaborate a clear and well defined base for it. For this purpose it is necessary to introduce clear and physically well defined basic concepts, like the events in the theory of relativity, and to identify the mathematiCal structure that these basic concepts have to form to be able to recover standard quantum mechanics. In 1936, Garret Birkhoff and John von Neumann, wrote an article entitled "The logic of quantum mechanics". They show that if one introduces the concept of 'operational proposition' and its representation in standard quantum mechanics by an orthogonal projection operator of the Hilbert space, it can be shown that the set of the 'experimental propositions' does not form a Boolean algebra, as it the case for the set of propositions of classical logic [2]. As a consequence of this article the field called 'quantum logic' came into existence: an investigation on the logic of quantum mechanics. An interesting idea was brought forward. Relativity theory is a theory based on the concept of 'event' and a mathematical structure of a four dimensional space-time continuum. This space-time continuum contains a non Euclidean geometry. Could it be that the article of Birkhoff and von Neumann indicates that quantum mechanics should be based on a non Boolean logic in the same sense as relativity theory is based on a non Euclidean geometry? This is a fascinating idea, because if quantum mechanics were based on a non Boolean logic, this would perhaps explain why paradoxes are so abundant in quantum mechanics: the paradoxes would then arise because classical Boolean logic is used to analyze a situation that intrinsically incorporates a non classical, non Boolean logic. Following this idea quantum logic was developed as a new logic and also as a detailed study of the logico-algebraic structures that are contained in the mathematical apparatus of quantum mechanics. The systematic study of the logico-algebraic structures related to quantum mechanics was very fruitful and we refer to the paper that David Foulis published in this book for a good historical account [3]. On the philosophical question of whether quantum logic constitutes a fundamental new logic for nature a debate started. A good overview of this discussion can be found in the book by Max Jammer [4]. We want to put forward our own personal opinion about this matter and explain why the word 'quantum logic' was not the best word to choose to indicate the scientific activity that has been taking place within this field. If 'logic', following the characterization of Boole, is the formalization of the 'process of our reflection', then quantum logic is not 144 DIEDERIK AERTS a new logic. Indeed, we obviously reflect following the same formal rules whether we reflect about classical parts of reality or whether we reflect about quantum parts of reality. Birkhoff and von Neumann, when they wrote their article in 1936, were already aware of this, and that is why they introduced the concept of 'experimental proposition'. It could indeed be that, even if we reason within the same formal structure about quantum entities as we do about classical entities, the structure of the 'experimental propositions' that we can use are different in both cases. With experimental proposition is meant a proposition that is connected in a well defined way with an experiment that can test this proposition. We will explicitly see later in this paper that there is some truth in this idea. Indeed, the set of experimental propositions connected to a quantum entity has a different structure than the set of experimental propositions connected to a classical entity. We believe however that this difference in structure of the sets of experimental propositions is only a little piece of the problem, and even not the most important one2 . It is our opinion that the difference between the logico-algebraic structures connected to a quantum entity and the logico-algebraic structures connected to a classical entity is due to the fact that the structures of our 'possibilities of active experimenting' with these entities is different. Not only the logical aspects of these possibilities of active experimenting but the profound nature of these possibilities of active experimenting is different. And this is not a subjective matter due to, for example, our incapacity of experimenting actively in the same way with a quantum entity as with a classical entity. It is the profound difference in nature of the quantum entity that is at the origin of the fact that the structure of our possibilities of active experimenting with this entity is different 3 . We could proceed now by trying to explain in great generality what we mean with this statement and we refer the reader to [6] for such a pre2 We can easily show for example that even the set of experimental propositions of a macroscopic entity does not necessarily have the structure of a Boolean algebra. This means that the only fact of limiting oneself to the description of the set of 'experimental' propositions already brings us out of the category of Boolean structures, whether the studied entities are microscopic or macroscopic [5] 3 We have to remark here that we do not believe that the set of quantum entities and the set of classical entities correspond respectively to the set of microscopic entities and the set of macroscopic entities as is usually thought. On the contrary, we believe, and this will become clear step by step in the paper that we present here, that a quantum entity should best be characterized by the nature of the structure of the possibilities of experimentation on it. In this sense classical entities show themselves to be special types of quantum entities, where this structure, due to the nature of the entity itself, takes a special form. But, as we will show in the paper, there exists macroscopic real physical entities with a quantum structure. STRUCTURES AND PARADOXES 145 sentation. In this paper we will explain what we mean mostly by means of a simple example. 3. THE E X AMP L E: THE QUA N TUM MAC H I N E As we have stated in the introduction, we will analyze the problems of quantum mechanics by means of simple models. The first model that we will introduce has been proposed at earlier occasions (see [5], [7], [8] and [9]) and we have named it the 'quantum machine'. It will turn out to be a real macroscopic mechanical model for the spin of a spin! quantum entity. We will only need the mathematics of high school level to introduce it. This means that alsothe readers that are not acquainted with the sophisticated mathematics of general quantum mechanics can follow all the calculations, only needing to refresh perhaps some of the old high school mathematics. We will introduce the example of the quantum machine model step by step, and before we do this we need to explain shortly how we represent our ordinary three dimensional space by means of a real three dimensional vector space. 3.1. The Mathematical Representation of Three Dimensional Space We can represent the three dimensional Euclidean space, that is also the space in which we live and in which our classical macroscopic reality exists, mathematically by means of a three dimensional real vector space denoted ]R3. We do this by choosing a fixed origin 0 of space and representing each point P of space as a vector v with begin-point 0 and end-point P (see Fig 1). Such a vector v has a direction, indicated by the arrow, and a length, which is the length of the distance from 0 to the point P. We denote the length of the vector v by Ivl. Fig 1 : A mathematical representation of the three dimensional Euclidean space by means of a three dimensional real vector space. We choose a fixed point 0 which is the origin. For an arbitrary point P we define the vector v that represents the point. i) The sum of vectors We introduce operations that can be performed with these vectors that indicate points of our space. For example the sum of two vectors v, W E ]R3 representing two points P and Q is defined by means of the parallelogram rule (see Fig 2). It is denoted by v + wand is again a vector of ]R3. 146 DIEDERIK AERTS P+Q Fig 2 : A representation of the sum of two vectors v and w, denoted by v+w. v o ii) Multiplication of a vector by a real number We can also define the multiplication of a vector v E lR 3 by a real number r E lR, denoted by rv. It is again a vector of lR 3 with the same direction and the same origin 0 and with length given by the original length of the vector multiplied by the real number r (see Fig 3). rP Fig 3 : A representation of the product of a vector v with a real number r, denoted by rv. rv P v o iii) The inproduct of two vectors We can also define what is called the inproduct of two vectors v and w, denoted by < v, w >, as shown in Fig 4. Q .. Ivlcosy ". w Fig 4 : A representation of the inproduct of two vectors v and w, denoted by <v,w>. P Y v 0 It is the real number that is given by the length of vector v multiplied by the length of vector w, multiplied by the cosine of the angle between the two vectors v and w. Hence < v, w > = Iv II w I cos r (1) where r is the angle between the vectors v and w (see Fig 4). By means of this inproduct it is possible to express some important geometric properties of space. For example: the inproduct < v, w > of STRUCTURES AND PARADOXES 147 two non-zero vectors equals zero iff the two vectors are orthogonal to each other. On the other hand there is also a straightforward relation between the inproduct of a vector with itself and the length of this vector < V,v >= Ivl 2 (2) iv) An orthonormal base of the vector space For each finite dimensional vector space with an inproduct it is possible to define an orthonormal base. For our case of the three dimensional real vector space that we use to describe the points of the three dimensional Euclidean space it is a set of three orthogonal vectors with the length of each vector equal to 1 (see Fig 5). V3 h3 h, ,·····················1 =,h,+"h,+"h, o V2 h2 ........... h2 ···!··········7 i ..."" .. Ih~: ..·. ~L. ..................·::::·..:~t2h .... v Fig 5 : An orthonormal base {hl,h2,h3} of the three dimen sional real vector space. We can write an arbitrary vector vasa sum of the three base vectors multiplied respectively by real numbers VI ,V2 and V3. These numbers are called the Cartesian coordinates of the vector v for the orthonormal base {hl,h 2,h3}. Hence the set {hI, h 2 , h3} is an orthonormal base of our vector space iff < hI, hI >= 1 < hl,h2 >= 0 < hI, h3 >= 0 < h 2 , hI >= 0 < h2, h2 >= 1 < h2, h3 >= 0 < h3,hl >= 0 < h3,h2 >= 0 < h 3, h3 >= 1 ]R3 (3) We can write each vector v E ]R3 as the sum of these base vectors respectively multiplied by real numbers VI, V2 and V3 (see Fig 5). Hence: (4) The numbers VI, V2 and V3 are called the Cartesian coordinates4 of the vector V for the orthonormal base {hI, h 2 , h3}. 4 It was Rene Descartes who introduced this mathematical representation of our three dimensional Euclidean space. 148 DIED ERIK AERTS v} The Cartesian representation of space As we have fixed the origin of our vector space we can also fix one specific orthonormal base, for example the base {hI,h2,h3}, and decide to express each vector v by means of the Cartesian coordinates with respect to this fixed base. We will refer to such a fixed base as a Cartesian base. Instead of writing v = vIh i + V2h2 + V3h3 it is common practice to write v = (VI, V2, V3), only denoting the three Cartesian coordinates and not the Cartesian base, which is fixed now anyway. As a logical consequence we denote hI = (1,0,0), h2 = (0,1,0) and h3 = (0,0,1). It is an easy exercise to show that the addition, multiplication with a real number and the inproduct of vectors are given by the following formulas. Suppose that we consider two vectors v = (VI,V2,V3) and W = ( WI, W2, W3) and a real number r, then we have: ° v + W = (VI + WI, V2 + W2, V3 + W3) rv = (rVI, rV2, rV3) < v, W >= VIWI + V2W2 + V3W3 (5) These are very simple mathematical formulas. The addition of vectors is just the addition of the Cartesian coordinates of these vectors, the multiplication with a real number is just the multiplication of the Cartesian coordinates with this number, and the inproduct of vectors is just the sum of one by one products of the Cartesian coordinates. This is one of the reasons why the Cartesian representation of the points of space is very powerful. vi} The representation of space by means of spherical coordinates It is possible to introduce many systems of coordination of space. We will in the following use one of these other systems: the spherical coordinate system. We show the spherical coordinates p, () and ¢ of a point P with Cartesian coordinates (VI, V2, V3) in Figure 6. We have the following well known and easy to verify relations between the two sets of coordinates (see Fig 6). VI = P sin () cos ¢ V2 = P sin () sin ¢ V3 = pcos() (6) STRUCTURES AND PARADOXES 149 Fig. 6 : A point P with Cartesian coordinates Vl,V2 and V3, and spherical coordinates p,O and ¢. 3.2. The States of the Quantum Machine Entity We have introduced in the foregoing section some elementary mathematics necessary to handle the quantum machine entity. First we will define the possible states of the entity and then the experiments we can perform on the entity. The quantum machine entity is a point particle P in three dimensional Euclidean space that we represent by a vector v. The states of the quantum machine entity are the different possible places where this point particle can be, namely inside or on the surface of a spherical ball (that we will denote by ball) with radius 1 and center 0 (Fig 7)5. Let us denote the set of states of the quantum machine entity by ~cq 6 . We will denote a specific state corresponding to the point particle being in the place indicated by the vector v by the symbol PV. SO we have: (7) 5 In the earlier presentations of the quantum machine [5], [7], [8] and [9], we only considered the points on the surface of the sphere to be the possible states of the quantum machine entity. If we want to find a model that is strictly equivalent to the spin model for the spin~ of a quantum entity, this is what we have to do. We will however see that it is fruitful, in relation with a possible solution of one of the quantum paradoxes, to introduce a slightly more general model for the quantum machine and also allow points of the interior of the sphere to represent states (see also [6] for the representation of this more general quantum machine) 6 The subscript cq stands for 'completed quantum mechanics'. We will see indeed in the following that the interior points of ball do not correspond to vector states of standard quantum mechanics. If we add them anyhow to the set of possible states, as we will do here, we present a completed version of standard quantum mechanics. We will come back to this point in detail in the following sections. 150 DIEDERIK AERTS Let us now explain in which way we interact, by means of experiments, with this quantum machine entity. As we have defined the states it would seem that we can 'know' these states just by localizing the point inside the sphere (by means of a camera and a picture for example, or even just by looking at the point). This however is not the case. We will define very specific experiments that are the 'only' ones at our disposal to find out 'where' the point is. In this sense it would have been more appropriate to define first the experiments and afterwards the states of the quantum machine entity. We will see that this is the way that we quantum entity 7. will proceed when we introduce the spin of a spin~ Fig. 7 : The quantum machine entity P with its vector representation v, its cartesian coordinates x,y and z, and its spherical coordinates p,() and <p. 3.3. The Experiments of the Quantum Machine Let us now introduce the experiments. To do this we consider the point u and the diametrically opposite point -u of the surface of the sphere ball. We install an elastic strip (e.g., a rubber band) of 2 units of length, such that it is fixed with one of its end-points in u and the other end-point in -u (Fig 8,a). As we have explained, the state Pv represents the point particle P located in the point v. We will limit ourselves in this first introduction of the quantum machine to states Pv where v = 1 and hence P is on the surface of the sphere. Later we will treat the general case. Let us now describe the experiment. Once the elastic is installed, the particle P falls from its original place v orthogonally onto the elastic, and sticks to it 7 It is essential for the reader to understand this point. In our model we have defined the states of the quantum machine entity, but actually there is no camera available to 'see' these states. The only experiments available are the ones that we will introduce now. STRUCTURES AND PARADOXES 151 (Fig 8,b). Then, the elastic breaks at some arbitrary point. Consequently the particle P, attached to one of the two pieces of the elastic (Fig 8,c), is pulled to one of the two end-points u or -u (Fig 8,d). Now, depending on whether the particle P arrives in u (as in Fig 8) or in -u, we give the outcome 01 or 02 to the experiment. We will denote this experiment by the symbol eu and the set of experiments connected to the quantum machine by Ceq. Hence we have: feq = {e u I u E ~3, u ~ (e) '-u lui = I} (8) Fig. 8 : A representation of the quantum machine. In(a) the particle P is in state pv, and the elastic corresponding to the experiment eu is installed between the two diametrically opposed points u and -u. In (b) the particle P falls orthogonally onto the elastic and sticks to it. In (c) elastic breaks and the particle P is pulled towards the point u, such that (d) it arrives at point u, and the experiment e u gets the outcome 01. Fig. 9 : A representation of the experimental process in the plane where it takes place. The elastic of length 2, corresponding to the experiment e u , is installed between u and -u. The probability, J.L(eu,pv,ol), that the particle P ends up in point u under influence of the experiment e" is given by the length of the piece of elastic L1 divided by the total length of the elastic. The probability, J.L(e u ,pv,02), that the particle P ends up in point -u is given by the length of the piece of elastic L2 divided by the total length of the elastic. 3.4. The Transition Probabilities of the Quantum Machine Let us now calculate the probabilities that are involved in these experiments such that we will be able to show later that they equal the quantum probabilities connected to the quantum experiments on a spin~ quantum particle. 152 DIED ERIK AERTS The probabilities are easily calculated. The probability, J-L(eu,pv, 01), that the particle P ends up in point u and hence experiment eu gives outcome 01, when the quantum machine entity is in state Pv, is given by the length of the piece of elastic L1 divided by the total length of the elastic (Fig 9). The probability, J-L(eu,pv, 02), that the particle P ends up in point -u, and hence experiment eu gives outcome 02, when the quantum machine entity is in state Pv, is given by the length of the piece of elastic L2 divided by the total length of the elastic. This gives us: 1 -L1 = -(1 + cos'Y) = cos2 -'Y 222 L2 1 . 2 'Y - = - (1 - cos 'Y) = sm 2 2 2 (9) (10) As we will see in next section, these are exactly the probabilities related to the spin experiments on the spin of a spin! quantum particle. 4. THE S PIN 0 F ASP I N ~ QUA N TUM PAR TIC L E Let us now describe the spin of a spin! quantum particle so that we can show that our quantum machine is equivalent to it. Here we will proceed the other way around and first describe in which way this spin manifests itself experimentally. 4.1. The Experimental Manifestation of the Spin The experiment showing the first time the property of spin for a quantum particle was the one by Stern and Gerlach [10], and the experimental apparatus involved is called a Stern-Gerlach apparatus. It essentially consists of a magnetic field with a strong gradient, oriented in a particular direction u of space (Fig 10). u A. direction of I,' the gradient I of the magnetic field outgoing beam arrival of the beam of quantum particles Fig 10: The Stern-Gerlach apparatus measuring the spin of a spin! quantum particle. The particle beam comes in from the left and passes through a magnetic field with a strong gradient in the u direction. The beam is split in two, one goes up, absorbed by a plate, and one down, passing through. STRUCTURES AND PARADOXES 153 A beam of quantum particles of spin ~ is directed through the magnet following a path orthogonal to the direction of the gradient of the magnetic field. The magnetic field splits the beam into two distinct beams which is a very unexpected phenomenon if the situation would be analysed by classical physics. One beam travels upwards in the direction of the gradient of the magnetic field and one downwards. The beam directed downwards is absorbed by a plate that covers this downwards part behind the magnetic field in the regions where the beams emerge. The upwards beam passes through and out of the Stern-Gerlach apparatus. We will denote such a Stern-Gerlach experiment by fu where u refers to the direction of space of the gradient of the magnetic field. If we consider this experiment being performed on one single quantum particle of spin ~, then it has two possible outcomes: (1) the particle is deflected upwards and passes through the apparatus, let us denote this outcome 01, (2) the particle is deflected downwards, and is absorbed by the plate, lets denote this outcome 02. The set of experiments that we will consider for this one quantum particle is the set of all possible SternGerlach experiments, for each direction u of space, which we denote by £89 8 . Hence we have: £89 = {Ju I u a direction in space} 4.2. The Spin States of a Spin~ (11) Quantum Particle Suppose that we have made a Stern-Gerlach experiment as explained and we follow the beam of particles that emerges. Suppose that we perform a second Stern-Gerlach experiment on this beam, with the gradient of the magnetic field in the same direction u as in the first preparing SternGerlach apparatus. We will then see that now the emerging beam is not divided into two beams. All particles of the beam are deviated upwards and none of them is absorbed by the plate. This means that the first experiment has 'prepared' the particles of the beam in such a way that it can be predicted that they will all be deviated upwards: in physics we say that the particles have been prepared in a spin state in direction u. Since we can make such a preparation for each direction of space u it follows that the spin of a spin ~ quantum particle can have a spin state connected to any direction of space. So we will have to work out a description of this spin state such that each direction of space corresponds to a spin state. This is exactly what quantum mechanics does and we will now expose this quantum description of the spin states. Since we have 8 The subscript sg stands for Stern-Gerlach. 154 DIED ERIK AERTS chosen to denote the Stern-Gerlach experiments by fu where u refers to the direction of the gradient of the magnetic field, we will use the vector v to indicate the direction of the spin of the quantum particle. Let us denote such a state by qv' This state qv is then connected with the preparation by means of a Stern-Gerlach apparatus that is put in a direction of space v. A particle that emerges from this Stern-Gerlach apparatus and is not absorbed by the plate 'is' in spin state qv' Let us denote the set of all spin states by L: sg . So we have: L:sg = {qv Iv a direction in space} (12) 4.3. The Transition Probabilities of the Spin If we now consider the experimental situation of two Stern-Gerlach apparatuses placed one after the other. The first Stern-Gerlach apparatus, with gradient of the magnetic field in direction v, and a plate that absorbs the particles that are deflected down, prepares particles in spin state qv. The second Stern-Gerlach apparatus is placed in direction of space u and performs the experiment fu. We can then measure the relative frequency of particles being deflected "up" for example. The statistical limit of this relative frequency is the probability that a particle will be deflected upwards. The laboratory results are such that, if we denote by /-L(fu, qv, od the probability that the experiment fu makes the particle deflect upwards-let us denote this as outcome ol-if the spin state is qv, and by /-L(fu, qv, 02) the probability that the experiment fu makes the particle be absorbed by the plate-let us denote this outcome by 02-if the spin state is qv, we have: (13) (14) where'Y is the angle between the two vectors u and v. 4.4. Equivalence of the Quantum Machine with the Spin It is obvious that the quantum machine is a model for the spin of a spin! quantum entity. Indeed we just have to identify the state Pv of the quantum machine entity with the state qv of the spin, and the experiment eu of the quantum machine with the experiment fu of the spin for all directions of space u and v. Then we just have to compare the transition STRUCTURES AND PARADOXES 155 probabilities for the quantum machine derived in formula's 9 and 10 with the ones measured in the laboratory for the Stern-Gerlach experiment 9 . Let us show in the next section that the standard quantum mechanical calculation leads to the same transition probabilities. 5. THE QU ANTUM DES CRIPTION OF THE SPIN We will now explicitly introduce the quantum mechanical description of the spin of a spin~ quantum particle. As mentioned in the introduction, quantum mechanics makes a very abstract use of the vector space structure for the description of the states of the quantum entities. Even for the case of the simplest quantum model, the one we are presenting, the quantum description is rather abstract as will become obvious in the following. As we have mentioned in the introduction, it is the abstract nature of the quantum description that is partly at the origin of the many conceptual difficulties concerning quantum mechanics. For this reason we will introduce again quantum mechanics step by step. 5.1. The Quantum Description of the Spin States quantum mechanics describes the different spin states qu for different directions of space u by the unit vectors of a two dimensional complex vector space, which we will denote by «::::2. This means that the spin states are not described by unit vectors in a three dimensional real space, as it is the case for our quantum machine model, but by unit vectors in a two dimensional complex space. This switch from a three dimensional real space to a two dimensional complex space lies at the origin of many of the mysterious aspects of quantum mechanics. It also touches some deep mathematical correspondences (the relation between SU(2) and SO(3) for example) of which the physical content has not yet been completely unraveled. Before we completely treat the spin states, let us give an elementary description of «::::2 itself. i) The vector space «::::2 «::::2 is a two-dimensional vector space over the field of the complex numbers. This means that each vector of «::::2-in its Cartesian representationis of the form (Zl' Z2) where Zl and Z2 are complex numbers. The addition 9 We do not consider here the states of the quantum machine corresponding to points of the inside of the sphere. They do not correspond to states that we have identified already by means of the Stern-Gerlach experiment. This problem will be analysed in detail further on. 156 DIEDERIK AERTS of vectors and the multiplication of a vector with a complex number are defined as follows: for (Zl, Z2), (tl, t2) E <e 2 and for Z E <e we have: (Zl, Z2) + (h, t2) = (Zl Z(Zl, Z2) = (ZZl, ZZ2) There exists an inproduct on + tl, Z2 + t2) (15) <e 2 : for (Zl, Z2), (tl, t2) E <e 2 we have: < (Zl,Z2),(tl,t2) >= Zrtl +Z2t2 (16) where * is the complex conjugation in <e. We remark that <, > is linear in the second variable and conjugate linear in the first variable. More explicitly this means that for (Zl, Z2), (tl, t2)' (rl, r2) E <e 2 and z, t E <e we have: < Z(Zl, Z2) + t(tl, t2)' (rl, r2) > < (Tl, T2), Z(Zl, Z2) + t(tl, t2) > z* < (Zl, Z2), (rl, r2) > +t* < (tl, t2)' (rl, r2) > (17) = Z < (Tl, r2), (Zl, Z2) > +t < (Tl, T2)' (tl, t2) > (18) = The inproduct defines the 'length' or better 'norm' of a vector (Zl, Z2) E <e 2 as: and it also defines an orthogonality relation on the vectors of <e 2 : we say that two vectors (Zl, Z2), (tl, t2) E <e 2 are orthogonal and write (Zl, Z2) ~ (tl,t2) iff: (20) A unit vector in <e 2 is a vector with norm equal to 1. Since the spin states are represented by these unit vectors we introduce a special symbol U(<e 2 ) to indicate the set of unit vectors of <e 2 • Hence: (21) The vector space has the Cartesian orthonormal base formed by the vectors (1,0) and (0,1) because we can obviously write each vector (Zl, Z2) in the following way: (22) STRUCTURES AND PARADOXES 157 ii) The quantum representation of the spin states Let us now make explicit the quantum mechanical representation of the spin states. The state qv of the spin of a spin~ quantum particle III direction v is represented by the unit vector Cv of U(C 2 ): Cv = (cos2e~,in-) e e·</> .</> = c(e,¢) (23) e where and ¢ are the spherical coordinates of the unit vector v. It is easy to verify that these are unit vectors in ((:2. Indeed, e . e Ic(e, ¢)I = cos2 '2 + sm2 '2 = 1 (24) It is also easy to verify that two quantum states that correspond to opposite vectors v = (1, e, ¢) and -v = (1, 7r - e, ¢ + 7r) are orthogonal. Indeed, a calculation shows (25) This gives us the complete quantum mechanical description of the spin states of the spin of a spin~ quantum particle. 5.2. The Quantum Description of the Spin Experiments We now have to explain how the spin experiments are described in the quantum formalism. As we explained already in detail, a spin experiment fu in the laboratory is executed by means of a Stern-Gerlach apparatus. To describe these experiments quantum mechanically we have to introduce somewhat more advanced mathematics, although still easy to master for those readers who have had no problems till now. So we encourage them to keep up with us. i) Projection operators The first concept that we have to introduce for the description of the experiments lO is that of an operator or matrix. We only have to explain the case we are interested in, namely the case of the two dimensional complex vector space that describes the states of the spin of a spin~ quantum particle. In this case an operator or matrix H consists of four complex numbers H =( Z11 Z21 Z12 ) Z22 (26) 10 We use 'experiment' to indicate the interaction with the physical entity. A measurement in this sense is a special type of experiment. 158 DIED ERIK AERTS that have a well defined operation on the vectors of C 2 : for (Zl' Z2) E C 2 we have H(Zl, Z2) = (81,82) where (27) It is easily verified that this operation is linear, which means that for (Zl' Z2), (h, t2) E C 2 and z, t E C we have: (28) Moreover it can be shown that every linear operation on the vectors of C 2 is represented by a matrix. There is a unit operator I and a zero operator 0 that respectively maps each vector onto itself and on the zero vector (0,0): I=(~ ~) o=(~ ~) (29) The sum of two operators HI and H2 is defined as the operator HI + H2 such that (30) and the product of these two operators is the operator HI . H2 such that (31) We need some additional concepts and also a special set of operators to be able to explain how experiments are described in quantum mechanics. We say that a vector (Zl' Z2) E C 2 is an 'eigenvector' of the operator H iff for some Z E C we have: (32) If (ZI' Z2) is different from (0,0) we say that Z is the eigenvalue of the operator H corresponding to the eigenvector (ZI' Z2)' Let us introduce now the special type of operators that we need to explain how experiments are described by quantum mechanics. A projection operator P is an operator such that p2 = P and such that for (ZI, Z2), (tl' t2) E C 2 we have: (33) We remark that a projection operator, as we have defined it here, is sometimes called an orthogonal projection operator. Let us denote the set of all projection operators by .C(C 2 ). STRUCTURES AND PARADOXES 159 If P E £((:2) then also 1 - P E £((:2). Indeed (I - P)(I - P) = 1 - PP + p2 = 1 - P - P + P = 1 - P. We can show that the eigenvalues of a projection operator are 0 or 1. Indeed, suppose that Z E (: is such an eigenvalue of P with eigenvector (Zl, Z2) #- (0,0). We then have P(Zl, Z2) = Z(Zl, Z2) = p 2(Zl, Z2) = Z2(Zl, Z2). From this follows that Z2 = Z and hence Z = 1 or Z = 0. Let us now investigate what the possible forms of projection operators are in our case ([:2. We consider the set {P, I - P} for an arbitrary P E £(([:2). i) Suppose that P(Zl,Z2) = (0,0) V (Zl,Z2) E ([:2. Then P = 0 and 1 - P = 1. This means that this situation is uniquely represented by the set {O,I}. ii) Suppose that there exists an element (Zl, Z2) E (:2 such that P(Zl, Z2) (0,0). Then P(P(Zl' Z2)) = P(Zl, Z2) which shows that P(Zl' Z2) is an eigenvector of P with eigenvalue 1. Further we also have that (I - P)(P(Zl' Z2)) = 0 which shows that P(Zl' Z2) is an eigenvector of 1 - P with eigenvalue O. If we further suppose that (I - P)(Y1, Y2) = o V (Yl, Y2) E (:2, then we have P = I and the situation is represented by the set {I,O}. i= iii) Let us now suppose that there exists a (Zl' Z2) as in ii) and an element (Y1, Y2) E (:2 such that (I - P)(Y1, Y2) =f. O. An analogous reasoning then shows that (I - P)(Y1, Y2) is an eigenvector of I - P with eigenvalue 1 and an eigenvector of P with eigenvalue O. Furthermore we have: < P(Zl' Z2), (I - P)(Y1, Y2) > < (Zl' Z2), P(I - P)(Y1, Y2) > = = (34) 0 which shows that P(Zl' Z2) -.l (I - P)(Yl, Y2). This also proves that P(Zl,Z2) and (I - P)(Y1,Y2) form an orthogonal base of <e 2 , and P is in fact the projection on the one dimensional subspace generated by P(Zl' Z2), while 1- P is the projection on the one dimensional subspace generated by (1 - P)(Y1, Y2). These three cases i), ii) and iii) cover all the possibilities, and we have now introduced all the elements necessary to explain how experiments are described in quantum mechanics. ii) The quantum representation of the measurements An experiment e in quantum mechanics, with two possible outcomes 01 and 02, in the case where the states are represented by the unit vectors 160 DIEDERIK AERTS of the two dimensional complex Hilbert space ((:2, is represented by such a set {P, I - P} of projection operators satisfying situation iii). This means that P i= 0 and I - P i= o. Let us state now the quantum rule that determines in which way the outcomes occur and what happens to the state under the influence of a measurement. If the state p of the quantum entity is represented by a unit vector (ZI, Z2) and the experiment e by the set of non zero projection operators {P, I - P}, then the outcome 01 of e occurs with a probability given by < (ZI,Z2),P(ZI,Z2) > and if 01 (35) occurs the unit vector (ZI, Z2) is changed into the unit vector P(ZI, Z2) IP(ZI, z2)1 (36) that represents the new state of the quantum entity after the experiment e has been performed. The outcome 02 of e occurs with a probability given by (37) and if 02 occurs the unit vector (ZI' Z2) is changed into the unit vector (I - P)(ZI,Z2) I(I - P)(ZI, z2)1 (38) that represents the new state of the quantum entity after the experiment e has been performed. We remark that we have: < (Zl' Z2), P(ZI, Z2) > + < (ZI, Z2), (I - P)(ZI, Z2) > =< (ZI' Z2), (ZI, Z2) >= 1 (39) such that these numbers can indeed serve as probabilities for the respective outcomes, i.e., they add up to 1 exhausting all other possibilities. iii) The quantum representation of the spin experiments Let us now make explicit how quantum mechanics describes the spin experiments with a Stern-Gerlach apparatus in direction u, hence the experiment f u· The projection operators that correspond to the spin measurement of the spin of a spin ~ quantum particle in the u direction are given by {Pu , 1- Pu } where Pu =~ ( 2 1 i:" cosa e+ 2{3 sin a e- i {3 sina ) 1 - cos a (40) STRUCTURES AND PARADOXES 161 where u = (1, a, (3) and hence a and (3 are the spherical coordinates angles of the vector u. We can easily verify that 1 ( 1- cosa . - e+~(J sin a 1- Pu = P u = 2 sin a ) 1 + cos a _e-i(J (41) Let us calculate in this case the quantum probabilities. Hence suppose that we have the spin in a state qv and a spin measurement f u in the direction u is performed. The quantum rule for the probabilities then gives us the following result. The probability /1(fu, qv, 01) that the experiment f u gives an outcome 01 if the quantum particle is in state qv is given by: (42) And the probability /1(fu, qv, 02) that the experiment fu gives an outcome 02 if the quantum particle is in state qv is given by: (43) This is not a complicated, but a somewhat long calculation. If we introduce the angle 'Y between the two directions u and v (see Fig. 11), it is possible to show that the probabilities can be expressed by means of this angle 'Y in a simple form. Fig. 11 : Two points v=(l,O,¢) and u=(l,a,(J) on the unit sphere and the angle 'Y between the two directions (O,¢) and (a,{3). We have: J..l(fu, qv, 01) = cos 2 ~ (44) J..l(fu, qv, 02) = sin2 ~ (45) 162 DIED ERIK AERTS These are indeed the transition probabilities for the spin measurement of the spin of a spin~ quantum particle found in the laboratory (see formulas 13, 14). To see this immediately we can, without loss of generality, choose the situation of the spin measurement for the z axis direction. Hence in this case we have u = (1, a, 13) with a = 0 and 13 = 0 for the experiment giving outcome 01 and a = 7r and 13 = 0 for the experiment giving the outcome 02. This gives us for fu the projection operators {Pu , P- u } given by: (46) For the probabilities we have: f-l(fu, qv, 01) =< cv , PuCv ) f-l ( fu,qv, 02 =< Cv,P-uCv 2 e >= cos "2 . 2 e >= SIn "2 (47) (48) We mentioned already that quantum mechanics describes the experiments in a very abstract way. This is one of the reasons why it is so difficult to understand many of the aspects of quantum mechanics. Since the quantum machine that we have presented also generates an isomorphic structure we have shown that, within our common reality, it is possible to realize this structure. The quantum machine is a model for the spin of a spin ~ quantum particle and it is also a model for the abstract quantum description in the two dimensional complex vector space. This result will make it possible for us to analyse the hard quantum problems and to propose new solutions to these problems. 6. QUANTUM PROBABILITY Probability, as it was first and formally introduced by Laplace and later axiomatised by Kolmogorov, is a description of our lack of knowledge about what really happens. Suppose that we say for a dice we throw, we have one chance out of six that the number 2 will be up when it lands on the floor. What is the meaning of this statement? It is very well possible that the motion of the dice through space after it has been thrown is completely deterministic and hence that if we knew all the details we could predict the outcome with certaintyll. So the emergence of the 11 We do not state here that complete determinism is the case, but just want to point out that it could be. STRUCTURES AND PARADOXES 163 probability ~ in the case of the diee does not refer to an indeterminism in the reality of what happens with the dice. It is just a description of our lack of knowledge about what precisely happens. Since we do not know what really happens during the trajectory of the dice and its landing and since we also do not know this for all repeated experiments we will make, there is an equal chance-for reasons of symmetry-for each side to be up after the diee stopped moving. Probability of ~ for each event expresses this type of lack of knowledge. Philosophers say in this case that the probability is 'epistemic'. Probability that finds its origin in nature itself and not in our lack of knowledge about what really happens is called 'ontologie'. Before the advent of quantum mechanics there was no important debate about thenature of the probabilities encountered in the physical world. It was commonly accepted that probabilities are epistemic and hence can be explained as due to our lack of knowledge about what really happens. It is this type of probability-the one encountered in classical physics-that was axiomatized by Kolmogorov and therefore it is commonly believed that Kolmogorovian probabilities are epistemic. It was rather a shock when physicists found out that the structure of the probability model that is encountered in quantum mechanics does not satisfy the axioms of Kolmogorov. Would this mean that quantum probabilities are ontologic? Anyhow it could only mean two things: (1) the quantum probabilities are ontologic, or (2) the axiomatic system of Kolmogorov does not describe all types of epistemic probabilities. In this second case quantum probabilities could still be epistemic. 6.1. Hidden Variable Theories Most of investigations that physicist, mathematicians and philosophers carried out regarding the nature of the quantum probabilities pointed in the direction of ontologie probabilities. Indeed, a whole area of research was born specifically with the aim of investigating this problem, it was called 'hidden variable theory research'. The reason for this naming is the following: if the quantum probabilities are merely epistemic, it must be possible to build models with 'hidden variables', their prior absence causing the probabilistic description. These models must be able to substitute the quantum models, i.e., they are equivalent, and they entail explicitly epistemic probabilities, in the sense that a randomisation over the hidden variables gives rise to the quantum probabilities. Actually physicists trying to construct such hidden variable models had thermodynamics in mind. The theory of thermodynamies is independent of classical mechanics, and has its own set of physical quantities 164 DIEDERIK AERTS such as pressure, volume, temperature, energy and entropy and its own set of states. It was, however, possible to introduce an underlying theory of classical mechanics. To do so, one assumes that every thermodynamic entity consists of a large number of molecules and the real pure state of the entity is determined by the positions and momenta of all these molecules. A thermodynamic state of the entity is then a mixture or mixed state of the underlying theory. The programme had been found feasible and it was a great success to derive the laws of thermodynamics in this way from Newtonian mechanics. Is it possible to do something similar for quantum mechanics? Is it possible to introduce extra variables into quantum mechanics such that these variables define new states, and the description of the entity based on these new states is classical? Moreover would quantum mechanics be the statistical theory that results by averaging over these variables? This is what scientists working on hidden variable theories were looking for. John von Neumann gave the first proof of the impossibility of hidden variables for quantum mechanics [1]. One of the assumptions that von Neumann made in his proof is that the expectation value of a linear combination of physical quantities is the linear combination of the expectation values of the physical quantities. As remarked by John Bell [11], this assumption is not justified for non compatible physical quantities, such that, indeed, von Neumann's proof cannot be considered conclusive. Bell constructs in the same reference a hidden variable model for the spin of a spin ~ quantum particle, and shows that indeed von Neumann's assumption is not satisfied in his model. Bell also criticizes in his paper two other proofs of the nonexistence of hidden variables, namely the proof by Jauch and Piron [12] and the proof by Gleason [13]. Bell correctly points out the danger of demanding extra assumptions to be satisfied without knowing exactly what these assumptions mean physically. The extra mathematical assumptions, criticized by Bell, were introduced in all these approaches to express the physical idea that it must be possible to find, in the hidden variable description, the original physical quantities and their basic algebra. This physical idea was most delicately expressed, without extra mathematical assumptions, and used in the impossibility proof of Kochen and Specker [14]. Gudder [15] gave an impossibility proof along the same lines as the one of Jauch and Piron, but now carefully avoiding the assumptions criticized by Bell. One could conclude by stating that everyone of these impossibility proofs consists of showing that a hidden variable theory gives rise to a certain mathematical structure for the physical quantities (see [1], [13J and [14]) or for the properties (see [12J and [15]) of the physical entity under consideration. The physical quantities and the properties of a STRUCTURES AND PARADOXES 165 quantum mechanical entity do not fit into this structure and therefore it is impossible to replace quantum mechanics by a hidden variable theory. More recently this structural difference between classical entities and quantum entities has been studied by Accardi within the category of the probability models itself [16], [17]. Accardi explicitly defines the concept of Kolmogorovian probability model starting from the concept of conditional probability. He identifies Bayes axiom as the one that, if it is satisfied, renders the probability model Kolmogorovian, i.e., classical. Again this approach of probability shows the fundamental difference between a classical theory and a quantum theory. A lot of physicists, once aware of this fundamental structural difference between classical and quantum theories, gave up hope that it would ever be possible to replace quantum mechanics by a hidden variable theory; and we admit to have been among them. Meanwhile it has become clear that the state of affairs is more complicated. 6.2. Hidden Measurement Theories Years ago we managed to built a macroscopic classical entity that violates the Bell inequalities [18], [19] and [20]. About the same time Accardi had shown that Bell inequalities are equivalent to his inequalities characterizing a Kolmogorovian probability model. This meant that the example we had constructed to violate Bell inequalities should also violate Accardi's inequalities characterizing a Kolmogorovian probability model, which indeed proved to be the case. But this meant that we had given an example of a macroscopic 'classical' entity having a non Kolmogorovian probability model. This was very amazing and the classification made by many physicists of a micro world described by quantum mechanics and a macro world described by classical physics was challenged. The macroscopic entity with a non Kolmogorovian probability model was first published in [7] and refined in [8], but essentially it is the quantum machine that we have presented again in this paper. We were able to show at that time the state of affairs to be the following. If we have a physical entity S and we have a lack of knowledge about the state p of the physical entity S, then a theory describing this situation is necessarily a classical statistical theory with a Kolmogorovian probability model. If we have a physical entity S and a lack of knowledge about the measurement e to be performed on the physical entity S, and to be changing the state of the entity S, then we cannot describe this situation by a classical statistical theory, because the probability model that arises is non Kolmogorovian. Hence, lack of knowledge about measurements, that change the state of the entity under study, gives rise 166 DIED ERIK AERTS to a non Kolmogorovian probability model. What do we mean by 'lack of knowledge about the measurement e'? Well, we mean that the measurement e is in fact not a 'pure' measurement, in the sense that there are hidden properties of the measurement, such that the performance of e introduces the performance of a 'hidden' measurement, denoted e A , with the same set of outcomes as the measurement e. The measurement e consists then in fact of choosing one way or another between one of the hidden measurements e A and then performing the chosen measurement eA. We can very easily see how these hidden measurements appear in the quantum machine. Indeed, if we call the measurement e~ the measurement that consists in performing eu and such that the elastic breaks in the point A for some A E [-u, u], then, each time eu is performed, it is actually one of the e~ that takes place. We do not control this, in the sense that the e~ are really 'hidden measurements' that we cannot choose to perform. The probability fL(eu,pv, or) that the experiment eu gives the outcome 01 if the entity is in state Pv is a randomisation over the different situations where the hidden measurements e~ gives the outcome 01 with the entity in state Pv. This is exactly the way we have calculated this probability in section 3.4. 6.3. Explaining the Quantum Probabilities First of all we have to mention that it is possible to construct a 'quantum machine' model for an arbitrary quantum mechanical entity. This means that the 'hidden measurement' explanation can be adopted generally, explaining also the origin of the quantum probabilities for a general quantum entity. We refer to [8], [21], [22], [23] and [24] for a demonstration of the fact that every quantum mechanical entity can be represented by a hidden measurement model. What is now the consequence of this for the explanation of the quantum probabilities? It proves that quantum probability is epistemic, and hence conceptually no ontologic probability has to be introduced to account for it. The quantum probability however finds its origin in the lack of knowledge that we have about the interaction between the measurement and the physical entity. In this sense it is a type of probability that is non-classical and does not appear in the classical statistical theories. Quantum mechanical states are pure states and descriptions of the reality of the quantum entity. This means that the 'hidden variable' theories which try to make quantum theory into a classical statistical theory are doomed to fail, as the mentioned no-go theorems for hidden variable theories had proven already without explaining why. Our approach allows to understand why. Indeed, STRUCTURES AND PARADOXES 167 there is another type of epistemic probability than the one identified in the classical statistical theories, namely the probability due to a lack of knowledge on the interaction between the measurement and the physical entity under study. It is natural that this new type of epistemic probability cannot be eliminated from a theory that describes the reality of the physical entity, because it appears when one incorporates the experiments related to the measurements of the properties of the physical entity in question. Therefore it is also natural that it remains present in the physical theory describing the reality of the physical entity, but it has no ontologic nature. This means that, with the explanation of the quantum probabilities put forward here, quantum mechanics does not contradict determinism for the whole of reality12. 6.4. Quantum, Classical and Intermediate If we demand for the quantum machine that the elastic can break at everyone of its points, and the breaking of a piece is such that it is proportional to the length of this piece, then this hypothesis fixes the possible 'amount' of lack of knowledge about the interaction between the experimental apparatus and the physical entity. Indeed, only certain type of elastic can be used to perform the experiments. On the other hand, we can easily imagine elastics that break according to different laws depending on their physical construction. Let us introduce the following different kinds of elastics: at one extremity we consider elastics that can break in everyone of its points and such that the breaking of a piece is proportional to the length of this piece. These are the ones we have already considered, and since they lead to a pure quantum structure, we call them quantum elastics. At the other extremity, we consider a type of elastic that can only break in one point, and let us suppose, for the sake of simplicity, that this point is the middle of the elastic (in [5], [25], [26] and [27] the general situation is treated). This kind of elastic is far from elastic, but since it is an extreme type of real elastics, we still give it that name. We shall show that if experiments are performed with this class of elastic, the resulting structures are classical, and therefore we will call them classical elastics. For the general case, we want to consider a class of elastics that can only break in a segment of length 2E around the center of the elastic. Let us call these E-elastics. 12 Our explanation does of course not prove that the whole of reality is deterministic. It shows that quantum mechanics does not give us an argument for the contrary. 168 DIEDERIK AERTS Fig 12 : An experiment with an E-elastic. The elastic can only break between the points -E and +E. L1 is the length of the interval where the elastic can break such that the point P finally arrives in u, and L2 is the length of the interval where the elastic can break such that the point P finally arrives at -u. The elastic with E = 0, hence the O-elastic, is the classical elastic, and the elastic with E = 1, hence the I-elastic, is the quantum elastic. In this way, the parameter E can be interpreted as representing the magnitude of the lack of knowledge about the interaction between the measuring apparatus and the physical entity. If E = 0, and for the experiment eu only classical elastics are used, there is no lack of knowledge, in the sense that all elastics will break at the same point and have the same influence on the changing of the state of the entity. The experiment eu is then a pure experiment. If E = 1, and for the experiment e u only quantum elastics are used, the lack of knowledge is maximal, because the chosen elastic can break at any of its points. In Fig 12 we have represented a typical situation of an experiment with an E-elastic, where the elastic can only break between the points -E and +E. Let us calculate the probabilities p( e u , Pv, 01) and p( e u , Pv, 02) for state-transitions from the state Pv of the particle P before the experiment e u to one of the states Pu or p-u. Different cases are possible: (1) If the projection of the point P lies between -u and -E (see Fig 12), then p(eu,pv, 01) = 0 (49) p(eu,pv, 02) = 1 (2) If the projection of the point P lies between +E and u, then p(eu,pv,od = 1 p(eu,pv, 02) = 0 (3) If the projection of the point P lies between 1 -(E - cos,),) 2E 1 2E (E + cos')') (50) -E and +E then (51) (52) The entity that we describe here is neither quantum, nor classical, but intermediate. If we introduce these intermediate entities, then it becomes STRUCTURES AND PARADOXES 169 possible to describe a continuous transition from quantum to classical (see [5], [25], [26] and [27] for details). It gives us a way to introduce a specific solution to 'classical limit problem'. 7. QUANTUM AXIOMATICS: THE OPERATIONAL PART In the foregoing example of the intermediate situation we have the feeling that we consider a situation that will not fit into standard quantum mechanics. However the situation is either not classical. But how could we prove this? This could only be done if we had an axiomatic formulation of quantum mechanics and classical mechanics, such that the axioms could be verified on real physical examples of entities to see whether a certain situation is quantum or classical or neither. This means that the axioms have to be formulated by means of concepts that can be identified properly if a real physical entity is given. This is certainly not the case for standard quantum mechanics, but within the quantum structures research large parts of such an axiomatic system has been realised through the years. 7.1. State Property Spaces: the Ontologie Part By lack of space we can not expose all the details of an operational axiomatic formulation, but we will consider the most important ingredients quantum particle or the in some detail and consider the spin of a spin~ quantum machine as an example. In the first place we have to formalize the basic concepts: states and properties of a physical entity. i) The states of the entity S With each entity S corresponds a well defined set of states E of the entity. This are the modes of being of the entity. This means that at each moment the entity S 'is' in a specific state pEE. ii) The properties of the entity S Historically quantum axiomatics has been elaborated mainly by considering the set of properties 13 . With each entity S corresponds a well defined set of properties .c. The entity S 'has' a certain property or does not have it. We will respectively say that the property a E .c is 'actual' or is 'potential' for the entity S. 13 We have to remark that in the original paper of Birkhoff and von Neumann [2], the concept of 'operational proposition' is introduced as the basic concept. An operational proposition is not the same as a property (see [28], [29]), but it points at the same structural part of the quantum axiomatic. 170 DIEDERIK AERTS To be able to present the axiomatisation of the set of states and the set of properties of an entity S in a mathematical way, we have to introduce some additional concepts. Suppose that the entity S is in a specific state p E ~. Then some of the properties of S are actual and some are not (potential). This means that with each state p E ~ corresponds a set of actual properties, subset of .c. Mathematically this defines a function ~ : ~ --+ P('c), which makes each state p E ~ correspond to the set ~(p) of properties that are actual in this state. With the notation P('c) we mean the 'powerset' of 'c, i.e., the set of all subsets of .c. From now on-and this is methodologically a step towards mathematical axiomatization-we can replace the statement 'property a E ,C is actual for the entity S in state p E ~' by 'a E ~ (p)', which is just an expression of set theory. Suppose now that for the entity S a specific property a E ,C is actual. Then this entity is in a certain state pEL: that makes a actual. With each property a E ,C we can associate the set of states that make this property actual, i.e., a subset of~. Mathematically this defines a function '" : ,C --+ P(~), which makes each property a E ,C correspond to the set of states ",(a) that make this property actual. This is a similar step to axiomatization. Indeed, this time we can replace the statement 'property a E ,C is actual if the entity S is in state p E ~' by the set theoretical expression 'p E ",(a)'. Summarising the foregoing we now have: property a E ,C is actual for the entity S in state p E ¢:? a E ~(p) ¢:? p E ",(a) ~ (53) This expresses a fundamental 'duality' among states and properties. We will introduce a specific mathematical structure to represent an entity S, its states and its properties, taking into account this duality. We need the set ~, the set 'c, and the two functions ~ and "'. Definition 1 (state property space). Consider two sets two functions ~:P('c) pf-7~() '" : ,C --+ P(~) If p E ~ a f-7 ",(a) ~ and ,C and (54) and a E ,C we have: aE ~(p) ¢:? P E ",(a) (55) then we say that (~,'c "') is a state property space. The elements of ~ are interpreted as states and the elements of ,C as properties of the STRUCTURES AND PARADOXES 171 entity S. The interpretation of (55) is 'property a is actual if S is in state p' 14 There are two natural 'implication relations' we can introduce on a state property space. If the situation is such that if 'a E [, is actual for S in state p E ~' implies that 'b E [, is actual for S in state p E ~' we say that the property a implies the property b. This 'property implication' relation is expressed by a mathematical relation on the set of properties (see following definition). If the situation is such that 'a E [, is actual for S in state q E ~' implies that 'a E is actual for S in state p E ~' we say that the state p implies the state q. Again we will express this 'state implication' by means of a mathematical relation on the set of states. Definition 2 {state implication and property implication}. Consider a state property space (~, £,~ ~). For a, bE £ we introduce: a -< b {:} and we say that a ~(a) (56) C ~(b) 'implies' b. For p, q p -< q {:} ~(q) E ~ we introduce: C ~(p) (57) and we say that p 'implies' q 15. We will introduce now the mathematical concept of a pre-order relation. Definition 3 {pre-order relation}. Suppose that we have a set Z. We say that -< is a pre-order relation on Z iff for x, y, z E Z we have: x-<x x -< y and y -< z =? x -< z For two elements x, y E Z such that x -< y and y -< x we denote x and we say that x is equivalent to y. (58) :::::! y 14 We remark that it is enough to give two sets L: and £ and a function ~ : L: --> P(£) to define a state property space. Indeed, if we define the function r;, : £ --> P(L:) such that r;,(a) = {p I pEL:, a E ~(p)} then (L:, £,~ r;,) is a state property space. This explains why we do not explicitly consider the function r;, in the formal approach outlined in [6], [30] and [31] in the definition of a state property system, which is a specific type of state property space. Similarly it would be enough to give L:, £ and r;, : £ --> P(L:). 15 Remark that the state implication and property implication are not defined in a completely analogous way. Indeed, then we should for example have written p -< q <=} ~(p) c ~(q). That we have chosen to define the state implication the other way around is because historically this is how intuitively is thought about states implying one another. 172 DIED ERIK AERTS It is easy to verify that the implication relations that we have introduced are pre-order relations. Theorem 1. Consider a state property space (2:, L, C K:), then 2:,-< and L, -< are pre-ordered sets. We can show the following for a state property space Theorem 2. Consider a state property space (2:, L, ~, K:). (1) Suppose that a, bEL and P E 2:. If a E ~(p) and a -< b, then b E ~(p). (2) Suppose that p, q E 2: and a E L. If q E K:(a) and p -< q then p E K:(a). Proof: (1) We have p E K:(a) and K:(a) C K:(b). This proves that p E K:(b) and hence b E ~(p). (2) We have a E ~(q) and ~(q) C ~(p) and hence a E ~(p). This shows that p E K:(a). The reader will now better understand why the original studies of the axiomatization of quantum mechanics have been called quantum logic. Indeed, we have also used the name 'implication'. We will see that we can also introduce concepts that are close to 'disjunction' and 'conjunction'. But we point out again that we are structuring more than just the logical aspects of entities. We aim at a formalization of the complete ontologic structure of physical entities. 7.2. Meet Properties and Join States If we have a structure with implications and we are inspired by logic, we are tempted to wonder about conjunctions and disjunctions. Here again it becomes clear that we are studying a quite different situation than the one analyzed by traditional logic. Suppose we consider a set of properties (aik It is very well possible that there exist states of the entity S in which all the properties ai are actual. This is in fact always the case if niK:(ai) "I- 0. Indeed, if we consider p E niK:( ai) and S in state p, then all the properties ai are actual. If it is such that the situation where all properties ai of a set (ai)i and no other are actual is again a property of the entity S, we will denote this new property by !\iai, and call it the 'meet property' of all ai. Clearly we have !\iai is actual for S in state p E 2: iff ai is actual for all i for S in state p. This means that we have !\iai E ~(p) iff ai E ~(p) Vi. This formulation of the 'meet property' gives us the clue how to introduce it formally in a state property space. Suppose now that we consider a set of states (pj) j of the entity S. It is very well possible that there exist properties of the entity such that these properties are actual if S is in anyone of the states Pj. This is in STRUCTURES AND PARADOXES 173 fact always the case if nj~(p) =I=- 0. Indeed suppose that a E nj~(p). Then we have that a E ~ (pj) for each one of the states Pj, which means that a is actual if S is in anyone of the states Pj. If it is such that the situation where S is in anyone of the states Pj is again a state of S, we will denote this new state by VjPj and call it the 'join state' of all Pj. Clearly we have that a property a E C is actual for S in state V jPj iff this property a is actual for S in any of the states Pj. This formulation of the 'join state' indicates again the way we have to introduce it formally in a state property space 16 . The existence of meet properties and join states will give additional structure to L; and C. Definition 4 (complete state property space). Consider a state propK:). We say that the state property space is 'property erty space (L;, C,~ complete' iff for an arbitrary set (ai)i, ai E C of properties there exists a property !\iai E C such that for an arbitrary state pEL;: (59) We say that a state property space is 'state complete' iff for an arbitrary set of states (pj) j, Pj E ~ there exists a state V jPj E ~ such that for an arbitrary property a E C: (60) If a state property space is property complete and state complete we call it a 'complete' state property space. The following definition and theorem explain why we have chosen to call such a state property space complete. Definition 5 (complete pre-ordered set). Suppose that Z, --< is a preordered set. We say that Z is a complete pre-ordered set iff for each subset (Xi)i, Xi E Z of elements of Z there exists an infimum and a supremum in Z17. 16 We remark that we could also try to introduce join properties and meet states. It is however a subtle, but deep, property of reality, that this cannot be done on the same level. We will understand this better when we introduce in the next section the operational aspects of the axiomatic approach. We will see there that only meet properties and join states can be operationally defined in the general situation. 17 An infimum of a subset (Xi)i of a pre-ordered set Z is an element of Z that is smaller than all the Xi and greater than any element that is smaller than all Xi. A supremum of a subset (Xi)i of a pre-ordered set Z is an element of Z that is greater than all the Xi and smaller than any element that is greater than all the Xi. 174 DIEDERIK AERTS Theorem 3. Consider a complete state property space Then ~,-< and C, --< are complete pre-ordered sets. (~, C,~ K,). Proof: Consider an arbitrary set (ai)i, ai E C. We will show that !\iai is an infimum. First we have to proof that !\iai --< ak V k. This follows immediately from (59) and the definition of --< given in (56). Indeed, from this definition follows that we have to prove that K,(!\iai) C K,(ak) V k. Consider p E K,(!\iai). From (55) follows that this implies that !\iai E ~(p). Through (59) this implies that ak E ~(p) V k. If we apply (55) again this proves that p E /'i,(ak) V k. So we have shown that K,(!\iad C K,(ak) V k. This shows already that !\iai is a lower bound for the set (aik Let us now show that it is a greatest lower bound. So consider another lower bound, a property b E C such that b --< ak V k. Let us show that b --< !\iai. Consider p E K,(b), then we have pEak V k since b is a lower bound. This gives us that ak E ~(p) V k, and as a consequence !\iai E ~(p). But this shows that p E /'i,(!\iai). So we have proven that b --< !\i ai and hence !\i ai is an infimum of the subset (ai k Let us now prove that V jPj is a supremum of the subset (pj k The proof is very similar, but we use (60) in stead of (59). Let us again first show that V jPj is an upper bound of the subset (pj k We have to show that Pl --< VjPj V l. This means that we have to prove that ~(VjP) C ~(pl) V l. Consider a E ~(VjP), then we have VjPj E /'i,(a). From (60) it follows that Pl E /'i,(a) V l. As a consequence, and applying (55), we have that a E ~(pl) V l. Let is now prove that it is a least upper bound. Hence consider another upper bound, meaning a state q, such that Pl --< q V l. This means that ~(q) C ~(Pl) V l. Consider now a E ~(q), then we have a E ~(pl) V l. Using again (55), we have PI E K,(a) V l. From (60) follows then that Vjpj E /'i,(a) and hence a E ~(Vja). We have shown now that !\iai is an infimum for the set (ai)i, ai E C, and that VjPj is a supremum for the set (Pj)j,Pj E ~. It is a mathematical consequence that for each subset (ai)i, ai E C, there exists also a supremum in C, let is denote it by Viai, and that for each subset (pj) j , Pj E ~, there exists also an infimum in ~, let us denote it by !\jPj. They are respectively given by Viai = !\xE£:',ai...;xVi x and !\jPj = VyE'E,Y"';PjVj y18. For both C and ~ it can be shown that this implies that there is at least one minimal and one maximal element. Indeed, an infimum of all elements of C is a minimal element of C and an infimum of the empty 18 We remark that the supremum for elements of £:. and the infimum for elements of E, although they exists, as we have proven here, have no simple operational meaning, as we will see in the next section. STRUCTURES AND PARADOXES 175 set is a maximal element of £'. In an analogous way a supremum of all elements of E is a maximal element of E and a supremum of the empty set is a minimal element of I;. Of course there can be more minimal and maximal elements. If a property a E £, is minimal we will express this by a ~ 0 and if a property bE£' is maximal we will express this by b ~ I. An analogous notation will be used for the maximal and minimal states. For a complete state property space we can specify the structure of the maps ~ and K, somewhat more after having introduced the concept of 'property state' and 'state property'. Theorem 4. Consider a complete state property space (~, £', ~, K,). For pEE we define the 'property state' corresponding to p as the property s(p) = /\aE~(p). For a E .c we define the 'state property' corresponding to a as the state t( a) = VpEK(a)P. We have two maps: t : .c ~ I; a H t(a) s :~ -+ £, P f---7 s(p) a -< b {:} t(a) -< t(b) p -< q {:} s(p) -< s(q) t(l\iai) ~ I\i t (ai) s(Vjpj) ~ Vjs(pj) (61) (62) Proof: Suppose that p -< q. Then we have ~(q) C ~(p). From this follows that s(p) = l\aEt;(p)a -< l\aEt;(q)a = s(q). Suppose now that s(p) -< s(q). then we have s(q) -< a. Hence also s(p) -< a. But this Take a E ~(q), Hence this shows that ~(q) C ~(p) and as a implies that a E ~(p). consequence we have p -< q. Because I\iai -< ak \:f k we have t(l\iai) -< t(ak) \:fk. This shows that t(l\iai) is a lower bound for the set (t(adk Let us show that it is a smallest lower bound. Suppose that p -< t(ak) \:f k. We remark that t(ak) E K,(ak). Then it follows that p E K,(ak) \:f k. As a \:f k. But then I\iai E ~(p) which shows consequence we have ak E ~(p) that p E K,(l\iai). This proves that p -< t(l\iai). So we have shown that t(l\iai) is a smallest lower bound and hence it is equivalent to I\it(ai). Theorem 5. Consider a complete state property space (~, I:,~ K,). For ~ we have ~(p) = [s(p), +00] = {a E I: I s(p) -< a}. For a E I: we have K,(a) = [-00, t(a)] = {p E ~ I p -< t(a)}. pE Proof: Consider b E [s(p) , +00]. This means that s(p) -< b, and hence ~(p). Consider now b E ~(p). Then s(p) -< b and hence bE [s(p) , +00]. bE 176 DIEDERIK AERTS If p is a state such that ~(p) = 0, this means that there is no property actual for the entity being in state p. We will call such states 'improper' states. Hence a 'proper' state is a state that makes at least one property actual. In an analogous way, if /';;(a) = 0, this means that there is no state that makes the property a actual. Such a property will be called an 'improper' property. A 'proper' property is a property that is actual for at least one state. Definition 6. Consider a state property space (~,.c /';;). We call p E a 'proper' state iff ~(p) =I=- 0. We call a E .c a 'proper' property iff /';;(a) =I=- 0. A state p E ~ such that ~(p) = 0 is called an 'improper' state, and a property a E .c such that /';;(a) = 0 is called an 'improper' property. ~ It easily follows from theorem 5 that a complete state property space has no improper states (J ~ 1\0 E ~(p) and no improper properties (0 ~ v0 E /';;(a)). 7.3. Tests and Preparations: the Operational Part Our contact with physical entities of the exterior world happens by means of experiments we can perform. A test is an experiment we perform on the physical entity in a certain state testing a certain hypothesis. States can often be prepared. A preparation is an experiment we perform on the physical entity such that as a result of the experiment the entity is in a certain state. We will not develop the algebra of experiments connected to a physical entity in a complete way in this paper, and refer to [6] for such an elaboration. Here we will only introduce the concepts that we need for our principal purpose: the presentation of quantum axiomatics. i) The tests on the entity S Tests are experiments that verify a certain hypothesis about the entity S. More specifically tests can test properties of the entity S in the following way. With a property a E .c corresponds a test a(a), which is an experiment with two possible outcomes yes and no. If the test a(a) has an outcome yes it does not yet prove that the property a is actual. It is only when we can predict with certainty that the test would have an outcome yes, without necessarily performing it, that the property a is actual. STRUCTURES AND PARADOXES 177 Definition 1 (testing a property). Suppose that we have an entity S with corresponding state property space (~, [" ~, /'i,). a( a) is a test of the property a E [, if we have a E ~(p) {:? yes can be predicted for a(a) S being in state p (63) ii) The preparations of the entity S Preparations are experiments that prepare a state of the entity S. More specifically, with a state p E ~ corresponds a preparation 7r(p) which is an experiment such that after the performance of the experiment the entity 'is' in state p. Definition 8 (preparing a state). Suppose that we have an entity S with corresponding state property space (~, [,~ /'i,). 7r(p) is a preparation of the state p E ~ p E /'i,(a) if we have {:? a is actual after the preparation 7r(p) (64) For a set of tests (ai)i and for a set of preparations (7rj)j we can now introduce in a very natural way a new test, that we call the product test, denoted by lIiai, and a new preparation, that we call the product preparation, denoted by IIp!"j, as follows: Definition 9 (product test and preparation). To execute lIiai we choose one of the ai, perform it and consider the outcome that we obtain. To execute lIj7rj we choose one of the 7rj, perform is and consider the state that we obtain. We want to show now that the product test tests an infimum of a set of properties, while the product preparation prepares a supremum of a set of states. Theorem 6. Suppose that we have an entity S with corresponding state property space (I::, 12,~ K.). Consider a set of properties (ai)i, ai E £ and a set of states (Pj)j, Pj E I::. Suppose that we have tests and preparations available for all properties and states. Then the product test lIia(ai) tests a meet property /\iai, where a(ak) tests ak V k, and the product preparation lIj7r(pj) prepares a join state VjPj, where lI(Pl) prepares PI V l. Proof: We have to show that 'yes can be predicted for lIia(ai) the entity S being in state P' is equivalent to 'ak E ~(p) V k'. This follows immediately from the definition of the product test. Indeed 'yes can be 178 DIED ERIK AERTS predicted for IIia(ai) the entity S being in state p' is equivalent to 'yes can be predicted for a(ak) \j k the entity S being in state p'. Consider now an arbitrary property a E 1: and suppose that (7r(Pj))j is a set of preparations that make a actual if the entity S is in state Pj. Consider now the preparation II j 7r(pj) , that consists of choosing one of the 7r(pj) and performing it. Then it is clear that a is actual after this preparation, since S will be in one of the states Pj. On the other hand, suppose now that IIj1r(pj) is a preparation that makes a E 1: actual. Consider an arbitrary one of the preparations 7r(Pk) of the product preparation. Then obviously also this preparation has to make a actual, since it could have been this one that was chosen by performing the product preparation. This shows that II j 7r(pj) prepares the state V jPj. This theorem shows that it is natural to introduce the meet property for a set of properties and the join state for a set of states, like we did in the foregoing section. It is time now that we expose the concepts that we have introduced here for the example of the spin of a spin! quantum particle. 7.4. The Example of the Spin Model In section 5 we have explained in detail the standard quantum description of the spin of a spin! quantum particle. For the case of the spin of a spin! quantum particle, the experiments fu have only two outcomes 01 and 02, and hence they are tests in the sense that if 01 is interpreted as yes then 02 is no. This means that we can represent the properties by means of the projection operators that we use to represent the experiments. For each direction u we have a property au that is represented by the projection operator given in formula (40) Pu = ~ ( 1 + cos a 2 e+ 2 {3 sin a e- i {3 sin a ) 1 - cos a (65) where u = (1, a, (3) and hence a and (3 are the spherical coordinates angles of the vector u. The set of properties 1:sPin~ is given by these properties au and the maximal and minimal property that we will respectively denote by I and 0 19 . Hence: 1: sPin ! = {au'!, 0 Iu E surface of the sphere ball} (66) 19 It will become clear in the next section why for the quantum case there is a unique minimal property and a unique maximal property 179 STRUCTURES AND PARADOXES We have stated in section 5 that a state qv of the spin of a spin! quantum particle in direction v is represented by means of the unit vector Cv in the two dimensional complex vector space (2. We have to elaborate a little bit more on the description of the states. Indeed, if we consider again our quantum machine, which is a model for the spin of a spin~ quantum particle, then we can see that there are more states than the ones represented by the unit vectors. If we consider a point w in the interior of the sphere, hence not on the surface of the sphere, then this is also a possible state of the quantum machine, not represented however by a unit vector of the vector space, but by a density operator 20 . Let us analyse this situation in detail. Let us calculate the probabilities for such a state Pw where w is a point inside the sphere. First we remark the following. Because ball is a convex set, each vector w E ball can be written as a convex linear combination of two vectors v and -von the surface of the sphere (see Fig 13). More concretely this means that we can write (referring to the wand v and -v in the Figure 13): w = Al . V - A2 . v, 0::; AI, A2 ::; 1, Al + A2 = 1 (67) Hence, if we introduce these convex combination coefficients AI, A2 we have w = (AI - A2) . v. Let us calculate now the transition probabilities in a general state Pw with w E ball and hence Ilwll ::; 1 (see Fig 13). Again the probability f.-l(eu,pw, 01), that the particle P ends up in point u and hence experiment eu gives outcome 01 is given by the length of the piece of elastic Ll divided by the total length of the elastic. The probability, f.-l(eu,pw, 02), that the particle P ends up in point -u, and hence experiment e u gives outcome 02 is given by the length of the piece of elastic L2 divided by the total length of the elastic. This means that we have: (68) (69) (70) (71) 20 A density operator is an operator W such that < c, We >=< We, c > VeE 1[2 (which means self-adjointness), and such that 0 ::; < W c, W c > V c E 1[2 (which means positiveness), and such that the trace equals to 1. 180 DIEDERIK AERTS v -v Fig 13 : A representation of the experimental process in the case of a state Pw where w is a point of the interior of the sphere. The elastic of length 2, corresponding to the experiment e u , is installed between u and -u. The probability, JL(e u ,pw,ol), that the particle P ends up in point u under influence of the experiment eu is given by the length of the piece of elastic Ll divided by the total length of the elastic. The probability, JL( eu ,pw ,02), that the particle P ends up in point -u is given by the length of the piece of elastic L2 divided by the total length of the elastic. These are new probabilities that will never be obtained if we limit the set of states to the unit vectors of the two dimensional complex space. The question is now the following: can we find a mathematical concept, connected in some way or another to the Hilbert space, that would allow us, with a new quantum rule for calculating probabilities, to recover these probabilities? The answer is yes. We will show that these new 'pure' states of the interior of the sphere can be represented using density operators, the same operators that are used within the standard quantum formalism to represent mixed states. And the standard quantum mechanical formula that is used to calculate probabilities connected to mixed states, represented by density operators, can also be used to calculate the probabilities that we have identified here. But of course the meaning will be different: in our case this standard formula will represent a transition probability from one pure state to another and not the probability connected to the change of a mixed state. Let us show all this explicitly and do this by constructing the density operators in question. The well known quantum formula for the calculation of transition probabilities related to an experiment e, represented by the projections {P, 1- P}, and where the quantum entity is in a mixed state p represented by the density operator W, is the following: f1( e, p, P) = tr(W . P) (72) where tr is the trace of the operator21. A standard quantum mechanical calculation shows that the density ().¢ ().¢ operator representing the ray state Cv = (cos 2ez"2, sin 2e-z"2) (see (23)) is given by: (73) 21 The trace of an operator is the sum of its diagonal elements. STRUCTURES AND PARADOXES 181 and the density operator representing the diametrically opposed ray state C v is given by: sin2 It. - sin fl.2 cos fl.e-ir/> ) 2 W(-v) = ( - sin fl. cos2 fieir/> 2 cos fi 2 2 2 (74) We will show now that the convex linear combination of these two density operators with convex weights Al and A2 represents the state Pw if we use the standard quantum mechanical formula (72) to calculate the transition probabilities. If, for w = Al V + A2 ( -v), we put: (75) we have: W( w) = \ Al COS ( (Al \ - 2 () 2 + A2 \ sm . 2 2,() \ ) . () () ~¢ A2 sm 2 cos 2 e (\ . () () -i¢ ) sm 2 cos 2 e (76) \ 2 13 Al sm 2 + A2 cos 2 Al - \ A2 ) \ . 2 () and it is easy to calculate now the transition probabilities using (72) and: p=(~ ~) (77) We have: W( W· ) P -_ ( ) (78) . 2() 2() = Al sm 2' + .\2 cos 2' (80) Al cos2 ~ + A2 sin2 ~ B B ~ (AI - A2) sin 2 cos '2e t ~ and hence, comparing with (68), we find: In an analogous way we find that: tr(W(w) . (1 - P)) = (..L(eu,pw, 02) So we have shown that we can represent each one of the new states Pw by the density operator W(w) if we use (72) for the calculation of the transition probabilities. We can also prove that each density operator in ([:2 is of this form. We show this easily by using the general properties of density operators. 182 DIEDERIK AERTS Let us identify now the set of states for the case of the spin of a spin~ quantum particle. Each state is of the form Pw with w a point of the sphere ball. There also exist a zero state 0 and a unit state J. = {Pw, 0, J I w L;sPin~ E (81) ball} The state property space corresponding to the spin of a spin~ particle is given by (L;sPin~' .csPin~' ~spin' i'\;sPn~) where quantum (82) We have for u and v belonging to the sphere ball: au E ~sPin We have, for ~spin(PV) Ivl = (Pw) ~ Pw E i'\;spn~ (au) ~ u =w (83) 1, and hence pertaining to the surface of ball: = {av,I} (84) For w < 1, and hence pertaining to the interior of ball we have: ~sPin(w) = {J} (85) 8. QUA N TUM A X 10M A TIC S: THE TEe H N I CAL PAR T In the foregoing section we have introduced the structure that can be operationally founded. To come to the full structure of standard quantum mechanics some additional axioms have to be introduced which are more of a technical nature. This section will be mathematically more sophisticated, but can be skipped for those readers that are mainly interested in the results that are presented in next section. In the introduction we mentioned that quantum axiomatics was developed to build up standard quantum mechanics and not to change it. Meanwhile it has become clear that some of the more technical axioms of standard quantum mechanics are probably not generally satisfied in nature. This finding will be our main comment regarding the standard axioms, and for this reason we anyhow have to introduce them. An analysis of the failing axioms and their consequences is presented in the next section. 8.1. State Property Systems Since we will introduce in this section the axioms that have very little operational meaning, we will enter much less in detail. 183 STRUCTURES AND PARADOXES (1) The identification of properties As we have seen in the example of the spin of a spin~ quantum par- ticle, a property of the physical entity is represented by a projection operator. This remains true for a general quantum entity. Suppose that we consider two properties a and b of a general quantum entity, and their corresponding projection operators Pa and Pb. The implication of properties, giving rise to a pre-order relation on is translated for a quantum entity as follows by means of the projection operators: .c, Theorem 7. Consider a state property space (~, 12, ~, r;,) corresponding to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space H. For two properties a, b E 12, and corresponding projection operators Pa and Pb we have: (86) Definition 10. Consider a pre-ordered set Z, -<. The pre-order relation -< is a partial order relation iff we have for x, Y E Z x -< y and y -< x =? x =Y (87) r;,) corresponding Theorem 8. Consider a state property space (I:, 12,~ to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space H. For two properties a, bE £ we have: a -< band b -< a =? a= b (88) and hence the pre-order relation on £ is a partial order relation. Proof: Suppose that a -< band b -< a. Then from theorem 7 follows that Pa = PaPb = PbPa = Pb. which means that a = b. (2) Completeness of the property lattice The second special property for quantum entities that we will identify has already been explained in the foregoing section. It is related to the existence of the meet property for a set of properties. Suppose that we consider again a quantum entity and a set of properties (ai)i with corresponding set of projection operators (Paik Then there exists a unique projection operator Pa that corresponds to the meet property !\iai· 184 DIEDERIK AERTS Theorem 9. Consider a state property space eE, £, (, K,) corresponding to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space 11. For a set of properties (ai)i, ai E £, and corresponding set of projection operators (PaJi' there exists a projection operator Pa such that a E ((p) ¢:? ai E ((p) Vi (89) which means that a = !\iai and which means that the state property space is property complete (see definition 4) We will not prove this theorem because it will lead us into too many technical details. We only mention that this is a well known result about Hilbert space projectors. (3) Minimal property and maximal property We can remark now also that for the case of a quantum entity the maximal property I is always actual for any state of the entity and the minimal property 0 is never actual (both are unique since the set of properties is a partially ordered set). Indeed: Theorem 10. Consider a state property space (E, £, (, K,) corresponding to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space 11. Consider an arbitrary state pw E E and let 0 be the minimal property and I be the maximal property of £, then o tI. ((p) and I E ((pw). Proof: The minimal property is represented in quantum mechanics by the zero projection 0, and the maximal property by the unit operator I. We have tr(W . 0) = 0 and tr(W . I) = tr(W) = 1 which shows that o tI. ((pw) and I E ((pw). The additional structure that we have observed for a quantum entity in (1), (2) and (3) will be our inspiration for the first axiom, and will make us introduce the structure of a state property system, that we have studied intensively in [30] and [31]. Definition 11 (state property system). Suppose that property space (E, £, (, K,). This state property space erty system iff (£, -<,!\, v) is a complete lattice22 , and maximal element and 0 the minimal element of £, and 22 A complete lattice is a complete partially ordered set. we have a state is a state propfor pEE, I the ai E £, we have: STRUCTURES AND PARADOXES 185 L, ~, /\,) correspondTheorem 11. Consider a state property space (~, ing to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space 71, then (~, L, ~, /\,) is a state property system. The foregoing results make it possible to introduce the first axiom: Axiom 1 (state property system). Suppose that we have a state property space (L:,.c, corresponding to an entity S. We say that axiom 1 is satisfied iff the state property space is a state property system. e, /\,) If axiom 1 is satisfied we will call the set of properties the 'property lattice' corresponding to entity S. 8.2. Atomic States We have seen that the set of states ~ of a general physical entity has a natural pre-order relation that we have called the state implication. We have also explained that a state in standard quantum mechanics can be represented by a density operator. Some of the density operators represent vector states of the Hilbert space. The representation theorem that we will put forward in this section is inspired on the classical representation theorem formulated for the situation where all states are vector states. Therefore we wonder whether it is possible to characterize the vector states in a more general way. This is indeed possible by means of the mathematical concept of 'atom' that we will introduce now. Definition 12. Consider a pre-ordered set Z, -<. We say that x E Z is an 'atom' iff x is not a minimal element and for y -< x we have y ~ x or y is a minimal element of Z. Definition 13. Consider a state property space (L:,.c, t;, /\,) describing an entity S. We know that L:, -< is a pre-ordered set. We call pEL: an 'atomic state' of the entity S iff p is an atom for the pre-order relation on L:. We will denote the set of atomic states of a state property space by means of A. Let us first investigate which are the atomic states for our example of the quantum machine. Following (84) and (85) we have Pv -< Pw and Pv ¢ Pw for Ivl = 1 and w < 1. This shows that none of the states Pw with Iwl < 1 is an atomic state. From (83) follows that Pv -< Pw for Ivl = Iwl = 1 implies that v = wand hence Pv = Pw' This shows that all of the states Pv with Ivl = 1 are atomic states. Hence the atomic states for the quantum machine are exactly these states that correspond 186 DIEDERIK AERTS to points on the surface of ball. For the situation of a general quantum entity described in a Hilbert space 1{ a similar result can be shown: the atomic states are those states that are represented by density operators that correspond to vectors of the Hilbert space. Theorem 12. Consider a state property space eE, £,~ K,) corresponding to a quantum entity S described by a Hilbert space 1{. A state pEE is atomic iff it is represented by a density operator corresponding to a vector of the Hilbert space. Proof: Consider a state PW E E represented by a density operator W of 1{. This density operator W can always be written as a convex linear combination of density operators Wi corresponding to vectors of 1{ and representations of states PWi E E23. Hence we have W = Li Ai Wi. Consider a property a E £ such that a E ~(pw). This means that tr(W Pa) = 1 where Pa is the projection operator representing the property a. We have tr(W Pa ) = tr(Li Ai WiPa ) = Li Aitr(WiPa). Since a :=:; Ai :=:; 1 V i and Li Ai = 1 and a :=:; tr(WiPa) :=:; 1 V i this proves that tr(WiPa ) = 1 V i. As a consequence we have a E ~(pwJ. This shows that ~ (pw) c ~ (pw;) and hence pWi --< PW. This shows already that genuine density operators that are not corresponding to a vector of the Hilbert space are not atomic. Suppose now that we consider a state Pw E E where W corresponds to a vector c E 1{ and another state pv E E such that pv --< pw· This means that ~(pw) c ~(pv). Suppose the property represented by the projector operator on the vector c, let us denote it Pc, is contained in ~(pw) and hence also in ~(Pv). From this follows that V = Pc and hence pv = PW. This shows that states represented by density operators corresponding to vectors of the Hilbert space are atomic states. The reader has perhaps meanwhile understood in which way we will gradually arrive at a full axiomatization of standard quantum mechanics. We analyse step by step what are the requirements that are additionally satisfied for the state property space of a quantum entity described by standard quantum mechanics. To proceed along this line we will now first show that for an entity satisfying axiom 1, the atomic states can be identified unequivocally with atomic properties of the property lattice. This will make it possible to concentrate only on the structure of the property lattice. Theorem 13. Consider a state property space (E, £,~ K,) satisfying axiom 1 (hence a state property system) with set of atomic states A. 23 This is a well known property of density operators in a Hilbert space. 187 STRUCTURES AND PARADOXES Let us denote by A the set of atoms of L. If we consider the function s : L: --t C, then s(A) = A. Proof: Suppose that rEA and let is show that s(r) is a atom of L. Consider a E such that a -< s(r) and a =/::. O. We remark that in this case K;(a) =/::. 0. Indeed, suppose that K;(a) = 0 then a -< b V b E and hence a = O. If K;( a) =/::. 0 there exists apE 2: such that p E K;( a). We then have a E ~(p) and as a consequence s(p) -< a -< s(r). From this follows that p -< r, but since rEA we have p ~ r. This implies that s(p) = s(r) and hence a = s(r). This proves that s(r) is an atom of C. Consider now a E A. Let us show that there exists apE A such that s(p) = a. We have 11:( a) i= 0 since a i= o. This means that there exists pEL: such that p E l1:(a). Hence a E ~(p) and as a consequence we have s(p) -< a. From this follows that s(p) = 0 or s(p) = a. We remark that s(p) = 0 is not possible since this would imply that ~(p) = C and hence 0 E ~(p) which is forbidden. Hence we have s(p) = a. We must still show that p is an atom. Indeed suppose that r -< p, then s(r) -< s(p) = a. This shows that s(r) = a and hence ~(r) = ~(p). .c .c The foregoing theorem shows that we can represent an atomic state by means of the atom of the property lattice on which it is mapped by the map s. That is the reason we will concentrate on the property lattice c. It could well be possible that no or very few atomic states exist. For the case of standard quantum mechanics there are however many atomic states. Theorem 14. Consider a state property space (L:, C, ~, 11:) describing a standard quantum mechanical entity S in a Hilbert space H. Each property ap 1= 0 represented by a non zero projection operator P equals the supremum of the set of rays (one dimensional projections) contained in P. This is the inspiration for the next axiom. First we introduce the concept of a complete atomistic lattice. Definition 14. Consider a complete lattice C, -< and its set of atoms A. We say that C is 'atomistic' iff each a E C is equal to the supremum of its atoms, i.e., a = V cEA,c-<ac. Axiom 2 (atomicity). Consider a state property system (L:, C,~ 11:) describing an entity S. We say that the state property system satisfies axiom 2 iff its property lattice is atomistic. 188 DIED ERIK AERTS We know from the foregoing that the atoms of C represent the atomic states of the entity S. Apart from atomicity, the property lattice of an entity described by standard quantum mechanics satisfies an additional property, called the 'covering law'. It is the following: Axiom 3 (covering law). The property lattice C of a state property system (~, c, r;,) satisfies the 'covering law' iff for c E A and a, b E C such that a 1\ c = 0 and a --< b --< a V c we have a = b or a V c = b. e, The covering law demands that the supremum of a property and an atom 'covers' this property, in the sense that there does not exists a property in between. The first three axioms introduce the linearity of the set of states of the entity. Indeed it can be shown that a complete atomic lattice satisfying the covering law and containing sufficiently many atoms is isomorphic to a projective geometry. Making use of the fundamental theorem of projective geometry we can construct of vector space coordinating this geometry and also representing the original lattice of properties. We refer to [34] for a proof of this fundamental representation theorem for complete atomic lattices satisfying the covering law. 8.3. Orthogonality The next axiom is inspired by the specific and strong orthogonality structure that exists on a Hilbert space. If axiom 1 is satisfied the set of properties is a complete lattice. We give now the definition of the structure of an orthocomplementation, which will make it possible for us to introduce the next axiom. Definition 15 (orthocomplementation). Consider a complete lattice C. We say that I : C ~ C is an orthocomplementation iff for a, b E C we have: a --< b =} b' --< a' (a ' )' = a 1\ a' a (91) =0 e, Theorem 15. Consider a state property space (~, c, r;,) corresponding to a quantum entity S, described by means of the standard quantum formalism in a Hilbert space 1i. If we define for a property a E C, and corresponding projection operator Pa , the property a ' as corresponding to the projection operator 1- Pa , then I : C ~ C is an orthocomplementation. STRUCTURES AND PARADOXES 189 Proof: Suppose that we have a, bEL and their corresponding projection operators Pa and Pb. If a -< b then Pa = PaPb = PbPa. We have (I - Pa)(1 - H) = 1 - H - Pa + PaH = 1 - H = 1 - Pb - Pa + HPa = (1 - H)(1 - Pa). This shows that b' -< a ' . We have 1 - (1 - Pa) = Pa which proves that (a')' = a. Since Pa(I - Pa) = 0 we have a 1\ a' = O. This gives us the next axiom: Axiom 4 (orthocomplementation). A state property system (L;, L, ~,/1;) describing an entity S is called 'property orthocomplemented' and satisfies axiom 4 iff there exists an orthocomplementation on the complete lattice of properties. Apart from the orthocomplementation the property lattice of an entity described by standard quantum mechanics satisfies an additional property called 'weak modularity'. It is a purely technical axiom expressed as follows: Axiom 5 (weak modularity). The property lattice £ of a state property space eE, £,~ /1;) satisfying axiom 1 and 4 (hence a property orthocomplemented state property system) is 'weakly modular' iff for a, bE £ such that a -< b we have (a Vb') 1\ b = a. 8.4. Full Axiomatisation of Standard Quantum Mechanics We need more requirements in order to be able to prove that the obtained structure is isomorphic to standard quantum mechanics. We leave the proof for these requirements to be satisfied in standard quantum mechanics to the dedicated reader. The first requirement is called 'plane transitivity'. It has been identified only recently ([32] [33]). Axiom 6 (plane transitivity). The property lattice £ of a state property space (L:,£~/1;) is 'plane transitive' iff for p,q E L: there are r i= s E L: and an automorphism of £ that maps ponto q and leaves the 'plane' interval [0, r V s] invariant. Let us introduce the next axiom: Axiom 7 (irreducibility). The property lattice £ of a state property space (L:, £,~ /1;) satisfying axiom 1 and 2 (hence a property orthocomplemented state property system) is 'irreducible', i.e., whenever b E £ is such that b = (b 1\ a) V (b 1\ a') V a E £ then b = 0 or b = I. 190 DIEDERIK AERTS The standard representation theorem has been proven for the irreducible components of the property lattice. The foregoing axiom is in this sense not on the same level as the other ones. Indeed even if we do not require the property lattice to be irreducible, the representation theorem can be proven for each irreducible component. It can indeed be shown that a general property lattice is the direct product of its irreducible components. We refer to [38] and [41] and more specifically to [42] and [43] for a detailed analysis of this decomposition. When we mentioned the representation theorem derived from the fundamental theorem of projective geometry in section 8.2 we already pointed out that the property lattice has to contain enough states to be able to derive this theorem. For the full representation theorem of standard quantum mechanics we need infinitely many atoms. Axiom 8 (infinite length). The property lattice £. of a state property space (l:, £.,~ K,) is 'infinite' if it contains an infinite set of mutually orthogonal elements 24 . Theorem 16 (Representation theorem). Suppose that we have an entity S described by means of a state property space (l:, £.,~ K,) for which axioms 1 to 8 are satisfied. Then £. is isomorphic to the complete lattice of the projection operators of an infinite dimensional real, complex or quaternionic Hilbert space N. The atoms of £. and hence also the atomic states of S correspond to the rays of N. The orthocomplementation is induced by the orthogonality structure of N. We will not prove this theorem, but refer to [32, 33, 34, 35, 36, 37, 38, 39, 40,41,42,43,44,45,46,47,48,49] where pieces preparing the proof can be found. We refer to [32, 33] for a recent and more complete overview and the inclusion of the new axiom of plane transitivity. 9. PAR ADO XES AND F A I LIN G A X 10M S The aim of quantum axiomatics was to construct an operational foundation for standard quantum mechanics starting from basic concepts, states and properties, that are easy to identify physically. Once a full axiomatics has been constructed this gives of course a powerful tool to investigate the well known paradoxes that quantum mechanics entails. Let us investigate some aspects of this possibility. 24 Two elements are orthogonal iff they imply respectively a property and its orthocomplement. STRUCTURES AND PARADOXES 191 9.1. The Description of an Entity Consisting of Two Entities We will consider the description of two spins by using the quantum machine model for these spins. So we consider now two quantum machines. Let us call them Sl and S2, and the entity that just consists of these two quantum machines. In a general way, the entity 81 is described by a state property space (~l' £1, 6, fi:1) and the entity 82 is described by a state property space (~2, £2, 6, fi:2). Let us denote the sphere of the first quantum machine Sl by balh, the points of this sphere by Vl E balh, and the states, the experiments and the properties connected to this quantum machine by PVl' eUl and aUl . In an analogous way we denote the sphere of the second quantum machine S2 by ball2, and its states, experiments and properties by PV 2' eU2 and aU2 • As we have shown in (66) and (81) the sets of properties, the sets of states and the sets of experiments are given by: {a Ul {PVl I Ul E balh, lUll I Vi E balh} = 1} {euIIU1Ebalh,lull=1} {a U2 i U2 E ball2, iU2i = 1} {PV2 i V2 E ball2} {e U2 i U2 E ball2, IU2i = 1} (92) Let us call S the compound physical entity, consisting of the two quantum machines, and (~, L,~ fi:) the state property space describing this entity S. To see in which way the three state property spaces are connected we have to analyse the physical situation. The states Clearly a state P of the entity S completely determines a state Pi of Sl and a state P2 of S2-using the physical principle that when the entity S 'is' in state P then the entities Sl and S2 'are' in two corresponding states Pi and P2 25 . This defines two functions: ml:~-* m2 : ~ 25 p~ml() ----* ~2 P~ m2 (p) (93) If we take the ontologie meaning of the concept of state seriously, we can hardly ignore this physical principle. Although, as we will see, standard quantum mechanics gives rise to problems here. 192 DIEDERIK AERTS The properties Each experiment e1 on 51 is also an experiment on 5 and each experiment e2 on 52 is also an experiment on 5-following the physical principle that if we perform an experiment on one of the sub-entities we perform it also on the compound entity26. Since the properties are operationally defined by means of the experiments, from the same physical principle, we have that each property of a sub-entity is also a property of the compound entity. This defines again two functions: n1 : £1 n2 : £2 --+ --+ £ £ a1 a2 1---+ 1---+ n1(a1) n2 (a2) (94) A covariance principle If property a1 is actual for entity 8 1 in state m1(p), then property n1(a1) is actual for entity 5 in state p. An analogous covariance principle is satisfied between entity 52 and entity 5. This means that we have the following equations: a1 E 6(m1(p)) {:} n1(a1) E ~(p) a2 E 6(m2(p)) {:} n2(a1) E (p)~ (95) It has been shown in [50], [30], [31] and [33], that for the case of physical entities satisfying axiom 1 (hence the three state property spaces are state property systems) this covariance principle gives rise to a unique minimal structure for the state property system of the compound entity. It is the (co )product in the category of state property systems. Theorem 17. Suppose that we have two entities 51 and 52 with state property systems (L:1' £1, 6, /1;1) and (L:2' £2, 6, /1;2) that form a compound entity 5 with a state property system (L:, £,~ /1;) according to (93), (94) and (95). The minimal solution is as follows: L: = L:1 X L:2 (96) where L:1 x L:2 is the cartesian product of L:1 and L: 2. For (PI, P2) E L: we have:- (97) 26 Again, if we take the meaning of what an experiment is seriously, it is hard to ignore this principle. We have even no reason here to doubt it, because standard quantum mechanics agrees with it. STRUCTURES AND PARADOXES For PI, ql E ~1 and P2, q2 E ~2 193 we have: (98) £1 II £2 (99) I alE £ 1, a2 E £2, =f 01, a2 =f 02} U {O} (100) { (aI, a2) al where £1 U £2 is called the co-product of £1 and £2. For al E £1 and a2 E £2 we have: nl(al) nl(Ol) n2(a2) n2(02) = (a1,h) if a1"1 01 =0 = (h, a2) if a2 "1 02 = (101) ° (aI, a2) --< (b l , b2) {::} al --< bi and a2 --< b2 (102) 0--< (aI, a2) (103) Ai(aL a~) = (AiaL Aia~) ai (104) if Ai "1 01 and Ai a~ o if Ai = 01 or Ai a~ ai "1 02 = 02 (105) I a1 E 6 (PI), a2 E 6 (P2 )} I PI E K;(al),P2 E K;(a2)} (106) Further we have: .; (PI, P2) = {(a 1 , a2) K;(al' a2) = ((Pl,P2) (107) 9.2. The Covering Law and Compound Entities This structure of (co ) product is the simplest one that can be constructed for the description of the compound physical entity S. One would expect that it is 'the' structure to be used to describe the compound entity S. This is however not the case for quantum entities in standard quantum mechanics. The reason is that the co-product 'never' satisfies two of the axioms of standard quantum mechanics, namely axiom 3 (covering law) and axiom 4 (orthocomplementation). Let us prove this for the case of the covering law. 194 DIED ERIK AERTS Theorem 18. Suppose that axiom 1 and 2 are satisfied and consider two entities Sl and S2 described by state property systems (E1' £1, 6, ~1) and (E2' £2, 6, ~2) and the minimal compound entity S consisting of Sl and S2 and described by the state property system (E1 xE 2 , £1 U £2'~) as defined in theorem 17. Suppose that axiom 3 is satisfied for the entity S, then one of the two entities Sl or S2 has a trivial property lattice consisting only of the minimal and maximal element. Proof: Suppose that £1 has at least one element a1 different from It and 0 1. Since £1 is atomistic there exists an atom C1 -< aI, and at least one atom d 1 -I< a1. Hence C1 =f. d 1. Consider now two arbitrary atoms C2, d2 E £2. We have: (q, C2) -< (C1 V d 1, C2) -< (C1 V d 1, C2 V d2) (Cl V d 1 , C2 V d 2) = (q, C2) V (d 1, d 2) (108) (109) Since (d 1, d2) is an atom of £1 U £2, the property (q, c2)V(d l , d2) 'covers' (Cl' C2), because the covering law is satisfied for S. We therefore have: (q, C2) = (Cl V d1, C2) or (C1 V d 1, C2) = (q V d 1, C2 V d2) (110) This implies that (111) Since C1 =I d 1 we cannot have q = C1 V d 1. Hence we have C2 = d 2. Since C2 and d2 were arbitrary atoms of £2, this proves that £2 contains only one atom. From this follows that £2 = {02, h}. This theorem proves that for two non-trivial entities Sl and S2 the property lattice that normally should represent the compound entity S never satisfies the covering law. This same theorem also proves that, since we know that for an entity described by standard quantum mechanics the covering law is satisfied for its property lattice, in quantum mechanics the compound entity S is 'not' described by the minimal product structure. The covering law, as we remarked earlier already, is the axiom that introduces the linear structure for the state space. This means that for a property lattice that does not satisfy the covering law it will be impossible to find a vector space representation such that the superposition principle of standard quantum mechanics is available. It can be shown that this fact is at the origin of the Einstein Podolsky Rosen paradox as it is encountered in quantum mechanics (see [19] and [20]). It means indeed that the compound entity S, as it is described in standard quantum STRUCTURES AND PARADOXES 195 mechanics, will have additional elements in its state property structure, that are not contained in the minimal product structure that we have proposed here. As we will see, these additional elements are the so called 'non-product states'. 9.3. The Quantum Description of the Compound Entity For standard quantum mechanics the compound entity S consisting of two entities SI and S2 is described by means of the tensor product HI ® H2 of the Hilbert spaces HI and H2 that describe the two subentities 51 and 52. We have studied this situation in detail in earlier work [51]' and will here only expose the scheme. Let us consider an entity 5 described with a state property space (~, L, ~, "') corresponding to the Hilbert space H consisting of two entities 51 and 52 described by state property spaces (~1' L1, 6, "'1) and (~2' L2, 6, "'2) corresponding to Hilbert space HI and H2. Let us first identify the functions m and n that describe the situation where 5 is the joint entity of 51 and 52. For the function n this identification is straightforward. We have: n1 : L1 n2 : L2 --+ --+ L L ap1 ap2 f-> f-> ap1 rg, I2 = n1(apl) ahrg,P2 = n2(ap2) (112) This shows that for standard quantum mechanics, as in the case where we would describe the compound entity by means of the co-product, for each property al of 51 (a2 of 52) there is a unique property n1(a1) (n2(a2)) of 5. The requirement that with each state p of the compound entity 5 correspond unique states PI and P2 of the sub-entities gives rise to a special situation in the case that P corresponds to a non-product vector of the Hilbert space HI ® H2, i.e., a vector c = I:i ci ® c~ that cannot be reduced to a product of a vector in HI and a vector in H2. It can be shown that, taking into account the covariance requirement-this time also for the probabilities-there do correspond two unique states PWI and P W2 to such a state Pc, but when c is a non-product vector, WI and W 2 are density operators that do not correspond to vectors in HI and H2. This means (see theorem 12) that pW1 and PW 2 are non atomic states although Pc is an atomic state. We have not stated this too explicit till now, but in standard quantum mechanics there is a real physical difference between the atomic states-represented by density operators corresponding to a vector-and the non atomic states-represented by density operators not corresponding to a vector. The atomic states are 'pure states' and the non atomic states are 'mixed states'. This is in fact also the case in our operational definition of the join states in (see 196 DIED ERIK AER TS definition 9). The join state of a set of states, as defined there is a mixture of these states, which means that the entity is in the join state of this set iff it is in one of the states of this set, but we lack the knowledge about which one. So we repeat: a mixed state of a set of pure states describes our lack of knowledge about the pure state where the entity is in. If this is the meaning of a non atomic state, hence a mixed state over some set of atomic states, as it is the case in standard quantum mechanics, we can conclude that the entity is always in an atomic state. The non atomic states only describe our lack of knowledge about the atomic state the entity exactly is in. We can now see where the fundamental problem arises with the tensor product coupling procedure of quantum mechanics. If entities are always in atomic states, and since for an atomic state of the compound entity that corresponds to a non-product vector of the tensor product Hilbert space, the component states are strictly non atomic, it would indicate that the sub-entities are not in a state. This is of course very strange. Indeed, it seems even contradictory with the concept of state itself. An entity must always be in a state (and hence a quantum entity always in an atomic state), whether it is a sub-entity of another entity or not. As we have mentioned already, these non-product states also give rise to EPR type correlations between the two sub-entities 8 1 and 8 2. We remark that the presence of these correlations also indicates that the quantum description of the compound entity is not a description of 'separated' entities. So something really profoundly mysterious occurs here. We also mention that it is excluded that the non-product states of the quantum compound entity would be mathematical artifacts of the theory, since entities are without many problems prepared in these non-product states in the laboratory these days27. So the non-product states exists and are real states of the compound entity consisting of two quantum entities. Should we then decide that the sub-entities have disappeared as entities, and only some properties are left? We want to reflect more about this question and investigate what the possibilities are. Most of all we want to put forward an alternative possibility, that is however speculative, but should be worth further investigation. 27 The question about the reality of the non-product states was settled during the second half of the seventies and the first half of the eighties by means of the well known Einstein Podolsky Rosen correlation experiments. Meanwhile it has become common laboratory practice to prepare 'entangled' entities-that is what they are referred to now in the literature-in non-product states. STRUCTURES AND PARADOXES 197 9.4. About Mixtures, Pure States, Non Atomic Pure States We have to remark that the problem that we explained in the foregoing section was known from the early days of quantum mechanics but concealed more or less by the confusion that often exists between pure states and mixtures. Let us explain this first. The reality of a quantum entity in standard quantum mechanics is represented by a pure state, namely a ray of the corresponding Hilbert space. Mixed states are represented in standard quantum mechanics by density operators (positive self adjoint operators with trace equal to 1). But although a mixed state is also called a state, it does not represent the reality of the entity under consideration, but a lack of knowledge about this reality. This means that if the entity is in a mixed state, it is actually in a pure state, and the mixed state just describes the lack of knowledge that we have about the pure state it is in. We have remarked that the deep conceptual problem that we indicate here was noticed already in the early days of quantum mechanics, but disguised by the existence of the two types of states, pure states and mixed states. Indeed in most books on quantum mechanics it is mentioned that for the description of sub-entities by means of the tensor product procedure it is so that the compound entity can be in a pure state (and a non-product state is meant here) such that the sub-entities will be in mixed states and not in pure states (see for example [52]11-8 and [53] p 306). The fact that the sub-entities, although they are not in a pure state, are at least in a mixed state, seems at first sight to be some kind of a solution to the conceptual problem that we indicated in the foregoing section. Although a little further reflection shows that it is not: indeed, if a sub-entity is in a mixed state, it should anyhow be in a pure state, and this mixed state should just describe our lack of knowledge about this pure state. So the problem is not solved at all. Probably because quantum mechanics is anyhow entailed with a lot of paradoxes and mysteries, the problem was just added to the list by the majority of physicists. Way back, in a paper published in 1984, we have already shown that in a more general approach we can define pure states for the sub-entities, but they will not be "atoms' of the lattice of properties [50]. As we have shown already the ray states of quantum mechanics give rise to atoms of the property lattice, such that "pure states' in quantum mechanics correspond to "atomic' states of the state property space. This means that the non atomic pure states that we have identified in [50] can anyhow not be represented within the standard quantum mechanical formalism. We must admit that the finding of the existence of non atomic pure states in the 1984 paper, even from the point of view of generalized quantum 198 DIEDERIK AERTS formalisms, seemed also to us very far reaching and difficult to interpret physically. Indeed intuitively it seems that only atomic states should represent pure states. We know now that this is a wrong intuition. But to explain why we first have to present the other pieces of the puzzle. A second piece of the puzzle appeared when in 1990 we built a model of a mechanistic classical laboratory situation violating Bell inequalities with y2, exactly 'in the same way' as its violations by the EPR experiments [54J. With this model we tried to show that Bell inequalities can be violated in the macroscopic world with the same numerical value as the quantum violation. What is interesting for the problem of the description of sub-entities is that new 'pure' states were introduced in this model. We will see in a moment that the possibility of existence of these new states lead to a possible solution of the problem of the description of sub-entities within a Hilbert space setting, but different from standard quantum mechanics. More pieces of the puzzle appeared steadily during the elaboration of the general formalism presented in [6J. We started to work on this formalism during the first half of the eighties, reformulating and elaborating some of the concepts during these years. Then it became clear that the new states introduced in [54], although 'pure' states in the model, appear as non atomic states in the general formalism. This made us understand that the first intuition that classified non atomic states as bad candidates for pure states was a wrong intuition. Let us present now the total scheme of our possible solution. In the example that we proposed in [54J we used two spin models as the one presented here (the quantum machine) and introduced new states on both models with the aim of presenting a situation that violates the Bell inequalities exactly as in the case of the singlet spin state of two particles do. We indeed introduced a state for both spin coupled spin~ models that corresponds to the point in the center of each sphere, and connecting these two states by a rigid rod we could generate a violation of Bell's inequalities. We have shown in the last part of section 7.4 that the centre of ball is a non atomic state of the quantum machine. This means that we have 'identified' a possible 'non-mixture' state (meaning with 'non-mixture' that it really represents the reality of the entity and not a lack of knowledge about this reality) that is not an atom of the preordered set of states. Is this a candidate for the 'non-mixed' states that we identified in [50J and that were non-atoms? It is indeed, as we prove explicitly in [6J. The states Pw, where wE ball and Iwl < 1, as defined in 7.4, and that are certainly pure states for the quantum machine entity, are represented by the density operator W (w) if we use formula 72 for the calculation of the transition probabilities. STRUCTURES AND PARADOXES 199 9.5. Completing Quantum Mechanics? The idea that we want to put forward is the following: perhaps density operators just do represent pure states, also for a quantum entity. Such that the set of pure states would be represented by the set of density operators and not by the set of rays of the Hilbert space. If this would be the case, the conceptual problem of using the tensor product would partly be solved. We admit immediately that it is a very speculative idea that we put forward here. The problem is also that it will be difficult to test it experimentally on one physical entity. Indeed, the so called new pure states, corresponding to density operators of the Hilbert space, cannot experimentally be distinguished from mixtures of old pure states, corresponding to vectors, and represented also by these density operators. It is an easy mathematical result since in all probability calculations only the density operator appears. We can also see it explicitly on the quantum machine. Let us go back to the calculation that we made in the second part of section 7.4. The 'pure' state Pw corresponding to an interior point w = Al . V + A2 . (-v) of ball is represented by the density operator W( ) w - ( \ /\1 ( /\1 \ \ sm"2 . 20 cos 2 "20 + /\2 \ ) . 0 0 iq, - /\2 sm"2 cos "2 e (\ . 0 sm"2 cos "20 e-iq, ) \ 2 0 /\1 sm "2 + /\2 COS "2 /\1 - \. \ /\2 ) 2 0 But this density operator represents also the 'mixed' state describing the following situation of lack of knowledge: the point is in one of the pure states Pv or P-v with weight Al and A2 respectively. Although this mixed state and the pure state Pw are ontologically very different states, they cannot experimentally be distinguished by means of the elastic experiments. The reason that they cannot is connected to the linearity of the whole scheme. This means that if we would be able to realise a nonlinear evolution of a quantum entity, the distinction between the density operator representing a pure non atomic state (as the interior point of the sphere in the case of the quantum machine) and the density operator representing a mixture (as the weighted average over two diametrically oposed surface points in the case of the quantum machine) could be detected experimentally. Hence, if one quantum entity of a pair of two entangled quantum entities could be experimentally subjected to a nonlinear evolution, it could be tested experimentally whether this quantum entity 'is' in a pure ontologie state, described by the density operator. Indeed, the non-linear evolution will have an effect on the atomic states (the points of the surface of the sphere in the quantum machine example) and an effect on the non-atomic pure states (the interior points of the sphere in the quantum machine example), and both effects will no cancel out in general, like it is the case for a linear evolution, such that it can be 200 DIEDERIK AERTS made out whether the entangled quantum entity - being entangled - is in an ontologically pure state. We invite the experimentalists to engage in this challenge. 10. FRO M E U C LID TOR I E MAN N: THE QUA N TUM MECHANICAL EQUIVALENT We have called section 9 'Paradoxes and failing axioms'. Indeed a possible conclusion for the result that we have exposed in this section is that the covering law (and some of the other axioms) are just no good and that we should look for a more general formalism than standard quantum mechanics. We are more and more convinced that this must be the case, also because it is again the covering law that makes it impossible to describe a continuous change from quantum to classical passing through the intermediate situations that we have mentioned in section 6.4 (see [5], [25] and [27] and the paper of Thomas Durt in this book). In this sense we want to come back now to the suggestive idea that was proposed in the introduction. The idea of relativity theory, to take the points of space-time as representing the events of reality, goes back to a long tradition. It was Euclid who for the first time synthesised the descriptions that the Greek knew about the properties of space by using as basic concepts points, lines and planes. Let us remember that Euclid constructed an axiomatic system consisting of five axioms-now called Euclid's axioms-for Euclidean space and its geometric structure. All classical physical theories have later, without hesitation, taken the Euclidean space as theatre for reality. From a purely axiomatic point of view there has been a long and historic discussion about the independence of the fifth Euclidean axiom. Some scientists have pretended to be able to derive it from the four other axioms. The problem was resolved in favour of the independence by Gauss, Bolyai and Lobachevski by constructing explicit models of non-euclidean geometries. Felix Klein proposed a classification along three fundamental types: an elliptic geometry, the one originally proposed by Gauss, the geometry of the surface of a sphere for example, a hyperbolic geometry, proposed by Lobachevski, the geometry of the surface of a saddle for example, and a parabolic geometry, that lies in between both. It was Riemann who proposed a complete theory of non-euclidean geometries, the geometry of curved space, that was later used by Einstein to formulate general relativity theory. It is interesting to remark that the programme of general relativity-to introduce the force fields of physics as properties of the metric of space-was already put forward explicitly by Riemann. He could however not have found the solution of general relativity because he was looking for a solution in STRUCTURES AND PARADOXES 201 three dimensional space, and general relativity has to be constructed in four dimensional space-time. In the formulation of Einstein, which is the one of Riemann applied to the four dimensional space-time continuum, the events are represented by the points and the metric tensor describes the nature of the geometry. Let us now see whether we can find an analogy with our analysis of quantum mechanics by means of our axiomatic approach. Here the basic concepts are states, properties and probability. In geometry a set of points forms a space. In quantum mechanics the set of states and properties form a state property space. Just like the Euclidean space is not just any space, the state property space of quantum mechanics is not just any state property space. We have outlined 8 axioms that make an arbitrary state property space into a quantum mechanical state property space. We have remarked already that the purpose of quantum axiomatics was not to change standard quantum mechanics, exactly as the purpose of Euclid was certainly not to formulate an alternative geometry: he wanted to construct an operational theory about the structure of space. The operational axiomatization of quantum theory has taken a long time, the axiom of 'plane transitivity' relies on a result of Maria Pia Soler of 1995 [48]. Since however the general problems related to axiomatization are nowadays better known, there has been little discussion about the independence of the axioms. But the physical meaning of certain axioms (e.g., the failing covering law) remained obscure. These were merely axioms introduced to recover the complete Hilbert space structure which includes the linearity of the state space. Within this development of the axiomatic structure of quantum theory it has been shown now that the compound entity of two separated quantum entities cannot be described. The axiom that makes such a description impossible is the covering law, equivalent to the linearity of the state space. Do we have to work out a general quantum theory, not necessarily satisfying the covering law, as Riemann has given us a general theory of space? We believe so, but we know that a lot of work has to be done. In [6] a humble general scheme is put forward that may be a start for the elaboration of such a general quantum theory. ACKNOWLEDGEMENTS I want to thank Jan Broekaert and Bart Van Steirteghem to read and discuss with me parts of the text of this article. Their remarks and suggestions have been of great value. 202 DIEDERIK AERTS AFFILIATION Diederik Aerts, CLEA-FUND, Brussels Free University, Brussels, Belgium 11. REFERENCES [1] von Neumann, J., Mathematische Grundlagen der Quanten-Mechanik, Springer-Verlag, Berlin, 1932. [2] Birkhoff, G. and von Neumann, J., "The logic of quantum mechanics", Annals of Mathematics, 37, 1936, p. 823. [3] Foulis, D., "A Half-Century of Quantum Logic-What have we learned?" in Quantum Structures and the Nature of Reality, the Indigo Book of Einstein meets Magritte, eds.,Aerts, D. and Pykacz, J., Kluwer Academic, Dordrecht, 1998. [4] Jammer, M., The Philosophy of quantum mechanics, Wiley and Sons, New York, Sydney, Toronto, 1974. [5] Aerts, D. and Durt, T., "Quantum. Classical and Intermediate, an illustrative example", Found. Phys. 24, 1994, p. 1353. [6] Aerts, D., "Foundations of Physics: a general realistic and operational approach", to be published in International Journal of Theoretical Physics. [7] Aerts, D., "A possible explanation for the probabilities of quantum mechanics and example of a macroscopic system that violates Bell inequalities", in Recent developments in quantum logic, (eds.), Mittelstaedt, P. and Stachow, E.W., Grundlagen der Exakten Naturwissensehaften, band 6, Wissenschaftverlag, Bibliografisches Institut, Mannheim, 1985. [8] Aerts, D., "A Possible Explanation for the Probabilities of quantum mechanics", J. Math. Phys., 27, 1986, p. 202. [9] Aerts, D., "Quantum structures: an attempt to explain their appearance in nature", Int. J. Theor. Phys., 34, 1995, p. 1165. [10] Gerlach, F. and Stern, 0., "Der experimentelle Nachweis der Richtungsquantelung im Magnetfeld", Zeitschrift fUr Physik 9, 1922, p.349. [11] Bell, J.S., Rev. Mod. Phys., 38, 1966, p. 447. [12] Jauch, J.M. and Piron, C., Helv. Phys. Acta, 36, 1963, p. 827. [13] Gleason, A.M., J. Math. Meeh., 6, 1957, p. 885. STRUCTURES AND PARADOXES 203 [14] Kochen, S. and Specker, E.P., J. Math. Mech., 17, 1967, p. 59. [15] Gudder, S.P., Rev. Mod. Phys., 40, 1968, p. 229. [16] Accardi, L., Rend. Sem. Mat. Univ. Politech. Torino, 1982, p. 24l. [17] Accardi, L. and Fedullo, A., Lett. Nuovo Cimento, 34, 1982, p. 161. . [18] Aerts, D., "Example of a macroscopical situation that violates Bell inequalities", Lett. Nuovo Cimento, 34, 1982, p. 107. [19] Aerts, D., "The physical origin of the EPR paradox", in Open questions in quantum physics, (eds.), Tarozzi, G. and van der Merwe, A., Reidel, Dordrecht, 1985. [20] Aerts, D., "The physical origin of the Einstein-Podolsky-Rosen paradox and how to violate the Bell inequalities by macroscopic systerns", in Proceedings of the Symposium on the Foundations of Modern Physics, (eds.), Lahti, P. and Mittelstaedt, P., World Scientific, Singapore, 1985. [21] Aerts, D., "Quantum Structures, Separated Physical Entities and Probability", Found. Phys. 24, 1994, p. 1227. [22] Coecke, B., "Hidden Measurement Representation for Quantum Entities Described by Finite Dimensional Complex Hilbert Spaces" , Found. Phys., 25, 1995, p. 203. [23] Coecke, B., "Generalization of the Proof on the Existence of Hidden Measurements to Experiments with an Infinite Set of Outcomes", Found. Phys. Lett., 8, 1995, p. 437. [24] Coecke, B., "New Examples of Hidden Measurement Systems and Outline of a General Scheme", Tatra Mountains Mathematical Publications, 10, 1996, p. 203. [25] Aerts, D. and Durt, T., "Quantum, classical and intermediate: a measurement model" , in Montonen C. (ed.), Editions Frontieres, Gives Sur Yvettes, France, 1994. [26] Aerts, D., Durt, T. and Van Bogaert, B., "A physical example of quantum fuzzy sets, and the classical limit" , in the proceedings of the International Conference on Fuzzy Sets, Liptovsky, Tatra mountains, 1993, p. 5. [27] Aerts, D., Durt, T. and Van Bogaert, B., "Quantum Probability, the Classical Limit and Non-Locality", in the proceedings of the International Symposium on the Foundations of Modern Physics 1992, Helsinki, Finland, ed. T. Hyvonen, World Scientific, Singapore, 1993, p.35. 204 DIEDERIK AERTS [28] Randall, C. and Foulis, D., "Properties and operational propositions in quantum mechanics", Found. Phys., 13, 1983, p. 835. [29] Foulis, D., Piron, C. and Randall, C., "Realism, operationalism, and quantum mechanics", Found. Phys., 13, 1983, p. 813. [30] Aerts, D., Colebunders, E., Van der Voorde, A. and Van Steirteghem, B., "State property systems and closure spaces: a study of categorical equivalence", Int. 1. Theor. Phys., to appear 1998. [31] Aerts, D., Colebunders, E., Van der Voorde, A. and Van Steirteghem, B., "Categorical study of the state property systems and closure spaces", preprint, FUND - TOPO, Brussels Free University. [32] Aerts, D. and Van Steirteghem, B., "Quantum Axiomatics and a Theorem of M.P. Soler", preprint, FUND, Brussels Free University. [33] Van Steirteghem, B., "Quantum Axiomatics: Investigation of the structure of the category of physical entities and Soler's theorem", graduation thesis, FUND, Brussels Free University. [34] Piron, C., "Axiomatique Quantique" , Helv. Phys. Acta, 37, 1964, p.439. [35] Amemiya, I. and Araki, H., "A remark of Piron's paper", Publ. Res. Inst. Math. Sci., A2, 1966, p. 423. [36] Zierler, N., "Axioms for non-relativistic quantum mechanics", Pac. J. Math., 11, 1961, p. 1151. [37] Varadarajan, V., Geometry of Quantum Theory, Van Nostrand, Princeton, New Jersey, 1968. [38] Piron, C., Foundations of Quantum Physics, Benjamin, Reading, Massachusetts, 1976. [39] Wilbur, W., "On characterizing the standard quantum logics", . Trans. Am. Math. Soc., 233, 1977, p. 265. [40] Keller, H.A., "Ein nicht-klassicher Hilbertsher Raum", Math. Z., 172, 1980, p. 41. [40] Aerts, D., "The one and the many", Doctoral Thesis, Brussels Free University, Brussels, 1981. [41] Aerts, D., "Description of many physical entities without the paradoxes encountered in quantum mechanics", Found. Phys., 12, 1982, p. 1131. [42] Aerts, D., "Classical theories and Non Classical Theories as a Special Case of a More General Theory", J. Math. Phys., 24, 1983, p. 2441. [43] Valckenborgh, F., "Closure Structures and the Theorem of Decomposition in Classical Components", Tatra Mountains Mathematical Publications, 10, 1997, p. 75. STRUCTURES AND PARADOXES 205 [44] Aerts, D., "The description of one and many physical systems", in Foundations of quantum mechanics, eds. C. Gruber, A.V.C.P., Lausanne, 1983, p. 63. [45] Aerts, D., "Construction of a structure which makes it possible to describe the joint system of a classical and a quantum system", Rep. Math. Phys., 20, 1984, p. 421. [46] Piron, C., Mecanique Quantique: Bases et applications, Presse Polytechnique et Universitaire Romandes, Lausanne, 1990. [47] Pulmannova, S., "Axiomatization of Quantum Logics", Int. J. Theor. Phys., 35, 1995, p. 2309. [48] Soler, M.P., "Characterization of Hilbert spaces with Orthomodular spaces", Comm. Algebra, 23, 1995, p. 219. [49] Holland Jr, S.S., "Orthomodularity in Infinite Dimensions: a theorem of M. Soler", Bull. Aner. Math. Soc., 32, 1995, p. 205. [50J Aerts, D., "Construction of the tensor product for lattices of properties of physical entities", J. Math. Phys., 25, 1984, p. 1434. [51J Aerts, D. and Daubechies, I., "Physical justification for using the tensor product to describe two quantum systems as one joint system", Helv. Phys. Acta 51, 1978, p. 661. [52J Jauch, J., Foundations of quantum mechanics, Addison-Wesley, Reading, Mass, 1968. [53J Cohen-Tannoudji, C., Diu, B. and Laloe, F., Mecanique Quantique, Tome I, Hermann, Paris, 1973. [54J Aerts, D., "A mechanistic classical laboratory situation violating the Bell inequalities with yI2, exactly 'in the same way' as its violations by the EPR experiments", Helv. Phys. Acta, 64, 1991, p. 1. THOMAS DURT ORTHOGONALITY RELATIONS: FROM CLASSICAL TO QUANTUM INTRODUCTION We developed in Brussels a heuristic model, the E-model, in which it is possible to simulate a continuous transition between a classical (deterministic) and a quantum (probabilistic) regime. The lattice of properties related to this model was intensively studied [3, 5 to 10] and it appears that it exhibits a continuous transition between the classical and the quantum lattices. These lattices are representative of classical situations at one side and of quantum situations at the other side. It is not presently known if a continuous transition between the classical, deterministic, and the quantum, fuzzy, regimes does occur in nature. Even if this would occur, no one exactly knows the border-line between the two regions. This is an aspect of the so-called problem of measurement which deals with the comprehension of the way in which the macroscopic, deterministic and sharp world to which observers and measuring apparata belong coexists with the microscopic, unsharp, quantum world. In our model, we assume that "hidden variables" are present at the level of the apparatus, which is a non-standard hypothesis. The reader may thus consider this paper as a good example of how speculative ideas can be implemented in the framework of quantum logics. In particular, we shall study two ways of defining an orthogonality relation inside the set of states of the system described in the E-model. In lattice theory, the orthogonality relation is important at least for two reasons: - It is possible to associate to an orthogonality relation what is called an orthocomplementation, as we shall show it. The existence of an orthocomplementation is postulated in many attempts made in order to generalise the classical (Boolean) paradigm. For instance it is an unavoidable element of the representation theorem of Piron [18 to 20]. - The orthocomplementation can be interpreted as a generalised logical negation. We study here three orthogonality relations in the framework of the Emodel, show that they evolve continuously during the classical-quantum 207 © 1999 Kluwer Academic Publishers. 208 THOMAS DURT transition, and discuss whether the lattice of properties is orthocomplemented by them. 1. THE E-MODEL From now on, we shall consider that the state of the system is represented by a point on the surface of the unit 3-dimensional sphere (the so-called Poincare-sphere). We invite the interested reader to consult the appendix where we present some details about the connections which exist between a spin 1/2, which is described in a 2-dimensional Hilbert space, and its representation on the surface of the Poincare-sphere. For information, the properties of a 2-dimensional Hilbert space as well as the lattice of properties associated to it are discussed in great detail in the paper of D. Aerts (same volume). All what is necessary for our present purposes is that this direction on the sphere somehow represents the direction along which the system that we consider here, which could be a particle, "turns around itself" . For instance, the spin of the Earth, if our system was the Earth, would be represented by the North Pole. If the Earth was spinning in the opposite direction (with the Sun rising in the West), its spin would be represented by the South Pole. 1.1. Guiding Principles of the f.-Model The f.-model aims at simulating the interaction which occurs during a measurement process. It is based on the following guiding principles: - The measuring apparatus is essentially classical and interacts with the observed system in a deterministic way. - The state of this apparatus undergoes statistical fluctuations, responsible for a dispersion in the results of measurement [2J. - The amplitude of these fluctuations is quantified by a real parameter f. comprised between 0 and 1. - When f. equals 0, the fluctuations vanish, when f. equals 1, they are maximal. - The classical limit is reached when we can neglect these fluctuations (f. equals 0); intuitively, this means that the scale of the disturbance caused by the measuring apparatus is smaller than the scale of the observed system. - When the scale of the disturbance of the measuring apparatus is larger than the scale of the observed system, f. is not negligible anymore, the dispersion of the experimental results increases, and when f. equals 1, we recover a distribution of results equal to the quantum one. ORTHOGONALITY RELATIONS 209 1.2. Simple Version of the f.-Model Let us briefly describe how the model works (this is an ultrasimplified version, see [3, 6, 16] for more sophisticated presentations). We advice the readers who are not familiar with this kind of models to consult the article of D. Aerts (same volume), where the measurement process is illustrated by the "metaphor of the elastic" that we summarize now. The measurement process is assumed to occur in two steps: the particle "falls" on an elastic rod that joins the poles p and -p and sticks there for a while; thereafter, the elastic, which is assumed to be fragile in a certain zone 1, breaks at random in this zone, and the part of the elastic on which the particle is located projects the particle on the pole to which it is attached. A picture and detailed explanations are given in the article of D. Aerts. Let us now give, in an abstract style, a presentation which contains the essential features of the model. - The f.-model describes a generalised spin-measurement. - The state of the generalised spinning particle is represented by a point q of the Poincare-sphere (a direction). The set of states of the system is thus the surface of the Poincare sphere. - The generalised Stern-Gerlach device is represented by another point p of the Poincare-sphere and a hidden variable cpo - We assume that the hidden variable cP is submitted to uncontrollable fluctuations, so that it is a random variable homogeneously spread over the real interval [CPmin, CPmaxl (with 0 ~ CPmin ~ CPmax ~ 1). - The measurement process forces the state to collapse towards two possible directions: the direction of the device p, or its opposite -po We shall then say that the spin of the system is "up" or "down" along p. - Let us denote () the angle between p and q. We suppose now that the result of the measurement is determined as follows: - If cP appears to be strictly less than cos2~, the result of the measurement is "spin-up" and the state q collapses on p. - If cP appears to be greater than or equal to cos 2 ~, the result of the measurement is "spin-down" and the state q collapses on -po We obtain after averaging on the variable cP, for each value of the pair (CPmin, CPmax) , a transition probability which depends on the angle between the initial and the final states (before and after the measurement process). The probability of getting spin-up P(plq) can be shown to depend on the angle () between p and q as follows: 1 In an interval of width 2</>max - 2</>min centered around a point located at a distance </>min + </>max from the south pole in our case. 210 THOMAS DURT - It is equal to 1 when 0 ::; B < Bup , 0 when 7r 2 B 2 7r - Bdown' where cos2~ = cPmax and sin2~ = cPmin. The subscripts up and down are justified by the fact that these angles are the extents of the classical zones (probability 0 or 1) around the poles in which the spin measurements are predetermined with a probability equal to hundred percent. - In between it is superposition of the two possible results, in a zone of angular opening Bsup. The subscript sup is justified by the fact that Bsup is the extent of the superposition zone (probability neither 0, nor 1) between the classical zones. We have, in the superposition zone (see pictures at the end of the paper), that: P(plq ) = cosB + COSBdown + COSBdown· cosBup (1) The three angles Bsup , Bup , Bdown fulfill the following relation: ()sup + + Bdown = 7r. We define the classical (deterministic) and quantum limits of the model by imposing that Bsup is respectively equal to zero and 7r. Bup p -p Figure 1. The up and down zones around p. The classical case corresponds to a purely deterministic situation (the probability of getting spin up takes only values 0 or 1), while in the quantum case we recover the quantum probability associated to the measurement of the spinning direction of a spin 1/2 particle 2. Historically 2 The idea to associate hidden variables to the apparatus in the quantum context was originally presented in [2]. ORTHOGONALITY RELATIONS 211 [3], the parameter E which is at the origin of the name of the model (see section 1) was introduced firstly. The relation with the angles introduced here is given by the following relation: sin( (}s2 P ) = E. 2. THE IMPORT ANCE OF ABSOL UTE TRUTH An important aspect of the lattice theory (at least in the Geneva formulation [18 to 20, 1]) is that in order to deduce from experimental data the "propositions", which are the objects studied in the Geneva approach to quantum logics, all the relevant information is contained in the socalled eigenstate sets, the sets of states for which the results of a given measurement can be deterministically foreseen (with probability one or zero). For instance, in the E-model, the eigenstate sets of the property "the spin is up (down) when we measure it along the direction p" are spherical sectors of opening ()up (()down) along p (-p). We will not explain here the procedure which allows us to build the lattice of properties once we know the eigenstate sets of all the experiments that can be performed on the system, but it is useful to draw the attention of the reader on this fact: when the lattice is built, on the basis of experiments, all the states which are not eigenstates of one of the outcomes are simply discarded and do not contribute in any manner to the properties of the structure. This remark is trivial in classical logics, because in classical physics the results of arbitrary measurements are assumed to be "sharply" predetermined, so to say, determined with absolute certainty (probability one or zero) whatever the state of the system is. This is no longer true in quantum mechanics where new kinds of states appear, the so-called superposition states, for which all what we can predict is the probability (neither zero nor one) assigned to the different possible outcomes of the measurement. This major difference between the classical and quantum worlds is reflected in the E-model by the existence of a superposition zone between the eigenstate sets, which are spherical sectors of angular opening ()up, (()down) around the direction of the generalised spin measurement (its antipode). Quantum logics, in the Geneva approach, is thus an attempt made in order to generalise classical logics by considering only the deterministic part of probabilistic theories (only the states for which propositions are absolutely true or false, so to say the states which realise an outcome of a given experiment with probability 1 or 0, are considered during the building of the lattice). Note that for experimental physicists, these superposition states are important because the probabilities of occurrence assigned to the different outcomes of which they are superposition can be measured through repeated measurements and contain relevant information about the phys- 212 THOMAS DURT ical properties of the system (dynamics of the system, the influences that it underwent before the measurement and so on). The fact that in quantum logics one can discard all the information contained in intermediate probabilities is related to a property of Hilbert space quantum mechanics that we will sketch here without entering into technicalities [22, 23]: once we know the eigenstate set of a particular outcome of a given measurement, the projection postulate makes it possible to compute all the probabilities assigned to this possible outcome of the measurement, for arbitrary initial states of the system. Considered so, intermediate (strictly between 0 and 1) probabilities are thus second hand properties which can be recovered once we know the states which realise a given outcome with probabilities 0 or 1. This is a peculiarity of quantum mechanics (and also of the E-model): tell me what is certain (actual) and I can quantify the uncertain (potential). This is why the lattice of properties of these systems contains potentially all the information that comes from experiments. Note that these considerations are trivial in the classical regime where the uncertain simply vanishes. We will now show how it is possible to generalise the classical concept of negation on the basis of the knowledge of the eigenstate sets of the experimental outcomes that can be obtained by submitting a system to measurements. This generalised negation will thus be deduced from the knowledge of the states which realise experimental outcomes with probability 0 or 1 only, a survival of the special status given to "absolutely true and certain" propositions in classical logics. 3. THE ORTHOGON ALITY RELATION 3.1. The Negation Let us consider the following example: the system that we study is a car, that we select among a given population of cars. For one or another reason, we study the colour of this car. We classify the colours into a set which contains eight elements: black, white, yellow, orange, red, purple, blue, and green. This would correspond to what we call experimental outcomes in the Geneva approach. The part of the population that has a given colour would correspond to what we call state in the Geneva approach. Normally, it is easy to assign unambiguously one of these colours to a car arbitrarily chosen in the population. In this case, the result of a measurement is deterministic, and the negation of a property is well defined: if a car is not red, then it is black or white or yellow or orange or purple or blue or green, and we made use here of an "exclusive or". We shall from now on refer to this situation as the sharp one. ORTHOGONALITY RELATIONS 213 In reality, things could be more complicated and some subjectivity could appear: if a car is grey, is it white or black?; if it is dark orange, is it (pale) red or orange? The answer to such questions could for instance depend on the person who realises the test, or even on the meteorological conditions (grey or sunny sky) during the test. If we do not control the choice of these parameters, we are in a situation similar to the one described in the €-model, with superposition states (here intermediate colours). We shall refer to this situation as the unsharp one. What are then, in the unsharp situation, the subpopulations (states) that we can classify according to the test formulated by the property: "This car is not red?" In a first approach, one could say that if we submit a given car to the test of colours and that we get a positive answer to the question "the car is red" , even with a very low probability of occurrence, we may not say that this car is not red. Now, what can we do if the car is grey? According to our first approach, such a car is not red, but if the car is grey with a touch of red, so that some confusion is still possible, we may not say that our car is not red. As we see, the situation becomes complex because we have in this approach infinite number of different states (colours). Now, it could happen (this depends on the experimental protocol) that an embarassed tester simply does not answer to the test when he doubts the answer, in which case we would have nine possible colours (the eight forementioned colours plus the "indefinite" colour). In this second approach, a car is not red except when it is either red or of indefinite colour. There would be nine different states in this case, and the "negation of the red state" would be, according to the previous suggestion, true when the state is black or white or yellow or orange or purple or blue or green. These two ways of defining the negation in a fuzzy context suggest that we can define the negation as a relation between states. The example furnished by the classification of cars in function of their colour suggests that, in the sharp situation, this relation would be the relation of difference: if a car is not red, then it is black or white or yellow or orange or purple or blue or green, as we noticed it already, which means in this case that the "negation of the state red" is true in the set of all the states which differ from the red state. Furthermore, the state (colour of a given car) can be checked at once, in a unique experiment, because no probabilistic element is present here. Note that the difference is a symmetric and antirefiexive relation: if p differs from q, then q differs from p and p does never differ from itself. This corresponds to the classical logical properties: "if a proposition A is not true whenever the proposition B is true, then the proposition B is 214 THOMAS DURT not true whenever A is true", and "the proposition cannot possibly be true and not true simultaneously". We will now generalise this relation to unsharp situations and represent the generalised negation thanks to a symmetric and antireflexive relation: the orthogonality relation. The question to know if such generalisations are formal speculations or really mean that the logics itself must be generalised [12] is let to the appreciation of the reader. For reasons that we will clarify later, when this orthogonality relation is the relation of difference (two states are orthogonal iff they are different), we will say that the orthogonality is Boolean. The previous discussion suggests that in the unsharp situation at least two definitions can be adopted for deducing an orthogonality relation from the knowledge of experimental data. Let us firstly study one of them, the Aerts orthogonality. 3.2. The Aerts Orthogonality D. Aerts [1] proposed to deduce an antireflexive and symmetrical function from the experiments in the following way. Definition 1: Two states are A-orthogonal iff they are eigenstates of two different outcomes of the same experiment. This relation is clearly symmetrical, and antireflexive because in general an experiment performed on a given system prepared in a given state cannot have two different outcomes at the same time. 3.3. The E-Orthogonality Let us now present a second definition of the orthogonality relation originally introduced in [16]' that we shall call the E-orthogonality, because it can be shown [16] that this is a definition which is perfectly adapted to the E-model. Definition 2: Two states are E-orthogonal iff there exists at least one experiment that does not induce a transition from one of the states to the other. In other words, the probability of transition from one state to the other is zero for at least one experiment. This definition was inspired by the work of Grib and Zapatrin who proposed the following orthogonality relation [4, 16, 17]: two states p, q are G-Z-orthogonal iff no experiment exists which induces a transition from one of the states to the other. It can be shown that this definition is not convenient in the deterministic limit, so that we shall not discuss it in details (see [16J for more information). The definition of the G-Zorthogonality as well as the definition of the E-orthogonality implicitly ORTHOGONALITY RELATIONS 215 presuppose some kind of collapse ("jump" between different states). The hypothesis of the existence of a collapse is of quantum inspiration and is not always realised, so that the G-Z and €-orthogonality relations are less general than Aerts orthogonality. Nevertheless, in the €-model, we recover the quantum property that after a spin measurement along a direction, the state of the system aligns itself (collapses) along this direction or its antipode. We can thus study the €-orthogonality in this framework. Note that the E-relation is given in a form symmetrical in p, q, because the probability of transition is symmetrical in general (in quantum mechanics, this is an aspect of microreversibility). Furthermore, it is antirefiexive, because in general the probability of transition of a state towards itself is equal to 1. Remark also that, in accordance with the discussion of the second section, the two definitions of the negation presented here require the knowledge of the "non-superposition" states (which realise an outcome with probability 1 or 0) only: the A-orthogonality is based on the eigenstate sets of outcomes (probability 1), the €-orthogonality is based on the impossible transitions (probability 0). This illustrates that quantum logics in the Geneva approach can be considered as an attempt to build a mathematical structure which generalises the classical (Boolean) structure but is still built on the basis of the "actual, absolutely true and certain" propositions which survive in probabilistic theories 3 . 4. THE A AND EORTHOGONALITY RELATIONS AND THE E-MODEL Before discussing the orthogonality, it is useful to recall some properties of the €-model: - The measurement process forces the state to collapse along two possible directions: the direction of the device p ("up"), or its opposite -p ("down"). - The probability of getting spin-up P(plq) depends on the angle () between p and q only and yields 1 when 0 ::; () < ()up, 0 when 7l' ~ () ~ 7l' - ()down' - In between it is superposition of the two possible results, in a zone of angular opening ()sup (with sin(oS2 P ) = E). We can now apply the definition of the A-orthogonality. We obtain the following: 3 This is also true for what concerns the Birkhoff-von Neumann approach [12). 216 THOMAS DURT Theorem 1: Two states p and q are A-orthogonal iff they make an angle superior or equal to ()sup when € # 0, iff they are different when ()sup = € = o. The proof, of technical nature, is given in [16]. As a consequence, the set of states A-orthogonal to a given state is a spherical sector around the antipode of this state, of opening 1[" - ()sup, closed if € # 0, and open in the deterministic case (€ = 0; ()sup = 0). It is then the Boolean (set-theoretical) complement of the state and we recover the classical relation (of difference) in this case. In the quantum case, the fuzziness is maximal and the orthogonal of a state is reduced to a unique point: its antipode. p ......... .......... ............. .............. ................. .................. ..................... ...................... ......................... .......................... ............................. .............................. ................................. ................................ ............................... ............................... ............................... .............................. ............................. ............................ ........................... .......................... ......................... ........................ ....................... ...................... ..................... .................. ................. .............. ·······;.;.;.·····i p A Figure 2. The A-orthogonal of p. If we apply the definition of the €-orthogonality, we obtain the following: Theorem 2: - When € # 0, the €-orthogonals of a state are closed spherical sectors of opening max(()down , ()up) around the antipode of this state. - When € = 0 and ()down ~ ()up, the €-orthogonals of a state are closed spherical sectors of opening ()down around the antipode of the state. - When € = 0 and ()down < ()up, the €-orthogonals of a state are open spherical sectors of opening ()down around the antipode of the state. A first difference appears relatively to the A-orthogonality: the deterministic limit does not necessarily imply that the €-orthogonality relation is ORTHOGONALITY RELATIONS 217 the difference (the Boolean relation). This is true only if (}down = O. Note that the E-orthogonality relation can be tested by a unique experimental configuration: if one places the measuring apparatus along the direction of the state when (}down 2: (}up (of its opposite otherwise), the states which collapse along the opposite with probability one belong to a sector of opening (}down (Bup) , so to say, they are orthogonal to this state. This property, which presents much interest [16] at the level of the lattice of properties is not fulfilled in the case of the A-orthogonality relation. Nevertheless, both definitions coincide in the quantum case. p 1t - max (S down,S up) Figure 3. The E-orthogonal of p. 4.1. The Orthocomplementation In the Geneva approach to quantum logics, the propositions are in general in one to one correspondence with the sets of states for which these propositions are actual (true with probability one). It is thus necessary, if one wants to generalise the classical negation, to define it as a relation between sets of states, rather than a relation between states. This can be realised as follows. Let us assume that we know a relation * between the states (the set of states is denoted ~), such that this relation is symmetric and antireflexive: Vp, q E ~, : p * q => q * p and II p E ~ : p * p. (2) Then, we can associate to any subset K of ~ its orthogonal defined by: K* = {p E ~ : p * q , V q E K} (3) 218 THOMAS DURT It is possible [13] to build, on the basis of the orthogonality relation *, what is called an orthocomplementation, defined as follows. Let us consider a set of subsets of E, denoted X (X c P(E)), partially ordered by set-inclusion in E, then X is said to be orthocomplemented if there exists a bijection', from X to X, such that VA, B EX: i) A c B "* B' c A' ii) (A')' = A iii) A n A' = 0 (4) Thanks to the fact that in our approach, the propositions are in one to one correspondence with subsets of the set of states (the sets of states which "actualise" the proposition), we will say that a lattice is orthocomplemented if an orthocomplementation relation exists between the sets of states which represent the proposition (this representation is called the Cartan map of the lattice). For a Boolean lattice, by definition (section 3.1), two states are orthogonal iff they are different. This implies that the complement of a set is its Boolean complement or set-theoretical complement. In Boolean logics, this complementation is associated to the classical logical negation. Note that it is possible to build a set of subsets of the set of states [13] which is orthocomplemented under the set orthogonality *. In order to build this set, one must consider the map (denoted here clorth in accordance with the conventions made in the refs. [6, 16]) from the set of states E onto itself defined by: clorth(K) = (K*)*. If one defines Forth as the subset of P(E) which is self-closed under clorth: Forth = {F E P(E) : clorth(F) = F}, it is possible to show [13] that Forth is orthocomplemented under the set-orthogonality *. This orthocomplementation generalises the properties of the Boolean complementation and presents a formal analogy with some laws of classical logics. For instance, the three properties (i, ii, iii) which enter the definition of an orthocomplementation generalise the following classical properties: i) if A implies B, the negation of B implies the negation of A ii) the negation of the negation of A is A itself iii) A is never true and false simultaneously It can be shown that the Cartan map of the lattice forms what is called a closure-structure (see [11, 14, 15] for a definition), and that this is also true for Forth. When these structures are identical, then, the lattice will be said to be orthocomplemented. ORTHOGONALITY RELATIONS 219 We also mention without proof the following theorems relative to the orthocomplementation of the lattice of properties of the E-model under the A and E-orthogonality relations [16]. Theorem 3: The lattice of properties deduced from an E-probability distribution characterised by the given of the angles (Oup , Odown) is A-orthocomplemented (so to say, the Cartan map of the lattice of properties is equal to Forth) iff Odown = o. Theorem 4: The lattice of properties deduced from an E-probability distribution characterised by the given of the angles (Oup , Odown) is E-orthocomplemented for arbitrary values of Oup and Odown. This shows that the E complementation is very well adapted to the E-model, which justifies its name. 5. APP ENDIX: PROP ER TIES OF THE S PIN ONE HALVE. Let us recall some properties of the spin 1/2. - It is defined in a two-dimensional Hilbert space and can always be expressed as a normalised superposition of a spin-up and a spindown state along an a priori direction z: 1'!fJ) = al +) + (31 - ), lal 2 + 1(31 2 = 1. (5) - Then, the spin-measurement with a Stern-Gerlach device oriented where along z is described by the observable ~O'z, = (I + )( + I) - -I)· (6) - The Pauli mapping maps bijectively the set of physical states onto the unit 3-sphere, according to the following transformation law: O'z (1-)( 1'!fJ) - t n = (2Re a* (3, 21m a* (3, lal 2 - 1(31 2) (7) (in Cartesian coordinates) and conversely: o _!::e. 2 (8) 2 where 0, cp are the polar angles of n. This allows us to visualize all states on the sphere ; the up-state for instance is sent on the north pole, the down-state on the south pole. a = cos -e ACKNOWLEDGEMENTS This work was realised in the framework of a project of the Flemish Funds of Scientific Research (F.W.O.). Some parts of it were already present in the reference [16]. It was finished during an academical visit in Gdansk, realised in the framework of the bilateral Flemish-Polish 220 THOMAS DURT project 007. Thanks to J. Pykacz for his constructive and destructive comments. AFFILIATION Thomas Dun, FUND, Brussels Free University, Brussels, Belgium REFERENCES [1] Aerts, D.: "The One and the Many", Doctoral Dissertation, Brussels Free University, Brussels, 1981. [2] Aerts, D.: "A possible explanation for the probabilities of quantum mechanics", J. Math. Phys., 27, 1986, p. 203. [3] Aerts, D., Durt, T., and Van Bogaert, B.: "A physical example of quantum fuzzy sets and the classical limit", Tatra Mountains Math. Publ., 1, 1992, p. 5. [4] Aerts, D., Durt, T., Grib, A.A., Van Bogaert, B., and Zapatrin, R.R.: "Quantum structures in macroscopic reality" , Int. J. Theor. Phys., 32, nO 3, 1993, p. 489. [5] Aerts, D., Durt, T., and Van Bogaert, B.: "Indeterminism, nonlocality and the classical limit", in uProceedings of the Symposium on the Foundations of Modern Physics, Helsinki, August 1992, World Scientific Publishing Company, Singapore, 1993, p. 154. [6] Aerts, D. and Durt, T.: "Quantum, classical and intermediate. A measurement model", in uProceedings of the Symposium on the Foundations of Modern Physics", Helsinki, August 1993, World Scientific, Singapore, 1994, p. WI. [7] Aerts, D.and Durt, T.: "Quantum, classical and intermediate. An illustrative example", Found. of Phys., 24, 1994, p. 1407. [8] Aerts, D., Aerts, S., Coecke, B., D'Hooghe, B., Durt, T., and Valckenborgh, F.: "A model with varying fluctuations in the measurement context", in New Developments on Fundamental Problems in Quantum Physics, Ferrero et al., eds, Kluwer, Dordrecht, 1997, p. 7. [9] Aerts, D., Coecke, B., Durt, T., and Valckenborgh, F.: "Quantum, classical and intermediate I: A model on the Poincare sphere", in uProceedings of the 4th Winter School on Measure Theory", Eds. A. ORTHOGONALITY RELATIONS 221 Dvurecenskij and S. Pulmannova, Tatra Mountains Math. Publ., 10, Bratislava, 1997, p. 225. [10] Aerts, D., Coecke, B., Durt, T., and Valckenborgh, F.: "Quantum, classical and intermediate II: The vanishing vector space structure", in "Proceedings of the 4th Winter School on Measure Theory", Eds. A. Dvurecenskij en S. Pulmannova, Tatra Mountains Math. Publ., 10, Bratislava, 1997, p. 24l. [11] Beltrametti, E. and Cassinelli, G.: "The Logic of Quantum Mechanics", Addison-Wesley Publishing Company, 1981. [12] Birkhoff, G. and von Neumann, J.: "The logic of quantum mechanics", Annals of Mathematics, 37, 1936, p. 823. [13] Birkhoff, G.: "Lattice Theory", third edition, Amer. Math. Soc., Colloq. Publ. Vol. XXV, Providence. [14] Crapo, H.H. and Rota, G.C.: "Geometric lattices" in "Trends in lattice Theory", ed. Abbott J.C. Van Nostrand-Reinhold, New York, 1970. [15] Crapo, H.H. and Rota, G.C.: "On the foundations of combinatorial theory (II)" in "Studies in Appl. Math. ", 49, 1970, p. 109. [16] Durt, T.: From quantum to classical, a toy model, Doctoral Dissertation, Brussels Free University, 1996. [17] Grib A.A. and Zapatrin R.R.: Int. Journal of Theor. Phys., 29, (2), 1990, p. 113. [18] Piron, C.: "Axiomatique quantique", Helv. Phys. Acta, 37, 1964, p.439. [19] Piron, C.: "Foundations of Quantum Physics", W.A. Benjamin, Inc., 1976. [20] Piron, C.: "Mecanique Quantique, Bases et Applications", Presses Polytechnique et Universitaire Romandes, 1990. [22] von Neumann, J.: "Grundlehren, Math. Wiss. XXXVIII", 1932. [23] von Neumann, J.: "Mathematische Grundlagen der Quanten-mechanik", Springer-Verlag, Berlin, 1932. 222 THOMAS DURT APPENDIX: SOME PICTURES: THE TRANSITION PROBABILITY E- We present here the graphs of the probabilities of transition for some values of Oup and 0down' The first graph corresponds to the quantum situation, and the second and third graphs correspond to the deterministic situation. 0.8 0.6 0.4 0.2 0.5 1.5 2.5 Figure 4. Probability distribution when ()up 3 = 0, the quantum case. 0.8 0.6 0.4 0.2 I.S 0.5 Figure 5. ()up 2 2.5 = ()down = 7r /2. 0.8 0.6 0.4 0.2 0.5 Figure 6. I.S ()v.p 2.S = 7r. 3 ORTHOGONALITY RELATIONS O.S 0.6 0.4 0.2 0.5 2.5 1.5 Figure 7. ()up = ()down = 1r /8. 0.8 0.6 0.4 0.2 0.5 Figure 8. 1.5 (}up /2, 2 2.5 = 7r = 1r /4, (}down (}down 3 = 7r /4. 0.8 0.6 0.4 0.2 0.5 Figure 9. 1.5 (}up 2 2.5 3 = 7r /2. 0.8 0.6 0.4 0.2 0.5 Figure 10. 1.5 (}up = 7r/6, 2 2.5 (}down 3 = 7r /3. 223 M.L. DALLA CHIARA & R. GIUNTINI QUANTUM LOGICAL SEMANTICS, HISTORICAL TRUTHS AND INTERPRETATIONS IN ART The basic features of classical logical semantics can be summarized as follows: 1) Truth behaves like the knowledge of an omniscient mind, which is at the same time non contradictory and complete. 2) Meanings behave in an atomistic and compositional way: the meaning of a whole is determined by the meaning of its parts. 3) Meanings are non ambiguous and sharp. All this renders classical semantics hardly applicable to an adequate analysis of natural languages and of artistic contexts, where holistic and ambiguous features seem to playa relevant role. However, formal semantics is not necessarily bound to classical logic. In contemporary logical research, the classical notion of truth has been transformed according to different views. For instance, truth can be identified with what is known by non omniscient minds in a universe that may be either deterministic or indeterministic. The second choice represent the starting point of a semantic characterization of quantum logic. As a consequence one obtains examples of formal semantics that permit us to model ambiguous and holistic situations. In such a framework, even the classical non contradiction principle can be violated. Have these semantic considerations any bearing for a possible formal analysis of interpretations in artistic contexts? We will refer to the case of music, and will try and answer the following question: is it possible to recognize a kind of abstract nucleus that might be common to scientific theories and musical compositions? Birkhoff and von Neumann began their famous article "The Logic of quantum mechanics" [1 J with the following claim: One of the aspects of quantum theory that has attracted the most general attention, is the novelty of the logical notions which it presupposes. After sixty years such a novelty seems to have largely crossed the boundaries of microphysics. From an intuitive point of view the basic features of classical semantics (developed by Leibniz, Frege, Tarski ... ) can be summarized as follows: 1) Truth behaves like the knowledge of an omniscient mind, which is at the same time non contradictory and complete. 225 © 1999 Kluwer Academic Publishers. 226 M . L. D ALL A CHI A R A & R. G I U N TIN I As a consequence: a) any problem is semantically decided: for any sentence A, either A or not A is true (Tertium non datur); b) A sentence A and its negation not A are not at the same time true (Non contradiction principle). 2) Meanings behave in an atomistic and compositional way: the meaning of a whole is determined by the meanings of its parts. 3) Meanings are non ambiguous and sharp. All this renders classical semantics hardly applicable to an adequate analysis of natural languages and of artistic contexts, where holistic and ambiguous features seem to playa relevant role. In this connection, one might refer to a number of different examples. For instance, let us think of the final verse of a famous poem, L'Infinito (Infinity), by Giacomo Leopardi: E'l naufragar m 'e dolce in questa mare (And drowning in this sea is sweet to me). The poetic result seems to be essentially connected with the following semantic relation: the meanings of the component expressions "naufragar" (drowning), "dolce" (sweet), "mare" (sea) do not correspond here to the most common meanings. By the way, there is no sea in Recanati, Leopardi's native village which the poem refers to. However the usual meanings of our expressions are somehow present and ambiguously correlated with the metaphorical meanings that are evoked by the whole poem. Needless to say, this represents a quite typical semantic situation in poetry. In contemporary logical research, the classical notion of truth has been transformed according to different views. For instance, truth can be identified with what is known by non omniscient minds in a universe that may be either deterministic or indeterministic. The first choice is compatible with the intuitionistic approaches to logic and mathematics, whereas the second choice represents the basic assumption of the quantum logical investigations. In both cases, the classical notion of truth a sentence A is true under a given interpretation has been replaced by the following relation: a given information forces us to assert the truth of a sentence A. QUANTUM LOGICAL SEMANTICS 227 All this can be conveniently described by using the metaphor of possi- ble worlds. What is a possible world? In the framework of an epistemic conception, a possible world can be regarded as a piece of information about a possible state of affairs: for instance, what is known by an observer about a physical system. The basic semantic relation is usually represented as follows: i pA And is read: "information i forces sentence A" . Should i represent a non contradictory and complete system of information, our forcing relation would naturally collapse into the classical notion of truth. However, knowledge is in general not complete and not necessarily consistent. Further, an information may essentially refer to other correlated information systems. As a consequence, it is expedient to suppose a whole set of pieces of information: 1= {i,j, k, ... } that admit of different correlations. In the technical jargon of possible world semantics, a correlation between pieces of information (or possible worlds) is usually called an accessibility relation. How can Birkhoff and von Neumann's quantum logic be described in the framework of a possible world semantics? The basic intuitive idea is the following: any information i represents the observer's knowledge about the quantum system under investigation (for instance the electron of a given hydrogen atom). As a limit case, i might correspond to a non contradictory maximal knowledge: in other words, i cannot be consistently extended to a more precise information. Pieces of information of this kind are usually called, both in classical and in quantum physics, pure states. In the formalism of quantum theory, pure states are mathematically represented as particular vectors in an appropriate abstract space (which is called a Hilbert space). Accessibility here means logical compatibility: i and j are accessible (or non orthogonal) if and only if they do not have contradictory consequences. On this basis, the accessibility relation turns out to be reflexive (any information i is accessible to itself) and symmetric (if i is accessible to j then j is accessible to i). What about the forcing conditions? As to the atomic sentences (that cannot be decomposed into simpler sentential parts) the following condition is assumed: i forces a sentence A if and only if any information j accessible to i has an accessible information k that forces A. 228 M . L. D ALL A CHI A R A & R. G I U N TIN I This condition represents a kind of stability requirement. Let us now consider the case of compound (molecular) sentences that are constructed by composing atomic sentences by means of logical connectives (negation, conjunction, disjunction). The behaviour of negation is governed by the following condition: an information i forces a sentence not A if and only if i has no accessible information j that forces A. In other words: i forces not A when it is impossible to transform i into a compatible information j that forces A. As a consequence, an information i will not necessarily decide a sentence A that may remain strongly indetermined for i: i forces neither A nor not A. The semantic tertium non datur fails, in accordance with the probabilistic character of quantum theory. What about conjunction and disjunction? Conjunction has a classical behaviour, governed by the usual truth-table. In other words: an information i forces A and B if and only if i forces both members A,B. At the same time, like in classical logic disjunction is supposed to be defined in terms of negation and conjunction by means of the so called de Morgan law: i forces A or B if and only if i forces not (not A and not B). Since negation does not have here the usual classical meaning, the characteristic truth-table for the connective or turns out to be violated: forcing the truth of a disjunction does not mean forcing the truth of at least one member As a consequence, the following situation is possible: i forces A or B; however i forces neither A nor B! QU ANTUM LOGICAL SEMANTICS 229 This peculiar behaviour of disjunction corresponds to a fairly typical quantum situation. In quantum theory on can mention a number of examples where an alternative is determined and true, even if both members are strongly undetermined (and hence not true). For instance, any electron has in any direction either spin up or spin down. However, owing to the uncertainty relations, an electron that has a well determined spin value in the x direction (say value up) cannot have a well determined spin value in the y direction (neither up, nor down). Birkhoff and von Neumann's quantum logic represents, in a sense, a "semiaristotelian logic". As we have seen, the semantic tertium non datur fails: a sentence is not necessarily either true or false. However, the non contradiction principle still holds. New forms of quantum logic that are even "more non aristotelian" arise in the framework of the so called unsharp approaches to quantum theory. Why moving to the unsharp approaches? From an intuitive point of view, on can say that such approaches represent, in a sense, an important step towards a kind of second degree of "fuzziness" or "ambiguity". We can try and illustrate the difference between the standard and the unsharp approaches by referring to a non scientific example. Let us consider the two following sentences, which apparently have no definite truth-value: I) II) Hamlet is 1. 70 meters tall. Brutus is an honourable man. The semantic uncertainty involved in our first example seems to depend on the incompleteness of the individual concept associated to the name "Hamlet". In other words: the property "being 1.70 meters tall" is certainly a sharp property. However, our concept of Hamlet is not able to decide whether such property is satisfied or not. Differently from real persons, literary characters have a number of undetermined properties. On the contrary, the semantic uncertainty involved in our second example, is mainly caused by the ambiguity of the concept "honourable". What does it mean "being honourable"? Needless to recall how the ambiguity of the adjective "honourable" plays an important role in the famous Mark Antony's monologue in Shakespeare's "Julius Caesar". Orthodox quantum theory and quantum logic take into consideration only examples of the first kind: properties are sharp; whereas all semantic uncertainties are due to the incompleteness of the individual concepts, which correspond to pure states of quantum objects. A characteristic of unsharp quantum theory, instead, is to investigate also examples of the second kind. 230 M . L. D ALL A CHI A R A & R. G I U N TIN I In Hilbert space quantum theory, physical properties that may be ambiguous and unsharp are mathematically represented by particular operators, called effects. At the same time, sharp properties correspond to projection operators, which are limit-cases of effects. An extreme case of an unsharp property is represented by the semitransparent effect, to which any physical state assigns probability ~. This represents a kind of paradigmatic example of a totally undetermined physical property. In the possible world semantics for unsharp quantum logic, an information i may correspond to an ambiguous knowledge. As a consequence, pieces of information turn out to be not necessarily compatible with themselves: reflexivity breaks down. On this basis, it turns out that an information may force at the same time a sentence A and its negation not A: iF A, notA. The non contradiction principle is violated! The result is a paraconsistent (or fuzzy or unsharp) form of quantum logic. Similar semantic behaviours can be recognized also in other domains of experience that, prima facie, appear to be quite far from microphysics. In fact, the notion of possible world can be successfully applied not only by logicians and by physicists. For instance, also historians naturally interact with possible worlds, at least implicitly. The role of possible worlds in historical research has been investigated in [2], [3]. We will use here a slightly different semantic approach. Similarly to physical states, even historical sources determine pieces of information about possible state of affairs. From an abstract point of view, in the simplest case, a source can be idealized as a set of sentences: a source i will assert a sentence A when A is contained in i. For instance the source The Lives of the Twelve Caesars asserts the sentence: Evident miracles had announced to Julius Caesar his violent death. More generally a source may be represented also by some non linguistic objects (monuments, tools, tombs, ... ). As a consequence, the relation "i asserts A" might have a weaker meaning, like "A is confirmed, testified by i". Similarly to a quantum logical situation, two sources may be either compatible or incompatible. We will say that two sources i and j are compatible when it is not the case that i asserts a sentence A whereas j asserts the negation not A. Since a source is not necessarily consistent, QUANTUM LOGICAL SEMANTICS 231 we will have that some sources might be incompatible with themselves. Apparently, the presence of a local contradiction is not a sufficient reason that renders a given source completely unreliable. Consequently, like in the case of unsharp quantum logic, the accessibility relation between pieces of information turns out to be symmetric, but generally not reflexive. Let I represent a set of sources that are available to a given historian. When will our historian accept the historical truth of a given statement? At first sight, one is tempted to take into consideration the two following possibilities: I) II) All sources assert A. At least one source asserts A. However, our first choice seems too strong: many events that are sometimes considered as "historical facts" are not asserted by all sources. At the same time, our second choice appears too weak: some sources might be not completely reliable. For instance, why should we trust Suetonius when he asserts that "Evident miracles had announced to Julius Caesar his violent death"? An intermediate choice between I) and II), that seems quite reasonable from the intuitive point of view, naturally leads to a quantum-logic-like truth condition: a source i forces the truth of an atomic sentence A if and only if for any source j compatible with i, there exists a compatible source k that asserts A. In other words, A represents a stable assertion. As to the case of compound sentences, again the quantum logical truth conditions appear to be quite natural. In particular, one might mention a number of examples in historical contexts, where an alternative is true, whereas both members are strongly undetermined. Finally let us ask: when will a historian accept the historical truth of a given sentence? The natural answer in our abstract semantics is the following: a historian will accept the truth of A, when A is forced by a set of sources that are judged reliable by our historian. On this basis one can conclude that truths discovered by historians and by quantum physicists turn out to have some strong formal similarities! In both cases, one is dealing with a typical holistic behaviour, against the analytic tradition of classical semantics. Holistic features can be easily recognized also in the quantum theoretical treatment of the notion of physical object. Suppose we want 232 M. L. DALLA CHIARA & R. GIUNTINI to describe a compound physical system. For instance, a two particle system 8 = 81 + 82 (say, the electron and the nucleus of a given hydrogen atom). In the happiest situation, the observer's information about the system will correspond to a maximal information: a pure state. Otherwise, it will be a mixed state or more generally a physical property corresponding to a partial description of our system. What about the component systems? Information about the parts can be derived from the information about the whole system in accordance with the mathematical formalism of quantum theory. Now, a typical quantum theoretical situation is the following: a state i provides a maximal information about the whole system. However, the information determined by i about the component systems cannot be maximal: the individual concepts of the two parts do not correspond to pure states. This is exactly what happens in the case of a compound system consisting of two indistinguishable particles of the same kind (for instance, two bosons, which obey the Bose-Einstein statistics). Since we are dealing with two indistinguishable bosons, any pure state of the whole system may have a typical form that has been called entangled: as a consequence the states of the two subsystems will not be pure and will correspond to two equal mixed states. The result is a characteristic holistic behaviour: the whole permits to determine partial individual concepts for the parts; at the same time the whole is not determined by its parts. Have these semantic considerations any bearing for a possible formal analysis of interpretations in artistic contexts? Let us refer to the case of music and let us try and ask the following question: is it possible to recognize a kind of abstract nucleus that might be common to scientific theories and to musical compositions? As is well known, according to a standard logical approach, any scientific theory can be, in principle, analyzed as a pair 1'IHlE([J)~Y = ( Formal system, Class of interpretations) where the formal system corresponds to an axiomatic version of the theory, while the interpretations (or models) represent experience domains where the sentences asserted by the formal system (the theorems of the theory) are verified. In some limit-cases a formal system is lacking, and the theory is simply identified with a language and with a class of models. Paradigmatic examples of theories in this sense are Peano arithmetic or von Neumann's axiomatic quantum theory. QUANTUM LOGICAL SEMANTICS 233 One can guess that even a musical composition can be similarly analyzed as a pair: COMJlD«Jl§TIlTION = ( Score, Class of interpretations) The score plays the role of a formal system (a particular syntactic object); at the same time all historical interpretations of the score play the role of the models. Needless to observe, musical compositions cannot be generally reduced to their mere scores. Scores and formal systems present some clear similarities and some obvious differences. For instance, like in the case of formal languages, also in a score one can naturally distinguish between atomic and compound musical expressions: atomic expressions will represent musical phrases or themes that are perceived as a whole, and that cannot be further decomposed. As an example, the first measures of the first violins and of the violas in Beethoven's 9th symphony can be reasonably regarded as an atomic expression: All the proper parts in such expression, are perceived as incomplete themes. Compound musical expressions are build up by means of complex concatenation operations in a bidimensional frame: as is well known, bidimensionality is a characteristic of the standard formal language of musical works. These concatenation operations can be regarded as the natural counterparts of logical connectives in formal systems. For a theoretician, a formal language represents a possible linguistic context: asserting a given formal system essentially means selecting a set of theorems within this language. In the same way, one could say that a composer refers to a linguistic context, where he selects a system of musical expressions. In a sense, the musical expressions written in a score play the role of the theorems in a formal system. A main difference between formal systems and scores is the following: in most formal languages well formed expressions give rise to a decidable set; in other words, one can always decide in a finite number of steps whether a given sequence of signs is a well formed expression of our language. At the same time, after Codel we know that the set of the theorems is generally undecidable. On the contrary, musical languages (as well as all artistic languages, in general) are intrinsically ambiguous and undecidable, in spite of all rigid harmony treatises. 234 M . L. D ALL A CHI A R A & R. G I U N TIN I What about the second component of a musical work (the class of all interpretations)? As is well known, interpreting a formal system means realizing the well formed expressions of the language in a given universe of discourse. Similarly a musical interpretation (a performance) realizes musical events that corresponds to musical expressions written in the score. In both cases, one is dealing with an operation that associates meanings to systems of signs. As we have seen, in classical model theory meanings have a compositional and analytic behaviour: the meaning of a whole is determined by the meanings of the parts. Of course, such a behaviour cannot be expected in the case of musical performances: generally a whole will determine the meanings of the parts, but not the other way around. Whenever one interprets (or one simply listens to) a given symphony, one cannot generally perceive all the different lines written in the score as distinguished elements. At the same time, the result of a performance does not represent, for a musically educated person, a kind of indistinct sound beam (which represents a typical unidimensional phenomenon, from the physical point of view). Generally, an interpretation decomposes a whole consisting of many expressions into a limited number of voices that are perceived as unitary elements, where each voice is active during a certain time interval. Of course voices may be either human or instrumental. Is it really useful to try and develop technically an abstract holistic semantics for musical compositions, based on these general ideas? We do not have a certain answer to this question. Anyway, the mere existence of some common abstract patterns that can be discovered both in scientific and artistic works, seems to be interesting from the theoretical point of view. AFFILIA TION Maria Luisa Dalla Chiara Roberto Giuntini Dipartimento di Filosofia Universita di Firenze, Italy REFERENCES [1] Birkhoff, G. and von Neumann, J., "The Logic of quantum mechanics", Annals of Mathematics, 37, 1936, pp. 823-843. [2) Toraldo di Francia, G., Un Universo Troppo Semplice, Feltrinelli, Milano, 1990. [3) Toraldo di Francia, G., "Historical Truth", Foundations of Science, 1, 1995-1996, pp. 417-425. 235 INDEX Lukasiewicz, 68, 91 conjunction, 70, 82 disjunction, 70, 82 equivalence, 70 implication, 70, 72 negation, 70, 82 operations on fuzzy sets, 82 accessibility relation, 227 antisymmetry, 44 Aristotle, 68 associativity, 42, 47 atomicity, 188 attribute, 6, 14, 29 actual, 14, 16, 22 irreducible, 18 logic, 15, 17, 27, 28 potential, 14, 22 principal, 16 principle of actuality, 18 tested, 16 axiomatics, 169, 176, 182, 190, 201 Bell -Kochen-Specker theorem, 104, 133 inequality, 133 theorem, 132 binary operation partiaL 42 total, 42 Birkhoff and von Neumann's quantum logic, 227 Birkhoff-von Neumann, 88, 89 Bohr, 31, 106, 142 bosons,232 certainly true domain, 122 characteristic function, 39 classical, 3, 4, 13, 14, 18, 27, 28, 30, 54-57, 67, 68, 75, 78, 81, 82, 8693,103,105,107-109,112-116,118, 119, 122, 125-127, 129-131, 136, 143-145, 153, 163-169, 185, 198, 200, 207, 208, 210-213, 215-218, 225-228, 231, 234 closure, 122 commutativity, 42, 47 compatibility pragmatic, 124 semantic, 123 complement, 121 complementarity, 31, 86, 88 completion procedure, 126 compound entities, 193 compound physical system, 125, 232 conditional hypothesis, 28 confidence algebra, 43 function, 38, 43 confidence function, 41 conjunction, 13, 18, 27, 94 Lukasiewicz, 70, 82 Zawirski, 74, 82 contexts accessible physical, 130 contradictory, 130 non-accessible physical, 130 contextuality, 104, 133 pragmatic, 131 semantic, 131 Copenhagen paradigm, 86 covering law, 188, 193, 194, 200, 201 crisp set, 76 D-poset, 7 de Broglie, 85, 142 de Morgan law, 228 decidable, 233 deduction rule, 107, 109 denial, 28 detector isotropic, 42 particle, 38 dichotomic registering device, 116 dichotomy, 44 difference poset, 48 236 INDEX Dirac, 2, 17, 18, 87, 142 disjunction, 28, 94 Lukasiewicz, 70, 82 Zawirski, 74, 82 distributivity, 49 domain, 118 effect, 38, 47, 117 algebra, 7 operator, 7 algebra, 47, 120 Hilbert space, 50 crisp, 48 operator, 7 sharp, 38, 46 unsharp,38 effects, 230 Einstein, 2, 75,84, 106, 142, 194, 196, 200, 201, 232 equivalence Lukasiewicz, 70 estimator, 24 maximum likelihood, 24 Euclidean, 58, 68,143,145,147,149, 200,201 event, 9, 91 logic, 10, 11, 28 nonoccurred, 22 nonoccurrence, 9 not tested, 22 occurred, 22 occurrence, 9 test of, 19 events compatible, 10 locally complementary, 10 orthogonal, 10 perspective, 10 experiment, 1, 7, 14, 22, 23, 27, 3739, 55, 62, 67, 83-87, 89-95, 116, 117, 121, 126, 131, 136, 149-154, 157-160, 162, 163, 167, 176-178, 191, 192, 196, 198, 199,211-214 double-slit, 83, 85 interference, 84, 85 experimental procedure, 8 proposition, 11, 27, 29, 89-9l confirmed, 12, 22 not tested, 22 refuted, 12, 22 test of, 12 propositions compatible, 13 logic of, 11, 12 orthogonal, 13 simultaneously testable, 27 extension, 118 extreme point, 20 face lattice, 21, 28 Feynman, 83 forcing conditions, 227 formal system, 232 frequency, 121 fuzzy proposition, 7 set, 76 complement, 78, 82 theory, 76 sets Lukasiewicz operations, 82 bold operations, 82 equality, 78 Giles operations, 82 inclusion, 78 intersection, 80, 82 union, 79, 82 Zadeh operations, 78, 82 Giles, 82 greatest lower bound, 13, 46, 93 Heisenberg, 26, 54, 55, 87, Ill, 141, 142 hidden measurements, 166 hidden variables, 2, 30, 31, 90, 114, 137, 163, 164, 207, 210 local, 137 theory, 114 Hilbert space, 4-7, 48,50,54,57,62, 89-92, 94, 96, 142, 143, 160, 180, INDEX 183-186, 188-190, 195-199, 201, 208, 219, 227, 230 historical sources, 230 historical truth, 231 holistic behaviour, 232 implication, 12 Lukasiewicz, 70, 72 connective, 28 relation, 13, 15, 28, 172 indistinguishable particles, 232 interference, 83 interval estimation, 23 involution, 44 join, 13 laboratory, 116 lack of knowledge, 90, 163, 165-168, 196-199 language of properties, 127 lattice, 3-7,13,21,22,27,28,49,50, 57, 62, 90, 92, 94, 126, 127, 183, 185-190, 194, 197, 207, 208, 211, 212,217-219 of face,;, 28 of facs, 21 orthomodular, 5, 6, 13, 90 law 0- 1, 47 associative, 42 De Morgan, 13, 46, 74 distributive, 49 distributivity, 54 empirical, 129 of bivalence, 69 of C'ontmniction, 74 of distributivity, 4, 90 of excluded middle, 69, 74 theoretical, 129 least upper bound, 13, 45, 93 Leopardi Giacomo, 226 locality, 132 logic 237 n-valued, 69, 71 Lukasiewicz, 68 of experimental propositions, 12 classical, 68, 75, 86, 107, 115 inductive, 23, 28 infinite-valued; 71, 81 intuitionistic, 109 many-valued, 68, 71, 86 non-classical, 68 of attributes, 15, 17, 27, 28 of conditional hypotheses, 28 of events, 10, 11, 28 of experimental propositions, 11, 28 quantum, 90, 110, 127 three-valued, 69, 87 logical connectives, 27 macroscopic, 60, 61, 83, 85, 116, 136, 144, 145, 165, 198, 207 maximal knowledge, 227 measurement, 7, 8, 25, 26, 29, 30, 37, 38, 41, 46-49, 54, 55, 59, 60, 62, 103, 104, 110, 111, 113, 114, 128, 132, 134, 137, 159, 165, 167, 207212, 215, 219 measurement yes-no, 37 meet, 13 membership degree of, 53, 75, 82 function, 76 Metatheoretical Principle classical, 130 Generalized, 130 mixed state, 232 models, 232 musical interpretation, 234 negation, 13, 92 Lukasiewicz, 70, 82 Post, 73 Non contradiction principle, 226 nonlocality, 104, 132 objectification problem, 113 238 INDEX observative language, 116 ontologie, 90, 112-115, 132, 163, 166, 167, 169, 172, 191, 199 operational, 62, 91, 126, 128, 129, 131, 136, 142, 143, 169, 173, 174, 176, 182, 190, 195, 201 operator, 2,4, 5, 7, 26, 29, 30, 54, 57, 62, 89-92, 94, 126, 134, 143, 157159, 178-181, 183-187, 189, 198, 199, 230 order inverting operation, 44 logical, 109 relation, 120 partial, 44 total, 44 orthoalgebra, 6, 48 orthocomplementation, 5, 13, 27, 28, 188-190, 193,207,217-219 orthogonality, 47, 156, 188, 190, 207, 212, 214, 215, 218 orthomodular identity, 5 lattice, 5 identity, 13 lattice, 6, 13, 90 poset, 6 orthosupplement, 40 orthosupplementation, 47 outcome, 9-12, 14-16, 19, 20, 24-29, 62, 92, 93, 111, 113, 117, 120, 121, 134, 153, 159, 160, 166, 176, 178, 211, 212, 214, 215 paraconsistent, 230 paradox, 2, 104, 106, 110, 115, 116, 119, 123, 125, 127, 135, 137, 141, 143, 149, 190, 194, 197 photon, 84 physical object, 117 point estimation, 23 possible world semantics, 227 Post, 69, 72 negation, 73 pre-order, 109, 171, 172, 183, 185,198 preclusivity relation, 122 predicate non-sharp, 76 sharp, 75, 76 preparation, 116 probability, 19, 121 measures convex combination of, 20 full set of, 19 order determining set of, 19 model, 19 projection, 4, 5, 7, 29, 30, 54, 62, 89, 91, 92, 94, 125, 137, 143, 158-160, 162, 168, 178, 183, 184, 186, 187, 189, 190, 212, 230 operators, 230 property, 38, 42, 44, 48, 54, 62, 87, 103, 104, 108, 111, 123, 125-128, 133, 137, 146, 164, 166, 167, 169173, 176-178, 181, 183-186, 188192, 196, 197, 200, 201, 211-213, 215,217,229,230,232 correlation, 126 first order, 123, 126 higher order, 123 testable, 123 theoretical, 126 proposition, 29 propositional function, 92 always-false, 92, 95 always-true, 92, 95 pure states, 227 quantum logic, 230 quantum machine, 145, 149-152, 154, 155, 162, 165-167, 169, 179, 185, 191, 198, 199 quantum mechanics, 1-6, 18, 25, 2931, 38, 48, 50, 53-55, 58, 59, 61, 62, 83, 86-91, 103-107, 109-116, 118-120, 122, 124-127, 129-137, 141-143, 145, 149, 153, 155, 158160, 162-165, 167, 169, 172, 182, 184, 186, 188-197, 199-201, 211, 212, 215, 225 quantum structures, 89, 92, 141, 142 INDEX reality, 132 received viewpoint, 104 reflexivity, 44 Reichenbach, 88 relativity theory, 142, 143, 200 Schrodinger, 2, 87, 141, 142 score, 233 239 testability, 110 transition probability, 151, 154, 155, 162, 179-181, 198, 209, 222 transitivity, 44 truth, 107, 128 access to, 107 empirical access to, 128 Tarski theory of, 107 Semantic Realism, 115, 119, 127 spin, 25, 30 experiment, 152, 157, 160 state, 2,16-18,20,22,25-27,29-31, 44, 48, 53-57, 59-62, 69, 76, 84, 87, 91-93, 95, 96, 103, 104, 108110, 114, 117-127, 129, 131-134, 137, 149, 150, 153-157, 159, 164, 166, 168-173, 175-182, 185-188, 190, 191, 195-199, 201, 207-209, 211-219, 227, 229, 230, 232 entangled, 125 first type, 125 mixed, 122 property space, 171-176, 182-187, 189-191, 195, 201 property system, 184, 185, 188, 189, 192-194 pure, 122 second type, 125 statement nontestable, 128 testable, 128 statistical hypothesis, 23 testing, 23 inference, 23 parameter, 23 parametric hypothesis, 23 Stern-Gerlach, 29,152-155,157,160, 209, 219 superposition, 18, 28 support, 20 testable, 124, 127 theoretical, 127 tensor product, 6 Tertium non datur, 226 uncertainty principle, 93 uncertainty relations, 229 undecidable, 233 unsharp approaches to quantum theory, 229 vector space, 2, 4, 89, 142, 145, 147, 148, 155-157, 162, 179, 188, 194 verificationism empirical, 110 logical, 109 von Neumann, 2, 4, 5, 29, 31, 5456,58,67,88-90,92-96,141,143, 144, 164, 215, 225, 227, 229, 232 weak modularity, 189 Zadeh, 76 operations, 78, 82 Zawirski, 74,88, 91 conjunction, 74, 82 disjunction, 74, 82

Log In

Quantum Structures and the Nature of Reality

Related papers

Related papers

Related topics