Academia.eduAcademia.edu

Data, Information, Knowledge and Competence

Introduction What does it mean being competent in a foreign language, say, French? The reader should try to answer this question before proceeding with this paper. It would be interesting to write down the answer, comparing it with the one given later on. I asked that question to many I.T. (Information Technology) professionals during interviews to assess their competencies. The answers varied from "having fluency in that language" to "being able to think in French". These vague characterizations are not very useful if the intention is to develop a data processing system to collect competencies and making it possible to indicate professionals to compose project teams or to fill up managerial vacancies. Looking at the literature for some help was not effective: "competence" is rarely handled, and there was quite a bit of confusion between this concept and that of "knowledge". Worst of all, there was a general confusion between "knowledge" and "information", and also between "information" and "data". Stenmark [2002: chap. 3] says: "It has often been pointed out that data, information, and knowledge are not the same, but despite efforts to define them, many researchers use the terms very casually. In particular, the terms knowledge and information are often used interchangeably, even though the two entities are far from identical." When I reached the question of what "data" is, I could use a definition I had previously developed. In this paper, I will provide this definition, and clear characterizations to the other three concepts. It would also be interesting if the reader would try at this point to give her/his characterization of what s/he understands by "information" and "knowledge", comparing it with the ones given later. But her/his probable difficulty is not uncommon: on issue #81 dated Aug. 10, 1998 of the excellent electronic magazine Netfuture-Technology and Human Responsibility its editor, Stephen Talbott, describes that during two lectures to large audiences of librarians, he asked what "information" was and nobody risked an answer besides "that's the stuff we work with" [Talbott 1998]. This paper begins with the definition of "data". Then characterizations (and not definitions, as will be clear) of "information", "knowledge" and "competence" are given. It will be seen that "competence" depends on two factors, leading to a matrix representation, the "competence matrix". After general considerations about these concepts, a review of the literature is made. Then it is shown how they were used in the implementation some years ago of two different competence management systems for PROMON (a very large Engineering firm in São Paulo), and PRODESP (the São Paulo State Data Processing Company, which has more than 1,000 technical professionals in the data processing field). Finally, some considerations are made on the implementation of Competence Centers, congregating employees of some professional area. 2. Data I define data as a sequence of quantified or quantifiable symbols. Thus, a text is a piece of data. In fact, letters and characters are quantified symbols because there is a finite number of them; any alphabet (including digits and special characters) may be considered as a numbering system. Pictures, figures, recorded sounds and animation are also examples of (quantifiable) data, because they may be quantified (using digital scanners, cameras, recording devices, etc.) to the point that it is eventually difficult to distinguish, from their originals, their reproduction made from the quantified representation. It is very important to note that, even if incomprehensible for a reader, any text constitutes a piece of data. This will become clearer in the next section. Therefore, in this definition data are necessarily mathematical entities, and thus are purely syntactic. This means that data may be totally described through structural, formal representations. Being quantified or quantifiable, it can obviously be stored into a computer and processed by it. Hence, it is valid to use the expression "data processing". Inside a computer, a piece of text may be linked to other pieces, through physical contiguity or through "pointers", which are addresses of the storage unit being used. Thus, one gets a "data structure". Pointers may link some place of a text to a quantified representation of a figure, sound, etc., introducing more structure. Data processing in a computer is limited exclusively to structural manipulations of data, done through programs. The latter are always implemented mathematical functions, and thus are also "data". Examples of such manipulations in the case of texts are their formatting, sorting, comparing with other texts, changing fonts, statistics of words appearing in the text, etc. 3. Information Informationis an informal abstraction (that is, it cannot be formalized through a logical or mathematical theory) which is in the mind of some person in the form of thoughts, representing something of significance to that person. Note that this is not a definition, it is a characterization, because "mind", "thought", "something", "significance" and "person" cannot be well defined. I assume here an intuitive (naïve) understanding of these terms. The most common way of representing and transmitting information is through data. For example, the phrase "Paris is a fascinating city" is a piece of data that can be received by someone, and is incorporated as information if she understands English and knows what Paris means. In this sense, information may be acquired through such pieces of data as texts, pictures, recorded sounds, pictures and animation. A more elaborate example may clarify the difference between data and information. Suppose a person named P, who does not know anything about Chinese (say, Mandarin), sees a text with 2 columns. In the first line, there are some ideograms; in the subsequent lines, there are in the left column some ideograms, and in the right column some 2-digit numbers like-4, 5, 25, etc. P doesn't have the faintest idea what the whole text represents, that is, for her it is simple data. She may guess that it is a table of some sorts, and that the first line has its headers. In this case, P may reformat the table, for instance ordering the lines below the first one using the texts in the first column (given an appropriate alphabetical ordering of the ideograms), or using the numbers in the second column. She may also choose special fonts for the texts or numbers; or may decide to exchange the two columns, enlarge the size of the table, set borders, etc. These actions are pure data processing. Now, suppose a Chinese explains to P that the texts in the first line mean "City" and "Temperature", and explains what city names are represented in the first column, like Beijing, Saigon, Paris, etc., and tells P that the table represents the average temperature of the previous day in those cities. Now P understands what the table means: she incorporates the data from the table as information: how cold or warm it was in Beijing, etc. If the representation of some information is done through data, as in the phrase on Paris or the temperature table, it may be stored into a computer. But, attention, what is stored this way is not information, but its representation under the form of data. This representation may be transformed by the machine, as in text formatting, a syntactical transformation. The machine cannot change the meaning starting from the latter, because meaning depends on a person who has the information. Obviously, the machine may shuffle the data in such a way that it may become unintelligible to the person who receives them; in this case it has ceased to be information for that person. Furthermore, it is possible to transform the representation of some information in such a way that its meaning changes for the person receiving it (as for example automatically changing the name "Paris" to "London"). There is a change of meaning for the human receptor, but in the computer the change was purely syntactical, a mathematical data manipulation. It is not possible to process information in a computer. For this, information has to be transformed into data, and then it is not information anymore. Similarly, it is not possible to store information in a computer; what is stored is the representation of someone's information in the form of data. Hopefully, it will be read or seen by someone else as information. The crucial problem is to make those two people have precisely the same understanding of that representation. In general, when people say that humans process information, an analogy is being made to data processing by a computer. This association is undue, because we know exactly how computers process data, but there is no idea how

Data, Information, Knowledge and Competence Valdemar W.Setzer www.ime.usp.br/~vwsetze Original: Apr. 2001; new revised and enlarged version (3.1): Feb. 2006 1. Introduction What does it mean being competent in a foreign language, say, French? The reader should try to answer this question before proceeding with this paper. It would be interesting to write down the answer, comparing it with the one given later on. I asked that question to many I.T. (Information Technology) professionals during interviews to assess their competencies. The answers varied from "having fluency in that language" to "being able to think in French". These vague characterizations are not very useful if the intention is to develop a data processing system to collect competencies and making it possible to indicate professionals to compose project teams or to fill up managerial vacancies. Looking at the literature for some help was not effective: "competence" is rarely handled, and there was quite a bit of confusion between this concept and that of "knowledge". Worst of all, there was a general confusion between "knowledge" and "information", and also between "information" and "data". Stenmark [2002: chap. 3] says: "It has often been pointed out that data, information, and knowledge are not the same, but despite efforts to define them, many researchers use the terms very casually. In particular, the terms knowledge and information are often used interchangeably, even though the two entities are far from identical." When I reached the question of what "data" is, I could use a definition I had previously developed. In this paper, I will provide this definition, and clear characterizations to the other three concepts. It would also be interesting if the reader would try at this point to give her/his characterization of what s/he understands by "information" and "knowledge", comparing it with the ones given later. But her/his probable difficulty is not uncommon: on issue #81 dated Aug. 10, 1998 of the excellent electronic magazine Netfuture -- Technology and Human Responsibility its editor, Stephen Talbott, describes that during two lectures to large audiences of librarians, he asked what "information" was and nobody risked an answer besides "that's the stuff we work with" [Talbott 1998]. This paper begins with the definition of "data". Then characterizations (and not definitions, as will be clear) of "information", "knowledge" and "competence" are given. It will be seen that "competence" depends on two factors, leading to a matrix representation, the "competence matrix". After general considerations about these concepts, a review of the literature is made. Then it is shown how they were used in the implementation some years ago of two different competence management systems for PROMON (a very large Engineering firm in São Paulo), and PRODESP (the São Paulo State Data Processing Company, which has more than 1,000 technical professionals in the data processing field). Finally, some considerations are made on the implementation of Competence Centers, congregating employees of some professional area. 2. Data I define data as a sequence of quantified or quantifiable symbols. Thus, a text is a piece of data. In fact, letters and characters are quantified symbols because there is a finite number of them; any alphabet (including digits and special characters) may be considered as a numbering system. Pictures, figures, recorded sounds and animation are also examples of (quantifiable) data, because they may be quantified (using digital scanners, cameras, recording devices, etc.) to the point that it is eventually difficult to distinguish, from their originals, their reproduction made from the quantified representation. It is very important to note that, even if incomprehensible for a reader, any text constitutes a piece of data. This will become clearer in the next section. Therefore, in this definition data are necessarily mathematical entities, and thus are purely syntactic. This means that data may be totally described through structural, formal representations. Being quantified or quantifiable, it can obviously be stored into a computer and processed by it. Hence, it is valid to use the expression "data processing". Inside a computer, a piece of text may be linked to other pieces, through physical contiguity or through "pointers", which are addresses of the storage unit being used. Thus, one gets a "data structure". Pointers may link some place of a text to a quantified representation of a figure, sound, etc., introducing more structure. Data processing in a computer is limited exclusively to structural manipulations of data, done through programs. The latter are always implemented mathematical functions, and thus are also "data". Examples of such manipulations in the case of texts are their formatting, sorting, comparing with other texts, changing fonts, statistics of words appearing in the text, etc. 3. Information Informationis an informal abstraction (that is, it cannot be formalized through a logical or mathematical theory) which is in the mind of some person in the form of thoughts, representing something of significance to that person. Note that this is not a definition, it is a characterization, because "mind", "thought", "something", "significance" and "person" cannot be well defined. I assume here an intuitive (naïve) understanding of these terms. The most common way of representing and transmitting information is through data. For example, the phrase "Paris is a fascinating city" is a piece of data that can be received by someone, and is incorporated as information if she understands English and knows what Paris means. In this sense, information may be acquired through such pieces of data as texts, pictures, recorded sounds, pictures and animation. A more elaborate example may clarify the difference between data and information. Suppose a person named P, who does not know anything about Chinese (say, Mandarin), sees a text with 2 columns. In the first line, there are some ideograms; in the subsequent lines, there are in the left column some ideograms, and in the right column some 2-digit numbers like -4, 5, 25, etc. P doesn't have the faintest idea what the whole text represents, that is, for her it is simple data. She may guess that it is a table of some sorts, and that the first line has its headers. In this case, P may reformat the table, for instance ordering the lines below the first one using the texts in the first column (given an appropriate alphabetical ordering of the ideograms), or using the numbers in the second column. She may also choose special fonts for the texts or numbers; or may decide to exchange the two columns, enlarge the size of the table, set borders, etc. These actions are pure data processing. Now, suppose a Chinese explains to P that the texts in the first line mean "City" and "Temperature", and explains what city names are represented in the first column, like Beijing, Saigon, Paris, etc., and tells P that the table represents the average temperature of the previous day in those cities. Now P understands what the table means: she incorporates the data from the table as information: how cold or warm it was in Beijing, etc. If the representation of some information is done through data, as in the phrase on Paris or the temperature table, it may be stored into a computer. But, attention, what is stored this way is not information, but its representation under the form of data. This representation may be transformed by the machine, as in text formatting, a syntactical transformation. The machine cannot change the meaning starting from the latter, because meaning depends on a person who has the information. Obviously, the machine may shuffle the data in such a way that it may become unintelligible to the person who receives them; in this case it has ceased to be information for that person. Furthermore, it is possible to transform the representation of some information in such a way that its meaning changes for the person receiving it (as for example automatically changing the name "Paris" to "London"). There is a change of meaning for the human receptor, but in the computer the change was purely syntactical, a mathematical data manipulation. It is not possible to process information in a computer. For this, information has to be transformed into data, and then it is not information anymore. Similarly, it is not possible to store information in a computer; what is stored is the representation of someone's information in the form of data. Hopefully, it will be read or seen by someone else as information. The crucial problem is to make those two people have precisely the same understanding of that representation. In general, when people say that humans process information, an analogy is being made to data processing by a computer. This association is undue, because we know exactly how computers process data, but there is no idea how humans process information. The word process is normally associated to some mechanical process. According to the American Heritage Dictionary (2000 electronic edition), the word indicates the processes of digestion, of obtaining a driver's license, of manufacturing, etc. It is not possible to prove that when humans work internally with information, they only use the brain in a mechanical or electronical way. It is a scientific speculation that the mind resides in the brain, but this is not a scientific fact. For instance, A. Damasio says: "What I am suggesting is that the mind arises from activity in neural circuits ..." [1994, p. 226]. Even if the mind would reside in the brain, it is not known exactly how the brain works. So I consider it absolutely undue and wrong saying that humans "process information". The problem here is that this expression, together with others mentioned below, lead to an undue image of humans as being machines -- the prevalent concept in modern cognition theory. Curiously, this is linguistically wrong, because what is understood under "machine" is a device that was designed and constructed by humans -- eventually, with the help of other machines. But humans were not designed by humans and were not constructed. Data, as far as it is intelligible, is always incorporated by someone as information, because (adult) humans are always looking for meaning and understanding. When the phrase "the average temperature in Paris in December is 5 oC" (by hypothesis) is read or heard, an immediate association is made by the reader (or hearer) with cold, with a certain period of the year, with the particular city, etc. Note that "meaning" cannot be formally defined. Here it will be considered as a mental association with a concept, as temperature, Paris, etc. The same happens when we see an object with a certain format, and we say that it is "circular", associating -- through our thinking -- our mental representation of the perceived object with the concept "circle". For a deep study of thinking, showing that as far as our inner activity is concerned, it is an organ for the perception of concepts, see one of the fundamental works by Rudolf Steiner, his "Philosophy of Freedom" (direct translation of the German original title), specially his chapter IV, "The world as perception" [Steiner 1963, p. 76]. Information may be an inner property of some person, or may be received by her. In the first case, it is in the mental sphere, and may originate from an inner perception, like some pain; information is in this case the thoughts the person elaborates about her pain. In the second case, it may or may not be received through its symbolic representation as data, that is, under the form of text, figures, recorded sound, animation, etc. As said above, the representation by itself, for example a text, consists exclusively of data. Reading a text, a person may absorb it as information, as long as she understands it. It is possible to associate the reception of information through data to the reception of a message. Nevertheless, information may be derived from an experience where there was no data involved. This was the case for the pain mentioned above. Another situation occurs when someone is in a closed room and, extending her arm through the window she has some experience of warmth; thinking about it, she generates information about her experience, that is, if it is too cold or not in the outside. In the case of pain or feeling warmth there were no "messages". Messages may depend on their context to be understood as, for example, when someone hears a strong shout or vocal noise: it may transmit lots of information for its receiver, which is not present in the sounds. When I exemplified data, I used "recorded sound". This is due to the fact that sounds in nature contain much more that what may be recorded: hearing them, there is a whole context that disappears in the recording. The noise produced by sea waves, for example, comes with the view of the sea, of the latter's smell, the air humidity, the luminosity, the wind, etc. If someone expresses her thoughts, she considers that she is transmitting information, because it obviously makes some sense to her. Another person hearing what she is saying is receiving data; if she understands at least part of it, she is also receiving information. The big question is if the information is the same for both persons, that is, if it has the same meaning. A fundamental distinction between data and information is that the former is purely syntactical, and the latter necessarily contains semantics (implied by the word "meaning", used in its characterization). It is important to recognize that it is impossible to introduce semantics into a computer and process it, because the machine itself is purely syntactical (as the whole of mathematics also is). If one examines for instance the field of the so-called "formal semantics" of programming "languages" one would notice that it is in fact just syntax, expressed through an axiomatic theory or through mathematical associations of its constructs with operations performed by a (eventually abstract) computer. As a matter of fact, "programming language" is a misnomer, because what one normally calls a language contains semantics. (Many years ago I heard in a public lecture N.Chomsky -- the famous researcher who established in 1959 the field of "formal languages", and who had intensively looked for syntactic "deep structures" in our language and brain --, saying that a programming language is not a language at all.) Other misnomers used in the computer field, connected to semantics, are "memory" and "artificial intelligence". I am against their use because they give e.g. the false impression that our memory is equivalent in its function to computer storage devices, or vice-versa. Theodore Roszack makes interesting considerations showing that our memory is infinitely wider [Roszack 1994 p. 97]. John Searle, the author of the famous allegory of the Chinese Room (in which a person, following rules from a handbook in English, combines Chinese ideograms without understanding them at all, and thus answers questions in this language -- this is exactly the way computers process data), demonstrating that computers have no understanding, argued that computers cannot think because they lack our semantics [Searle 1991, p. 39]. 4. Knowledge I characterize knowledge as a personal, inner abstraction of something that has been directly experienced by someone. Thus, someone has some knowledge of Paris only if she or he has visited it. Later on I will loosen somewhat this requirement. In this sense, if someone reads a manual, e.g. a guide book to Paris, she acquires information about Paris but no knowledge about it. A person who, looking through the window, sees that it is raining, knows that it is raining through personal experience. In this case, she has gained both information and knowledge. If this person is just told by someone else that it is raining, she gained information, but no knowledge. It is not possible to fully describe one's experience -- unless the experience is observing data. Furthermore, it does not depend just on a personal interpretation, as with information, because it requires a personal experience with the object of knowledge. Thus, knowledge is in the purely subjective realm of humans or animals. Part of the difference between both resides in the fact that a human may be aware of his or her own knowledge and may partially describe it conceptually in terms of information, for instance through the phrase "I visited Paris, so I know it" -- supposing that the reader or hearer understands what is meant. A baby has quite a bit of knowledge. For example she may recognize her mother, she knows that when crying she gets fed, etc. But it is not possible to say that a baby has information, because she does not associate perceptions and concepts through thinking. Along this line, it is not possible to say that an animal has information, but it certainly has lots of knowledge. As mentioned, the representation of information as data may be stored into a computer. Knowledge is not subjected to a complete representation, so it cannot be inserted into a computer. Thus, in the sense expounded here it is absolutely wrong to speak of a "knowledge base" in a computer. At most one may have a traditional "database" which, for someone who retrieves its data, it is hopefully the representation of some information. Thus, there exists information that is related to some knowledge, as in the case of the phrase on Paris, pronounced by someone who knows this city. But there may be information without this relation, for instance if the person reads a travel guide before visiting Paris for the first time. Therefore, information may be practical or theoretical, respectively. Knowledge is always practical. Information may be derived in a purely mental way (for instance, when discovering some mathematical properties); in this case, it is not necessarily related to knowledge. In the knowledge management discourse, one frequently finds the notions of tacit, implicit and explicit knowledge. There are different interpretations for these terms [Stenmark, 2002: chaps. 6, 7]. Let us summarize the different characterizations found in the literature as follows. Tacit knowledge is a knowledge that cannot be expressed, as for instance the capacity someone has of recognizing faces of known people: it is not possible to describe how this recognition is done and transfer this capacity to other people. Implicit knowledge is the one that has not been expressed, but may eventually be expressed in the future. For instance, a cook that does a wonderful recipe for a special dish, and when asked how much salt she puts in it, she says: "Just two pinches." If someone follows her preparing the dish, those two "pinches" may be measured in grams, thus precisely expressing what she meant. Explicit knowledge is the one that has already been expressed. Taking into account the characterization given before, one may recognize that in that sense knowledge is always tacit: no knowledge can be fully expressed. What is expressed is data describing some thoughts, that is, information, derived from some knowledge. When assessing information, knowledge and competencies, I realized that technical people understood better what I meat by information if I called it theoretical knowledge, and by knowledge, as explained above, if I called it practical knowledge. But in this case it is necessary to clearly distinguish the latter from competence. Information was associated to semantics. Knowledge is associated to pragmatics, that is, it is related to something existing in the "real world" of which we have a direct experience. (Again, I am assuming here a naïve understanding of "real world".) The word comes from Greek through the Latin pragmaticus, "skilled in law, business, state affairs". 5. Competence I characterize competence as an ability of executing a task in the "real world". A person has competence in some field if he or she has demonstrated through past accomplishments the ability of executing a required task. Extending our previous example, a person having the ability of working as a guide in Paris has this kind of competence. Just by knowing Paris (i.e. having knowledge of it) or having studied lots of material on Paris (i.e. having information) does not mean that a person has competence as a guide for it. As competence involves past actions, it requires knowledge, because those actions are experienced by the competent person. Knowledge was associated to pragmatics. Competence is associated with a physical activity. A person may have a good degree of competence e.g. in delivering speeches. For this, she or he must move her or his mouth and produce physical sounds heard by other people. A competent mathematician is not just a person who is able to solve mathematical problems and eventually create new mathematical concepts -- which may be purely inner, abstract, mental (and thus not physical) activities. He or she must also be able to transmit his or her mathematical concepts to others. This transmission is obviously done through physical (outer) actions. The creativity which may be associated with competence reveals another one of its characteristics. Competence may be connected to freedom, which did not appear in the other three because there was no activity involved with them, other than their acquisition. In the example given above, a competent guide to Paris will tour two different tourists in different ways, recognizing that they have different interests. Furthermore, such a guide may consciously improvise different tours for two tourists with the same interests but with different personal reactions along the tour or just by having an intuition that the tourists should be treated differently. Cusumano and Selby [1997] describe how Microsoft Corporation has organized its software development teams permitting the creativity typical of hackers but at the same time directing it to established objectives, maintaining the compatibility of modules through periodic synchronization. Here is another distinct feature of humans and animals in terms of competence: humans are not necessarily directed by their "programs" (due to genetics and the conditioning provided by the environment) as animals are, and may be free and creative, improvising different activities in the same environment. In other words, animal competence is always automatic, deriving from a physical necessity or conditioning. Humans may establish mental objectives for their life, such as cultural or religious ones, having nothing to do with physical needs. These objectives may involve the acquisition of some knowledge and certain competencies, leading to self-development. Competence requires knowledge and personal capacities for realizing something concrete. Therefore, it is impossible to introduce competence into a computer. One should not say that an automated power lathe has some competence. One should say that it contains data (programs and input data) which are used to control its functioning and it can realize some specific tasks. As with knowledge, competence cannot be fully described. When comparing competencies, one has to know that this comparison just gives a rough idea of the degree of competence a person has. Thus, when classifying a competence into, say, "none", "developing", "proficiency", "strong" and "expert", as proposed in the MIT I/T Competence Model [MIT I/T], or "novice", "advanced beginner", "competent", "proficient" and "specialist", according to Hubert and Stuart Dreyfus [Devlin 1999, p. 187], one should be conscious of the fact that something is being reduced to data or, if those terms are understood, information. There is a clear intuitive ordering of those degrees, from none or weak to high competence. Associating a "weight" to each one, as 0 to 4 in the MIT case, and 0 to 5 in the Dreyfuses (here, 0 should e interpreted as "none"), there has been a quantification of something that is non-quantifiable in its essence. Therefore, one should be aware of the fact that when calculating someone's "total competence" over various fields -- eventually required by some project --, a metric is being introduced which reduces some subjective human characteristic to an objective shadow of what it really is, and this may lead to many errors. The situation is worsened with behavioral skills, like "leadership", "ability to interact with others", etc. I am not saying that such quantified assessments should not be used; I just want to point out that they should be used with extreme reserve, and one should be aware that they do not represent what competencies the person being assessed really has. I think that such assessments may be used just as rough suggestions, and should be followed by personal -- and thus subjective -- further analysis. If the computer is used to process data, one is in the objective realm. Humans are not objective entities, so they should always be treated with some degree of subjectivity, otherwise they are reduced to machines (this is obviously even worse than treating them as animals). 6. Intellectual areas The characterizations given above apply very well to practical areas, such as data processing or engineering, but they need further elaboration for purely intellectual areas. Let us examine the case of a competent historian. There is no problem with his or her competence: it is manifested through written papers, books, lectures, courses, etc. On the other hand, it is necessary to extend the characterization of knowledge to encompass such intellectual areas as history: in general historians do not have personal experiences of past times, people and places. Nevertheless, a good historian is certainly a knowledgeable person in her or his area. Unfortunately, my way out of this apparent incongruence of that characterization will not be accepted by everyone: I postulate, as a working hypothesis, that a good historian has in fact a personal experience -- not of physical situations, but of the Platonic "world" of ideas. Ancient facts are recorded in that world as "realities" and are grasped through thinking by a person that immerses him or herself into the study of ancient accounts. The words "intuition" and "insight" deal with mental activities having sometimes to do with a "perception" of that world. In fact, "insight" means, according to the American Heritage Dictionary (1970 edition), "the capacity of discerning the true nature of a situation", "an elucidating glimpse". "True natures" are concepts, hence do not exist physically; I make the hypothesis that through insight, that is, an inner perception, we "glimpse" the world of ideas. See [Steiner 1963, p. 112] for a deep elaboration of this and other kinds of cognition). If one may admit as a working hypothesis that the concept of a circle is a "reality" in the world of ideas, existing independently of any person, then it is not difficult to admit that our thinking is an organ of perception of concepts with which we may "experience" the eternal, universal idea of "circle", which is not stored somewhere in our brain (in fact, neuroscience cannot point to the place and how it is stored there, so this does not contradict known scientific facts). In this sense, and using our characterization for "knowledge" one may say that a person may have a knowledge of the concept "circle". Note that nobody has ever seen a perfect circle, as no living person has experienced with his or her senses the French Revolution, or has met Goethe, but both are realities in the latter's "archetypal world". 7. General comments It is necessary to recognize that those characterizations for data, information, knowledge and competence are not the usual ones. For instance, it is common to consider "data" as a proper subset of "information", that is, data is a particular kind of information. I found it useful to separate completely these two concepts, that is, according to these considerations, data is not part of information; at most, in some cases it may be its representation. The same applies to information and knowledge, and to knowledge and competence. It is interesting to observe that, according to these characterizations, there is no (formal) "Information Theory". What Claude Shannon developed was in fact a "Data Theory". Theodore Roszack [1993, p. 12] relates discussions originated from the name "Information Theory". Shannon's theory deals with, for example, the capacity channels have of transmitting data, and not information. Thus, one should not talk about "amount of information", but "amount of data" transmitted by some channel. "Bit" is not a unit of information measure, but of data, as shown by its name ("BInary Digit"): numbers, by themselves, do not contain information, they are pure data. Data is purely objective -- it does not depend on its user. Information is objective-subjective, in the sense that it is may be described in an objective way (texts, pictures, etc.) but its meaning is subjective, depending on its user. Knowledge is purely subjective -- each person has her own personal experiences. Competence is subjective-objective, in the sense that it is a purely personal characteristic, but everybody may examine its outcome. The characterizations made above may be useful for enterprises. They should become conscious of the fact that they don't introduce information into computers, but data. There are two aspects to be considered here. Data should represent as well as possible the information that should be acquired from them. Furthermore, the enterprise's professionals always interpret them. The same data may be used as two different pieces of information. To avoid it, it is not sufficient that the desired information be clearly represented, but that the professionals be prepared to interpret it in the expected manner. Keith Devlin mentions some tragic cases, such as airplane crashes, due to erroneous interpretation of data or to ambiguous representation of information [Devlin (1999), pp. 9 (the case of the Canary Islands in 1977, with 583 deaths) and 76 (the Cali case in 1995, with 159 deaths)]. On the other hand, it is important to know that it is impossible to transmit knowledge: what is transmitted is data, eventually representing some information. To accomplish knowledge transmission from one person to another, it is necessary to provide for personal interactions between both, with the first one vividly showing or describing her experience (but recall that what is being transmitted is information, if understood by the receiver). Devlin mentions two cases of large enterprises where there was a tentative of transmitting knowledge through data, but the transmission became effective only through personal contact [pp. 176 and 177]. As for competence, it can only be acquired by doing something. Thus, enterprises wishing to develop competence among their professionals in a certain area should make them work in that area or participate in projects, preferable with people with high competence. Courses only develop competence when they involve substantial practical exercises or projects. 8. Literature I found support for some of these ideas in the literature. For example, Y.Malhorta [1998] says: "The traditional paradigm of information systems is based on seeking a consensual interpretation of information based on socially dictated norms or the mandate of company bosses. This has resulted in the confusion between knowledge and information. Knowledge and information, however, are distinct entities. While information generated by computer systems is not a very rich carrier of human interpretation for potential action, knowledge resides in the user's subjective context of action based on that information. Hence, it may not be incorrect to suggest that knowledge resides in the user and not in the collection of information, a point made two decades ago by West Churchman, the leading information systems philosopher." Note that, as characterized above, information cannot be generated by a computer. Computers can only reproduce its representation under the form of data, eventually with some change in format or some purely syntactic treatment. Computers may generate data, for example calculating the average temperature of various cities. Furthermore, I have associated "action" to competence, and not to knowledge. Malhorta also says: "Karl Erik Sveiby, the author of The New Organization Wealth: Managing and Measuring KnowledgeBased Assets (Berret Koehler, 1997), contends that the confusion between knowledge and information has caused managers to sink billions of dollars in information technology ventures that have yielded marginal results. Sveiby asserts that business managers need to realize that unlike information, knowledge is embedded in people, and knowledge creation occurs in the process of social interaction. On a similar note, Ikujiro Nonaka, the first Xerox distinguished professor of knowledge at University of California at Berkeley, has emphasized that only human beings can take the central role in knowledge creation. Nonaka argues that computers are merely tools, however great their information-processing capabilities may be." I consider the confusion between information and competence much worse than between information and knowledge. Competence should be faced with much more subjectivity, and should be connected to some physical accomplishment. According to the characterization of knowledge, an individual may acquire it without social interaction. For instance, someone may make an extensive visit to Paris alone, without speaking to the local people. Well, Paris is also a result of social interactions, but the lonely visit could be made to a lake or mountain. Nonaka seems to imply that knowledge can be described. I do not agree with this. Finally, as we have seen, there is no "information processing", just "data processing" done by computers. For example, formatting information in a computer consists, in reality, in formatting the data which represents that information. With yet higher emphasis, I am against using the expression "knowledge processing" or "knowledge database", as explained above. In their book on knowledge management, Davenport and Prusack's say [1998, p. 1]: "Knowledge is neither data nor information, though it is related to both, and the differences between these terms are often a matter of degree." I agree with the first statement. But in the characterizations given above, the three are absolutely different, and not just a matter of degree. They are also in agreement with Malhorta: "Confusion about what data, information and knowledge are -- how they differ, what those words mean has resulted in enormous expenditures on technology initiatives that rarely deliver what the firms spending the money needed or thought they were getting. ... Organizational success and failure can often depend on knowing which of them you need, which you have, and what you can and can't do with each." [p. 1.] I have tried to establish essential differences; I hope they help to end the present confusion -- and unnecessary expenditures. They characterize "data" as "a set of discrete, objective facts about events" [p. 2]. I agree that data is discrete and objective, but of particular facts: symbolic representations. I do not agree with the events: data may be generated by computers. For example, it may be the outcome of some calculations having nothing to do with facts of the real world (the events). They state that "data by itself has little relevance and purpose." [p. 2.] I consider data by itself as being just symbolic representations, having absolutely no relevance and purpose; only when it is used not as data, but as representation of information, relevance and purpose are given to it -- but then it is not data anymore, it has been incorporated and interpreted by someone. They also state that "... there is no inherent meaning in data. Data describes only a part of what happened." [p. 3.] Yes, there is no meaning in data, it is just a syntactical description, but per se it has no connection to what it describes. A human must establish this connection. Furthermore, I would not say that "data describes" something. It may be the representation of some information, but may also be pure garbage, without possibility of extracting any information from it. For example, the table with city names and temperatures in Chinese (see section 2) is pure garbage for someone that does not read or does not understand that language. Two interesting statements: "Data is important to organizations -- largely, of course, because it is essential raw material for the creation of information." [p. 3.] "Firms sometimes pile up data because it is factual and therefore creates an illusion of scientific accuracy" [idem]. I mentioned data's objectivity; furthermore, it can always be expressed mathematically, hence the mentioned illusion. In their section on information, they describe it as "... a message, usually in the form of a document or an audible or visible communication. As with any message it has a sender and a receiver. Information is meant to change the way the receiver perceives something ... The word 'inform' originally meant 'to give shape to' and information is meant to shape the person who gets it, to make some difference in his outlook or insight." [p. 3.] The characterization given above is more general: it did not imply that information is meant by its originator to be transmitted to someone else. Furthermore, as in the example of putting the arm outside a window to assess the warmth, information may not be received through a message. But I appreciate the standard concept of information as a message (as long as it makes sense to a human receiver), because it covers most of the purposes for creating some data that is received as information. An important point here is "Strictly speaking, then, it follows that the receiver, not the sender, decides whether the message he gets is really information -- that is, if it truly informs him." [p. 3.] Later on, one reads "Unlike data, information has meaning -- the 'relevance and purpose' ... Not only does it potentially shape the receiver, it has shape: it is organized to some purpose. Data becomes information when its creator adds meaning." [p. 4.] The problem of the creator notwithstanding (who associates meaning is mainly the receptor, and the data "creator" could be a computer), it is nice to see that my ideas, developed independently, are in full agreement with some of theirs. Their last phrase in this section is worth mentioning. "The corollary for today's managers is that having more information technology will not necessarily improve the state of information" [p. 5.] This is obvious: the technology is a data technology, and not information technology or, in the best hypothesis, it is the technology of data representing information. As their book is on knowledge management, in their section on knowledge they provide a thorough characterization for it: "Knowledge is a fluid mix of framed experience, values, contextual information, and expert insight that provides a framework for evaluating and incorporating new experiences and information. It originates and is applied in the minds of knowers. In organizations, it often becomes embedded not only in documents or repositories but also in organizational routines, processes, practices and norms." [p. 5.] The characterization given in this paper restricts knowledge to a personal experience; it does not agree with the rest , which, by the way, is quite vague. In particular, routines and processes may not be in the minds of knowers, and written norms are just data. They may be read as information, but probably some of those norms are incomprehensible, and thus are just pure data. "While we find data in records or transactions, and information in messages, we obtain knowledge from individuals or group of knowers, or sometimes in organizational routines." [p. 6.] Yes, knowledge resides in individuals, but what they transmit and what may obtained from them is data representing some information (if the receiver understands it correctly), that is, messages, according to the authors, in general under the form of data. But the example of the shout (see section 3) also shows that information may be transmitted without being represented through data. It is interesting to observe that their valuable book does not mention competence. Sometimes they marginally touch "skills" [p. 11, 77, 97], but their main focus is on storing and transmitting knowledge (rather, their understanding of it), practices and technologies for knowledge management, etc. An interesting table summarizing various definitions and characterizations of data, information and knowledge may be found in [Stenmark 2002, chap. 3]. In his book, K. Devlin [1999] attempts "to develop a scientific understanding of information and knowledge." [p. 3.] "It is because it is built on a sound, theoretical investigation of information that this book differs from the majority of business books with the word 'information' or 'knowledge' in their titles." [p. 5.] But, at the end of the book, he says that his "theory", which he calls "situation theory" or, more precisely, his book is simply "at the very least, the beginnings of a science of information." It could not be more that that, because those two words depend on human factors, something he clearly recognizes in the latter case [idem]. Fundamentally, it seems that his "theory" covers specification of contexts ("situations"). In fact, in the sense given here, in order to have information it is necessary to have an understanding of the message or perceived phenomenon, as in the examples of feeling the cold air or shouting, cf. section 3. This involves context, given by the person who receives the message or has the perception. As I do, he also gives importance to a precise conceptualization of data, information and knowledge: "Understanding the subtle distinctions between the three concepts of data, information, and knowledge is essential..." [p. 14]. According to the concepts expounded here, the distinctions are not subtle, they are enormous; thus, maybe the concepts given here are clearer. "... as I have observed, the distinction between information and knowledge is not a clean one, and it in large part a matter of emphasis." [p. 151]. I consider my concepts quite "clean", mainly in practical areas like engineering or data processing (as it was shown in section 6, the situation is not so simple as far as purely intellectual areas are concerned). "Roughly speaking, data is what newspapers, reports, and 'computer information systems' provide us." [p. 14]. The definition given here (cf. section 2) is much more precise and generic. Then comes the following: "When people acquire data and fit it into an overall framework of previously acquired information, that data becomes information." [idem], which sounds quite circular. As a good mathematician, Devlin gives and "equation": "Information = Data + Meaning" [idem] and thus arrives to one of the forms of information given here. Recall that data are symbolic representations, and information can only be acquired from some data if meaning is associated to it by the receiver, but it may also be acquired without data. By the way, I don't like these representations through equations, as this would be a branch of Mathematics -- which it is not, by all means. How is it possible to add two different measures? (This is analogous to adding somebody's height to her weight.) Still worse, here we don't even have measures, although it is possible to express data through "bits" according to the definition given above. He does not define data and, as it was seen, "meaning" is something informal. Thus, this addition has nothing to do with additions in Mathematics, and the whole is not an equation, as he calls it. At this point, he reaches knowledge: "When a person internalizes information to the degree that he or she can make use of it, we call it knowledge. ... As an equation: Knowledge = Internalized information + Ability to utilize the information" [p. 15]. Here I diverge: in the expounded sense, it is possible to have knowledge without making use of it. Furthermore, it is possible to use theoretical information (internalized) to derive some other theoretical information. As characterized here, knowledge requires a personal experience, but the internalization mentioned by him could happen from theoretical data (as in the case of the guide book in section 3). But the disagreement is not total: "Knowledge exists in the individual minds of people." [idem], in spite of probable different conceptions of "mind" (to me, it is not physical). But, at the end of the book he says: "Knowledge is information possessed in a form that makes it available for immediate use to guide action." [p. 200]. If it exists in the minds of people, how is it possible to say that it has some "form"? I am not going to enter into the details of other formulations he gives to these concepts, because this would turn into a review of the book. It remains to briefly handle what he understands under "competence". He deals very little with it, and calls it "expertise" [p. 185]. "Competence" looks better, because at the lowest competence levels it is not possible to say that a professional is an expert. He introduces the competence grades mentioned in out section 5, characterizing each grade [p. 186]. The "novice" is a person characterized by following rules consciously and unquestioningly. The "advanced beginner" also follows rules, but "modifies some of the rules according to context". The "competent performer still follows rules but does so in a fairly fluid fashion -- at least when things proceed normally." "For much of the time the proficient performer does not select and follow rules." As it may be seen, these characterizations are not very objective and probably require that the degree of competence has to be determined by another professional. And what should be said about intellectual areas, as designing, computer programming, etc.? In these areas, the activity is always conscious. He makes the following association: "Stage 1 of expertise [novice] corresponds roughly to information so simply and directly linked to its representation that it could almost be classified as data. Stages 2 and 3 of expertise correspond more or less to the possession of information. Stages 4 and 5 correspond to knowledge." [p. 188]. These analogies do not seem to be very natural. From the points of view expounded above, it is like mixing up quite different things. Someone can do nothing just with data, because it has no meaning. With information it is possible to do something concrete, but only in the case it is related (or becomes related while executing an action) with some knowledge, otherwise it has nothing to do with the real world. In the characterizations given here, even in its more elementary level competence always involves some knowledge, because it deals with the real world. Devlin does not recognize that competence has to do with some ability (skill) over some knowledge area, as will be described in the next section. R. Lindgren expounds in his doctoral dissertation 6 papers he wrote on competence systems [2002]. It is interesting to observe here that he does not characterize competence: "The concepts of knowledge, expertise and competence are closely related and the boundaries between them are fuzzy." [p. 95.] I don't consider my characterizations above as being fuzzy (in fact, he wrote me that he found my characterizations quite interesting). He calls the attention to the fact that there is a clear distinction between competencies for job-based in contrast to knowledge-based enterprises. In the former, competencies are assigned to the required jobs, a situation best suited for mass production. Such companies follow fixed rules, laws and formulas. On the other hand, "The alternative to job-based organizing has been to design organizations in which the competence and knowledge of individuals are the primary focus." [p. 5.] Let us now shift to competence systems and then there will be more opportunity of citing his dissertation. 9. Competence matrices The example of a competent guide to Paris indicates that competence is an ability of producing something (acting as a guide) over a certain knowledge area (Paris). In this case, a person has to know Paris in order to be a competent guide to it, hence the knowledge involved. Someone is competent in a foreign language (the knowledge area), if s/he has the ability of reading it, or understanding the spoken language, or speaking it, or giving lectures in it, or doing written, simultaneous or consecutive translations from it, etc. Note that a person may have different competencies for every ability in each one of many different foreign languages. But for all foreign languages one may consider the same abilities. So, there is no sense in simply asking if a person is competent in, say, French: it is necessary to specify in which ability (or abilities) one wants to assess the competence, that is, what is (or are) the desired ability (abilities) in that knowledge area. This answers the question formulated in the beginning of this paper. Thus, one may construct for each professional a competence matrix, indicating in its rows the knowledge areas of interest, and in its columns the various abilities which apply to each area. Each cell contains a degree of competence, e.g. one of the 5 or 6 mentioned in section 5 or those that will be described in the sequel; the lack of competence may be indicated by an empty cell. I have not found this matrix concept in the literature. A professional may not be competent in a certain ability for a certain knowledge area (that is, he has not produced anything using that ability in that area), but may have knowledge (personal experience) thereof. I indicate this fact by assigning a degree of knowledge to the correspondent cell in his matrix. The same with information if there is no knowledge, in the senses expounded before. Thus, the competence matrices may be used for representing also knowledge (requiring some practical experiences, such as having done some exercises, having accompanied some project, etc.) and information (representing a mere theoretical knowledge, obtained through reading, studying, taking courses without practical exercises, etc.) To simplify the matrix, representing some degree of competence in a cell overrides the representation of some degree of knowledge, which in turn overrides the representation of information. This has worked well in the various areas of data processing; interviewed professionals were quite satisfied with this simplification. A further simplification was introduced at the PROMON competence management system (see section 1), shrinking the knowledge and information degrees from 5 to 3 values (none, weak/reasonable and good/excellent). In the PRODESP system the simplification was even greater: only two degrees were used for competence ("basic" and "advanced"), and only one each for knowledge and information (corresponding to having it or not), besides "none" (empty cell), valid for the three. For example, if someone has just taken a theoretical course or has read some manuals on a certain area a degree of information is inserted into the correspondent cell. If the same person has done some practical exercises or has carefully examined some products developed using that information, then it is classified as knowledge. Only if the professional has already produced some product in that area or has worked in it for some time a degree of competence is inserted. In the engineering and data processing fields, many products and systems are produced through projects. In these cases, the following typical abilities were represented, corresponding to project phases: 1. Analysis (of requirements and objectives); 2. Design (planning and product modeling); 3. Construction (programming, assembling the system or product); 4. Implementation (testing, user training); 5. Support (maintenance, help desk). At PROMON, the first two items were combined in just one, because I noticed that every professional that had one of them had the other too. The PROMON system had only one matrix, for I.T. In the PRODESP system, the number of matrices is variable. I introduced the following: 1. I.T.; 2. Systems (where each knowledge area is a system developed by the company); 3. Administrative areas (representing the competence outside of I.T., as for instance in legal procedures, management of human resources, etc.); 4. Foreign languages and 5) Academic studies. Abilities and competence degrees vary with each matrix. For example, in the academic studies there are only two columns for abilities: highest degree attained (complete or incomplete), and number of years of experience the professional has worked in each area which s/he had some academic study. The PROMON system was implemented as a prototype, using electronic spreadsheets. Thus, the competence matrix of a professional is simply a spreadsheet. Some database structuring was employed, such as coding the knowledge areas. On the other hand, at PRODESP a database management system was used, permitting the generalization of all data structures. Each matrix is displayed as a column in which knowledge areas are structured in the form of a two-level tree in the MS Windows standard, that is, with possibility of expanding or contracting the first level. Abilities for a selected area appear in a second column, as a pop-up window, also in two levels; the degree of competence appears at the side of each displayed ability. While exhibiting a matrix for a certain professional, the selection of an area with the "mouse" produces automatically the display of the degrees of competence assigned to that professional in the abilities valid for that matrix. 10. Selecting professionals For the selection of professionals who satisfy a certain combination of competencies, both PROMON and PRODESP systems employ the same matrix representation used for assigning competencies to professionals. In the former, cells are filled with the minimal desired competencies. When various abilities are desired for the same area or different areas, the logical AND connective is assumed. On the other hand, in the PRODESP system it is possible to specify if in the comparison with competencies of professionals which of the comparison operators <=, = or >= is going to be used. This indicates that the professional must have a "less than or equal to" (to see which professional doesn't have the minimum required competence, for instance when selecting candidates for a training program), "equal" or "greater than or equal" competence than that indicated at the selection window. Each line of this condition may have: 1) Just a knowledge area (indicating that any professional with any non-empty cell in any ability for that area is selected); 2) An area and an ability (any professional with any degree assigned in that ability for that area); 3) An area, an ability and a degree of competence (the overall comparison operator is used). A line with a selection condition may be combined with the next one through a logical AND or OR connective. In the latter case, it is possible to specify alternatives like "competence in installing UNIX or LINUX ". Besides the selection condition, it is possible to specify an obsolescence factor, giving the year in which the professional has worked in a knowledge area for the last time, as for instance "Select the professionals who have worked with Delphi at least up to year 2001." When using competence matrices and vectors for the assignment of professionals to projects and positions, one should take into account the observation that was made on objective and subjective evaluations (see end of section 5). During the specification of a selection condition the system assembles an SQL query to the database, using an algorithm which was designed for this purpose. The human resource management system PeopleSoft (HRMS module), which does not have the matrix concept (curiously, except for foreign languages, where various fixed abilities are considered), permits specifying selections with a "degree of importance" (from a fixed number of them) for each term of the selection condition. As there is an internal quantification associated with each degree of competence, a linear combination of degrees of competence multiplied by the degree of importance produces a numerical "global competence" for each selected professional. The system displays the professionals in the order of the global competence. The assignment of those weights is a delicate question. The system should permit their variation, making it possible to simulate various combinations; this is not the case with the PeopleSoft system. Lindgren [2002 pp. 43, 207] describes vaguely other competence systems: Compass (an in-house developed system at Frontec, Sweden), the Competence System (also developed in-house by the Guide company in Oslo), and the commercial products Prohunt (Palmér System AB in Sweden), TP/HR (Tieto Datema AB in Sweden), and a module of SAP/R3 which does not sell separately from the entire ERP system. He mentions just a few details; one may just guess that none of these systems use the matrix representation approach. The Compass system uses only free text, and no "formalized competence" (the competence degree above) [p. 209]. He mentions that users wanted both a formalized and a feature to insert free text to accompany the competencies. My system developed at PRODESP permitted only a free text for each knowledge area of each employee. Another very important point desired by various large companies that implemented the systems was the lack of an observation by the employee referring to his present interest in exercising each of his competencies [p. 49]. For instance, I presume that it is very common that a programmer with a large experience in a certain programming language (a typical example could be COBOL) has lost his interest in developing programs for that language (for instance, after having used 4th-generation languages for databases, application generators or object-oriented languages). Another desired feature was the availability of each employee -there is not much use in selecting people with a high degree of competence if they have no time to participate in a new project [p. 210]. Still another desired feature is knowing in which job or project each employee is working at the time. Finally, it is interesting to register the aspirations and ambitions of each employee [p. 213]. 11. Applications The application which generated the study presented here and its first implementation (the PROMON system) had as a goal the selection of professionals for organizing competence centers (see section 12) and the selection of project teams. At first glance, it may seem natural to formulate a single selection condition for the whole team of a project. Obviously, this is not the case: project managers think on terms of professional profiles necessary to form sub-teams. If there are no available professionals with a required competence, the fact that someone was able to transform, in another project, his information or knowledge into competence is a strong indication that s/he will repeat the feat. For this, it would be necessary to store a history of competence changes for every professional. Besides being useful for selecting professionals, competence matrices serve to count how many professionals have at least a certain degree of required competence in each area/ability. This resulting matrix was called general competence count. With it, one has the profile of the enterprise in terms of competencies. It is then possible to detect cells (indicating abilities within knowledge areas) where there are too few professionals with the required competence. To capitalize the existing professionals in the enterprise, for future fill-up of the required cells, the characterizations expounded above may be utilized. Thus, a professional who has a good knowledge does not require further training; what is needed in this case is participating in project teams to acquire the desired competence level in some area/ability. The lack of information indicates the need for some training, taking courses, studying manuals, etc. If a professional has the information, it may be useful to allocate him/her to a course with practical sections, or even to take part in a team to acquire some knowledge and eventually the required competence. The system may be expanded to represent core competence matrices. Each non-empty cell indicates the fact that the company finds it essential not to hire external services for the correspondent competencies, or not to acquire finished external products. Comparing with the general competence count matrix, it is possible to detect which areas/abilities are lacking in the enterprise or require upgrading. Competence management systems as described here may be useful for a call center, when looking for professionals to answer customer questions, to select candidates within the company to participate in disputations for vacant positions, professionals that may give press interviews on some of the company's projects or products, etc. Another application could be the allocation of teachers in a state or city educational system. For example, the State of São Paulo Education Secretary hires hundreds of thousands of teachers. The system could allow for the creation of a matrix with cities, locations and neighborhoods, so that it could indicate teachers according not just to their pedagogical competencies, but also according to preferences of geographical region. Finally, a similar system could be useful for collecting data on candidates for enterprise openings, because the matrices of a professional may be considered as curriculum systematization. A specific system for openings is the excellent Selector, developed by PCA Software Engineering, in São Paulo (see www.selector.com.br, in Portuguese). It is based upon "curriculum cards". There is a card with basic information, permitting each enterprise the addition of its own cards. Any person may introduce his/her professional data through the Internet, filling up the basic card, adding up further data in specific cards corresponding to enterprise openings, introduced by the latter. In some sense, PCA has created a language for the definition of curriculum cards. The system has an interesting possibility of associating weights to each required competence, thus permitting the ordering of the selected candidates. This method permits knowing which candidates fulfill all the minimum requirements, because they receive the maximum weight sum. Many great enterprises are using the system; through the Internet it is possible to gain a good idea of its principles, implementation and usefulness. Up to the time this paper was written, I felt the absence in that system of the concepts formulated here for information, knowledge and competence and the characterization of the latter as the confluence of an ability over a knowledge area, with the corresponding matrix representation. 12. Implementations At PROMON, the prototype of the competence management system was implemented with José Márcio Illoz, using a spreadsheet program. A "standard" matrix was implemented, with the knowledge area names in its rows and abilities in the columns. Areas were codified, with their codes being used for making the logical link with a spreadsheet containing the consolidation of competencies of all professionals. This consolidated matrix is used for selecting and counting of professionals. A very important work connected to competence assessment is the establishment of knowledge areas. In the case of PROMON, 160 different areas were collected for I.T., divided in 3 hierarchical levels, which were called "great areas", "areas" and "sub-areas". Unfortunately in the I.T. field it is necessary to enter into too much detail. For example, a professional that has competence on the installation of LINUX may not have it for MS NT. The PRODESP system was programmed by Mateus Saldanha using Delphi and Oracle. The use of a network database management system permitted the generalization of the system for any number of matrices, and a variable number of abilities per matrix (in the PROMON case, there was just a single matrix with a fixed, pre-determined number of abilities, suited to the fixed format imposed by the electronic spreadsheet software). The variable number of abilities per matrix permitted increasing the number of abilities in relation to the PROMON system for the I.T. matrix. In the PRODESP system, there are 3 groups of abilities: infrastructure (hardware and software), system development, plus acting as instructor in the knowledge area (an ability present in every matrix), in a total of 8 second-level abilities. This way, area duplication was eliminated. For example, in the PROMON system there was a UNIX entry in the section for infrastructure, that is, installation of this operating system, and the same area for development, that is, using UNIX for developing systems. One of the consequences was the reduction of hierarchical levels of knowledge areas to just 2, simplifying the system. Finally in the PRODESP system, an obsolescence factor (see section 10), and an access security system were also introduced. Security in the PRODESP system was made in 4 access levels. The first corresponds to the "generic user", and is opened to any person having access to the network. Such a user may only use the screen for specifying conditions for the selection of professionals. S/he obtains only the name and basic personal data of those professionals who satisfy the given selection condition. The "personal user" has his/her data stored into the system, with a password. Besides selections, such a person may access his/her own personal data and competence matrices, and may change them. The "supervisor" may have all the access of a personal user, plus being able to read the competence matrices of his subordinates. Finally, the "system administrator" has unrestricted access to any data. A delicate question concerns filling up the matrices. At PROMON, I did the interviews and assigned the competence degrees together with each professional. At PRODESP this would be unfeasible, because there were too many professionals (about a thousand, just for I.T.). The solution was to give a lecture on the system concepts, and letting every professional fill up his/her own matrix. Later on, the supervisors or project leaders would to revise the data and try to uniform it. Unfortunately, due to internal problems both projects were stopped short of being used to select professionals. 13. Competence Centers An enterprise dedicated to projects may be organized into "competence centers" (CCs). This means that professionals are grouped not into business departments or divisions, but into groups of associated knowledge areas. In this organization, business departments are reduced. Their goal is now to develop new projects for the enterprise or its clients. The project designs are the responsibility of business departments, which request from the Project Management Center one or more managers for a project, and from each CC the technicians necessary to develop it. Obviously, clear characterizations of information, knowledge and competence held by professionals and surveying them are necessary for the characterization, organization and functioning of a CC. The reason for organizing CCs is clear: enterprises want to optimize the allocation of human resources, diminishing personnel idle time and choosing for each project or function the right people with the necessary competence. Moreover, such an organization provides for a much greater flexibility and operational dynamics, making it surely more suitable to our hectic, fast changing times. Thus, the organization of CCs seems better suited for knowledge-based enterprises than for job-based ones. The advantages are clear. But, what about the disadvantages? I fear that CCs may disrupt social integration and a sense of identity to the company. Professionals may identify themselves with the project they are involved with, but projects are not so stable and durable as departments and divisions are. When a project was the initiative and accomplishment of a department, finishing it would mean remaining in the same department and embarking on another project with some of the same colleagues and in the same administrative environment. Coming from a CC, after the end of a project the professional will return to that center, meeting people that participated in other projects. It is being argued in the company for which I was studying the question of the IT CC that professionals will develop an identity to their CCs. They will be able to interact much more with their peers, who were scattered along many departments in the classical organizational models and had almost no contact with each other. This should also provide for the exchange of information and knowledge, helping the development of competencies through joint work. I hope this view is correct, and that my fears do not materialize. 14. Conclusions In this work original characterizations for information, knowledge and competence were presented. In the dozes of interviews I made to assess competencies of professionals in the I.T. area, those concepts showed to be extremely useful. Interviewed people got rapidly used to them, when classifying their degrees of information, knowledge and competence. Another contribution was the characterization of competence as referring to a certain ability over a certain knowledge area. This way, it was possible to introduce more clarity into those concepts, and represent competencies as bi-dimensional matrices, grouping together knowledge areas in different matrices according to the set of abilities which apply to different areas. These matrices represent, in essence, a systematization of curricula in terms of competencies, knowledge and information possessed by professionals. Traditionally, curricula used for selecting candidates for available positions in the formation of project teams consist of non-systematic, free texts. Even when well organized in clear sections, these texts cannot be subjected to data processing, as opposed to the method seen here. It differs from other competence management systems by using matrices and taking into account different degrees of information, knowledge and competence. It is important to emphasize that the computer only indicates which professionals qualify to the required competencies. After this indication, one should proceed to examining curricula, doing personal interviews, etc. This way one complements the data with a phase of subjective analysis, always necessary when dealing with human questions (see section 5); otherwise, people are handled as machines, leading in general to psychological problems, besides faulty selections. The practical results of competence assessments using this method at the large companies PROMON and PRODESP was very good. Professionals were thankful for the opportunity of having their curricula represented in a systematized way, and the possibility of continuously updating them. The desired features mentioned by Lindgren [2002] at the end of section 10 would had certainly enhanced the usability of the systems implemented at PROMON and PRODESP. My doubt here is how much data one should collect from each professional, so that inserting data and maintaining it does not turn into a too cumbersome task, as well as taking too much time. By the way, one critical aspect of competence systems is certainly the maintenance of the data. Probably, the system should verify periodically for how long each professional has not inserted any changes to the database, acknowledging him and his manager of a problem in this direction. Probably the best way to maintain the database with up-to-date data is managers keeping a constant eye on the introduction of new data by subordinates. There are 3 great problems for assessing competencies with this method. Firstly, there is a need of leveling the criteria for assigning the various competence degrees, otherwise it is not possible to compare professionals. This problem was solved at PROMON by concentrating all interviews in just one interviewer. But this is not feasible when having hundreds of professionals, because each interview takes at least 1 hour. Secondly, this method does not take into account the quality of projects and the work accomplished by assessed professionals. For this, it would be necessary to introduce one further factor, which should be assessed by managers and project leaders. But then an aspect of assessment by third parties would be introduced, with all the social problems involved. I avoided these problems by not considering that quality. A third problem, which was not faced, was the introduction of a behavioral matrix, with competencies on leadership, on team work, quality of written and oral communications, etc. Many authors give more importance to behavioral competencies than technical ones, as for example Daniel Goleman [1993]. But the assessment of these competencies introduces a delicate factor which should probably avoided in an initial phase of a competence management project: the need for an assessment done by the professional's manager. At PROMON the requirement was to assess just technical competencies, thus the problem was cut at its root. At PRODESP, I proposed to start without behavioral competencies, in order to avoid the problems originating from a professional being assessed by a third party, which could position them against the project. Maybe because in both cases I avoided such conflicts, the system was extremely well accepted by professionals in both companies. References Cusumano, M and R.W. Selby [1997]. How Microsoft Builds Software. Communications of the ACM. V. 20, No. 6, June 1997, pp. 53-61. Damasio, A. [1994]. Descartes'Error -- Emotion, Reason, and the Human Brain. New York: Putnam. Davenport, T.H. and L.Prusak [1998] Working Knowledge: How Organizations Manage What They Know. Boston: Harvard Business School Press. Devlin, K. [1999]. InfoSense: Turning Information into Knowledge. N.York: W.H.Freeman. Goleman, D. [1993] Emotional Intelligence. N.York: Brockman. Lindgren, R. [2002]. Competence Systems. Doctoral Dissertation. Göteborg: Viktoria Institute and Department of Infromatics, Göteborg university, Sweden. Malhorta, Y. [1998]. Tools@Work - Deciphering The Knowledge Management Hype. Journal of Quality & Participation, special issue on Learning and Information Management, V. 21, No. 4, July/Aug 1998, pp. 58-60. Available through http://www.brint.com/km/whatis.htm. MIT. I/T Competence Model. Available through http://web.mit.edu/is/competence. Talbott, S. (Ed.) [1998]. Netfuture #81. Dec. 10, 1998. Available through www.netfuture.org. Searle, J. [1991]. Minds, Brains & Science. New York: Penguin Books. Steiner, R. [1963]. The Philosophy of Spiritual Activity (transl. Hugo S. Bergman), GA (complete works) 4. West Nyack: Rudolf Steiner Publications. Stenmark, D. [2002]. "Information vs. Knowledge: The Role of Intranets in Knowledge Management". In Proceedings of HICSS-35, IEEE Press, Hawaii, January 7-10, 2002. Available at http://w3.informatik.gu.se/~dixi/km/.