Instructor (Mehran Sahami) : So Welcome Back! So Yet Another Fun-Filled, Exciting Day

Download as pdf or txt
Download as pdf or txt
You are on page 1of 17

http://technicalsupportindia.blogspot.

com/
Programming Methodology-Lecture24 Instructor (Mehran Sahami): So welcome back! So yet another fun-filled, exciting day of 26A. After the break, it almost feels like I came into the office this morning, not that I wasnt in the office during the entire break, but I came into the office this morning and it felt like a new quarter had started. And I was like, oh, its been a whole week. And Im sure for you, it feels like you just wish a new quarter was starting because we still have two weeks left. So a couple quick announcements before we get into things. One is there is one handout, which is your section handout for this week. And kind of one of the themes of this week is bigness. In some sense writing bigger programs, bigger data structures, thats the whole deal. And well kind of talk about that as we go along. Another quick announcement, just wondering how many people tried the Name Surfer demo online and had a problem with it? You folks, yeah, we updated it. So evidently there was some issue that only shows up on Windows XP with Java 1.6. And like, if you had a Mac you didnt see it, if you had Vista, presumably, you didnt see it, if you had Java 1.5 you didnt see it. But in that one case, it would come up, so the name suffer web applet demo was updated a few days ago, I think on Friday, maybe. So now it should work for everyone, hopefully. If you still have an issue, let me know. The only thing that youll see now, though, if youll try running this applet, is that the interactors, instead of it being on the south border of the screen are on the north border of the screen. That was just a little hackler we had to put in there to get things to work. The functionality is exactly the same. As a matter of fact, if you want, you can put your interactor and border on the north instead of the south. Itll make no difference to the rest of your program other than where you say south for adding your interactors, you say north, thats the only place it makes a difference. But you actually see that in the web applet version, the interactors are just in the north border instead of the south. Otherwise, it doesnt make any difference. But in case you saw that and was like freaked out, theres nothing to worry about. Okay. So also, I hope you had a good break. Just wondering, how many of you actually enjoyed their week break? Good time. And how many were working most of that break? Yeah. Good times. Hopefully, it didnt cause you too much pain, but if it did, hopefully, youre like, all caught up or ahead of the game in all your classes, now, so life is good. So I want to spend a little bit more time talking about today, well, actually a lot of time talking about today, is thinking about data structures, building large-scale data structures. And we begin to talk about it just a little bit before the break and its been a while so were going to review it a little bit and kind of build up even more. But one of the things we talked about, in the past, right, what a lot of our computers do is they manage data. They manage lots of data. And in fact, I would venture to guess that theres a whole bunch of applications out there that manage a whole bunch of data about you, but you may not have thought about all the data they actually manage. So some of the things that

http://technicalsupportindia.blogspot.com/
actually come up, for example, online stores, right. Anyone actually bought anything online, just wondering. Yeah. Theres a huge amount of datas that involved with that. Not only the particular transactions you make when you buy something, but keeping track of accurate transactions, figuring out things like people who buy product X also, tend to by produce Y. All of that is data management. And what makes those companies successful is they just do a very good job of managing their data. Okay. There are other things like, Im almost frightened to ask, but social networks, like, Facebook, or MySpace, or ORCHID, or Friendster, or Linkdin, or you could just keep going on. Anyone on a social network? Just wondering. Yeah. Thats good because your next assignment is going to be to implement one so you can see what its actually like. But that will be coming in a couple of days. And theyre not that hard, really. But what it is is a data management problem. Right? And it keeps track of things like who you are and information in your profile in the social network, and who your friends are, and all that happy news. Or you know, even things like a friend web search, right? Theres a huge amount of data you need to be able to keep track of to be able to web search, right? So all these things are all about managing data well and so part of this class, right, is youve got a whole bunch experience in terms of building up code, and different kinds of classes, and doing nice things with user interfaces, and the whole deal. And one of the things that we need to spend a little bit more time on is talking about how do you manage lots of data and then do something interesting with that. Okay. So heres some principles to think about, if we think about good software engineering, some of the principles of thinking about data kind of in the large. Okay. When you think about keeping track of lots of data, one of the things you want to think about is the information you want to keep track of, what are the nouns you want to keep track. And youre like, I dont want any nouns, what do you mean by the nouns? Lets say I was writing an application that was an online store, to keep track of, oh, lets say, music. And so one of the things I would want to think about is, where are the nouns that are associated with music? Youre like, okay, now youre really getting weird. No, its pretty simple. Things like a song, right, is a noun, thats associated with music, or an album, or an artist, right. And so what you want to think about is the things that are the nouns in the domain that youre dealing with oftentimes end up translating into what your classes are. So you may end up having a class that keeps track of information about a particular song or class that keeps track of information about a particular album. So the good linguists out there tell me we not only nouns but we also have verbs, not unless you happen to be talking to my son, who seems to only have nouns, but thats a different story. And he loves jarens by the way. But like, why are you telling me this? Just cause its fun, because I just spent a whole week dealing with it. In terms of verbs, these are oftentimes the methods that are associated with your classes, right. So when you want to do something, some noun takes some action, which is a verb, which is some class, has some method that operates on that class. So at an abstract level thats what you want to think about in terms of high-level principles of design. Now, there are some other

http://technicalsupportindia.blogspot.com/
sort of more concrete things that you might want to think about, things that have to do with what are the characteristics of the data you actually want to store so one thing that comes up, oftentimes, is thinking of the notion of having a unique identifier, identifier. What do I mean by unique identifier? All of you have unique identifiers, whether or not you like it or not, as a result of being at Stanford. Your Stanford University I.D. number is a unique identifier for you at Stanford. Every student has an I.D. number. Okay. So it identifies you and its unique. No two students share the same I.D. number. So you get issued this number when you show up here and you have it for life. When you leave its still with you. I know, I left, I came back, I have the same student I.D. number. It just exists and this uniquely identifies you. And in different cases you might want to think about what are unique identifiers. Right. So in some cases, for example, if you had a social network you might consider the names of people and not the social network to be identifiers, or say the names of their profiles, for example. In other cases, you might have something different. If youre managing a store you might have some I.D. number for books, an ISBN number, or if youre keeping track of music, you might say that the combination of the songs name and the band that plays it is a unique identifier for that song. In some cases the unique identifier can be a combination of things. But if you think about your data having a unique identifier that also gives you some insights about what kind of data structures you might want to use to keep track of certain things. On other unique identifier that some of youve already grapple with is saying Name Surfer. Right? If you think about the data in Name Surfer whats the unique identifier there? Student: Name. Instructor (Mehran Sahami): Name, right? Name is a unique identifier and for every name you have some list of values associated with it, which was the rank of that name over the last century in terms of how popular it was for names. But every name, well, I shouldnt say every name has some value associated with it, but every unique identifier in the system has some value associated with it and only one set of values. And so the important thing to keep track of there is when you actually are doing your Name Surfer assignment to fact of this thing is a unique identifier can potentially help you keep track of the data that youre using. And well sort of go into that as we go along in the class. Okay? So some other principles we can kind of think about in terms of designing data structure, in terms of actually doing the design, theres some questions you want to ask yourself. And the questions you want to ask yourself is, are you keeping track of some collection of objects? Right? So there comes some collection of objects through data that you want to have. And if you have a collection of objects, say in an online music store, you might have a collection of songs that you want to keep track of, this word should be a tip off too, that perhaps theres an interesting collection that exist in Java that would be a way of keeping track of that information. It may not be in Java if youre programming in some other language. But the fact that Java has something called a collection and the reason why they gave the name of collections to a certain group of stuff is because theyre used

http://technicalsupportindia.blogspot.com/
to keep track of a collection of objects. And the question that you ask yourself then is what collections do you actually want to use? Okay. So with that said, what we can do is spend a moment, and it will be a brief moment, revisiting the collection hierarchy. Right? Youve seen this picture before but Im just showing it to you again, because the last time you saw it was like two weeks ago, which is a lifetime in a quarter. Right? I think it too, its about a fifth of the quarter. Youre like, oh, what was I doing two weeks ago? Was the break out, was I learning Printland? No, no it wasnt that long ago. But what you were learning about, a little bit, was collections. And o there are some collections, for example, like an ArrayList that going all the way up the chain of the hierarchy is itself a collection. Or there are other things, for example, like a HashMap. And a HashMap, if you said hey, I have some HashMap, the set of keys in that HashMap ends up actually being a set, which happens to be a collection. Okay. And so what you want to think about, do I have different things that I can keep track of? Like an ArrayList is one way to keep track of things. A HashMap may be another way of keeping track of things. When is the appropriate time to use one thing versus another? And so when you want to think about the appropriate times of one versus the other, you want to think about what are the methods that a collection provides to you. And it turns out all collections that implement the collection interface, like the ArrayList or the key set of the HashMap, have all of these properties. And some of you have seen them before, but just to review. You can add a value, right? So this is a parameterized values type. Like you can have an ArrayList of strings and you can add some value to it, and it adds it to the collection, and little did you know, or maybe you did know, but at the time we didnt really care about it, was it returned a bullion. Most of the times we just returned the bullion, we didnt care about it. But actually returned true the collection changed. So in an ArrayList, it always returned true because when you were adding a value, it didnt care about duplicates, it would always just add them to the end and always return true. Some collections, like sets, actually dont allow you to have duplicates. So if you try to add something to a set that already has the value youre trying to add, it will not change the set and return false, because it says, hey, I already have that value and nothing changed. A couple other things that you should know about, most of these youve seen. Remove, removes the first instance of an element as it appears and returns true if a match is found or returns false if it didnt find anything to actually remove. And clear, basically, just sort of nukes the whole collection. It just says get rid of everything in the collection. Im done with that and the collection is dead. Actually, the collection is not dead to you, it still exists, its just an empty shell of what it was before. There are violins playing in the background. And then size, you can get the size of the collection. Youve seen this, youve probably used a lot of these before in your programs. Contains, thats an important one, right? You actually want to see if a collection contains some particular value, if a collection is empty. And heres one thats sort of interesting that we talked about a little bit but we didnt actually talk about the fact that a collection or all collections can give you one these. All collections can give you an iterator.

http://technicalsupportindia.blogspot.com/
So we talked about, for example, having an iterator over the key set of HashMap. Thats one thing we did before. We said we had some HashMap that lets a map from strings from some other strings. And we want, say, hey what I want to see is get a set of all the keys and I want to iterate all over those keys. Thats great! You can do that and thats perfectly fine. When we used ArrayList we always had like a four loop and said, oh, from zero up to the size of the ArrayList do something. But if we actually wanted to, we could have an iterator over the ArrayList and then this would give us the elements of that ArrayList one at a time. So because an ArrayList is a collection it can also give us an iterator. And thats just something to keep in mind is that theres common patterns that get used in programming. One of the common patterns that get used is known as an iteration pattern, which again, is an iterator over some collection and you just go through and do something like printout the values for every element of that collection. And if you want to write it in the most general case, you dont care if that collection happens to be the key set of the HashMap, or an ArrayList, or whatever, you just say, hey, youre a collection, give me an iterator and I can go through all your elements one at a time and, for example, print them out. Okay. So theres just simple patterns that we get into. Now, youre like, okay, Marilyn, thats fine, you told me some design principles over here, you told me about some collections over here. Show me something concrete, like put it all together. So lets actually put it all together. Okay. And well view a little example, which is going to an online music store. And because many names for online music stores are already taken, our music store is going to be called Flytunes cause theyre tunes that will fly. All right. Yeah, man, when youre like in your mid 30s you just cant be that cool. But trust me, it is. Okay? So were going to make a little store that just keeps track of music and albums, and that music and actually lets us keep track of information and prices. And so what we want to think about is what are the things that we actually are going to do in that store, okay? So one of the nouns of that store is going to be a song, okay? So a song is some basic thing that were going to sell. This is what we want to be able to do with the song. Now, you could say, well, what does that mean, do I have some method called sell? If were doing inventory management we might not actually have a method called selling a song, but we might, for example, want to add for inventory to do things like add songs. And similarly, songs, oftentimes, are put together into albums. Okay, so we may also want to keep track of albums and do things like add albums to our inventory. Now, the interesting thing with an online music store that differentiates it from say a physical music store, is you can do interesting things, right? You can actually have songs that are not on any albums. And that works, right? Its kind of like thinking of a single, right. When you go and buy a single somewhere. In the days of yore, you could actually buy a little record single that had two sides on it so you got two songs, so it wasnt really a notion of a real single, single. I guess now, there are like CD singles. But who wants a CD single when it comes down to it? You can get songs that are on albums. At the same time, you can have the same song be on multiple albums, right? That always happens. Theres a band, I wont mention their name, but I remember from the early 80s, they had two albums. They had their first album and they had their best of albums, which were half the songs from their first album. Just anything you can do to milk the consumer. But

http://technicalsupportindia.blogspot.com/
basically, what that meant was songs can show up on multiple albums. Okay, so we want to begin to think about how that might actually affect our design. Now, if we think about putting the information together, right, nouns become our classes. So if were going to have song as a noun, were probably going to have some class song thats going to keep track of all the information associated with a song. And so just for the sake of brevity, Ill tell you what informations going to be associated with songs that we care about in our store. Theres a notion of the name of the song, the band or artists that perform that song, and then a price, because were going to allow for songs to be sold individually, so individual songs, as opposed to whole albums, have prices. Okay. And you can think about these things and think about, oh, what data types do you want to have for them. Right, so what type data type makes sense for a name, for example, string type, or if you want to have a band name, this would probably be a string. Price is always an interesting one. You could sort of say, well, now, and theres multiple things I could have it be. I could have it be an [inaudible], for example, if I was going to have it be the number of cents. In the simplest case, Im just going to have it be a double. Even though we know theres no fractional money unless youre a banker, in which case there is fractional money. But we wont talk about that right now. It was just like Superman III. Anyone see that movie? No. Its not worth watching, trust me. But fractional money does exist outside of movies, bad movies in Hollywood. So thats the information we want to keep track of for a song and then we want to think about what are some of the things that we want to be able to do in relations to those songs. The other thing we also want to think about is our friend the unique identifier. Is there some unique identifier for a song? And this is one of those things you really need to think about the application that youre using, what assumptions you can make. We might like to say that the name is a unique identifier for a song, but unfortunately, there are many songs that have the same name. Okay. But I would venture to guess that the combination of the name and the band would perhaps be a unique identifier for a song. The only thing is we dont have one string that we keep track of that keeps name and band in it. So thats another thing that we need to think about, and well get into code when we get into code, that we need to think about the design of that particular object. The other thing we need to think about is what changes in an object during its lifetime and what doesnt change. Like, so if I have a song its name and the band that made it for a particular song, like, some band can go uncover the song they learn, but thats a different song, the name and the band name dont change for a song. But hey, it can go on sale and you know, I can jack up the price at the holidays and all that kind of stuff. So the price is something thats malleable. So another thing you think about in terms of the principles of design is, of the data that I have associated with a particular object, whats going to remain static when that objects created and whats going to be potentially changed? And thats what gives you some insight about whats some of the data, for example, that you only get from an object, whats some data that you can potentially set in the object, and if you think about what potentially uniquely identifies that object, what data do you actually need at the time that

http://technicalsupportindia.blogspot.com/
you construct the object, right. To say this object is actually some particular unique thing that I care about. Okay. So lets turn that into a little bit of code just to make it a little more concrete. So well get rid of our friend, Power Point, and we fire up our friend, Blitz. Ah, and look, a song, how convenient. So heres the information to keep track of a song. Its just a class called song. And what we want to do is keep track of song, the songs name, the band name, and the price. So when we create the song, one of the things we might do is say, hey, give me all that information to start with. Because if youre going to put some song in your store and youre going to sell it, it better have some song name and band name that I can use to refer to it by, because thats going to be its unique identifier and give me some initial starting price. Now, we might necessarily not require an initial starting price, because its something thats going to change during the duration of the program, and isnt in support of our unique identifier. But in this case, were just going to ask for an initial price. The thing we do care about, in terms of the malleability of whats actually in this data structure, is thinking about song name, band name, and price. So song name, we only have a getter for there. Theres no setter. Once the objects created, you cant change the band name for that song. You cant say, oh yeah, you know, that was In Your Eyes, by Peter Gabriel and now its going to be like, In Your Eyes, by Kanye West. Like, thats a different song. And I dont know if thats happened. Its probably not a good idea. But the song remains the same, if youre a Led Zeppelin fan, right? And the band name, actually, the band name is also going to remain the same for that particular song. But the price has both a getter and a setter. Right? Because its something thats malleable. After we create that song, yeah, we might change its price. And because we know that we provide both of those things in the definition of the class. Now, as we talked about, in days of yore, whenever you create any class it should also have a method called Two String. And Two String just returns a string representation of the data in that class. So this just prints out inside double quotes, which is why we have this backslash, quote, thats a single double quote character, the title of the song in double quotes by the band name, and then it says cost, and it has the price associater with the cost. So it just returns a string to baseline caps lets the data. And heres the private instance variables of that particular class. Right. Theres a title, a band, and a price for the title of the song, the band that made the song, and the price of the song. And thats all the information thats in there. But it captures and encapsulates the notion of having a song and what parts of the song are static or cant change, and what parts of the song are mutable or can change. Okay. So besides songs, we also have this thing called albums. Any question about the song portion? If youre sort of feeling good with song, nod your head. All right, good times. If youre not feeling good song, shake your head. If youre awake nod your head. Theres a few thats not nodding, but thats okay. Thats cool, too. So lets do the class for an album. So the class for an album is another thing we care about. And albums become a little more interesting because an album not only has a name, right, so this is going to be a name, and yeah, the name will probably be some string. And theres also a band, potentially, that produces the album. Now, the interesting thing is the band you might say, but Marilyn, isnt that redundant? Like, dont I have

http://technicalsupportindia.blogspot.com/
some album and its going to have a bunch of songs on it, and so I already have names for the band for those songs? So why do I need the name of the band for the album? Anyone know? Want to venture a guess? Anyone have an album thats like this, 80s compilation is the critical word? Right. You can have an album thats band isnt actually a real band name. Its band name could just be something like compilation. And its going to have a bunch of songs on it, each of one which has a distinct band. Okay. So thats perfectly fine. Theres no reason why an album, especially in the online world when you can sort of create mixes all the time, needs to have a single band. And so there wouldnt be a need for having bands associated with songs. We still need to have bands associated with the songs. And potentially, at a higher level, we might want to be able to say, is this whole album by one band, or one artist, or is it actually a compilation. Okay? Now, the interesting part though, is that an album not only has a band and name, but it has a list of songs. So how might we keep track of that list of songs? What would be a reasonable data structure we could use? Student: [Inaudible]. Instructor (Mehran Sahami): An Array, our friend an Array. Well, the only problem with an Array is, right, it needs to have some fixed size. Theres some albums out there that are very short, like In A Gadda Da Vida, Iron Butterfly, theres one song thats one side of the album, if you were back on the LP days, and what a fine album it is. And theres other albums that are just like, oh, look theres like 300 songs on here. Okay. So an Array with just a fixed size might potentially waste a lot of space. Whats the more malleable version we could use? Student: [Inaudible]. Instructor (Mehran Sahami): Oh, yeah. I love it when its just all around. All right. Student: [Inaudible]. Instructor (Mehran Sahami): [Inaudible] one, I think. Like that post Thanksgiving. Its like the tryptophan, still like working its way. Yeah. You know. albums, to begin with. How do we actually add some list of songs on it. We need to have a way to be able to add songs to this album, and once we actually add songs to a list of songs on the album, we need to have some way of being able to list the [inaudible], or perhaps, iterating over them. The only thing with an ArrayList is enter implements collection interface so that it actually provides you enter it. Okay. So lets look at the code for that, just real quickly and then things will become more interesting, afterwards. Okay.

http://technicalsupportindia.blogspot.com/
So heres an album. Inside an album we have an album name and a band, those are the things that are going to start off by constructing an album. So we say heres the initial album name and band, and what I want to do is build up the contents of that album. So it lets you get the album name and get the band name but you cant set them. Those things are fixed. Okay. The other thing that Im actually going to assume here, which is something I didnt assume for songs, is that the name of the album is a unique identifier for the album. Because if I can potentially have compilation albums thats a compilation of multiple bands, so the band name is just something like compilation or maybe the band name is empty string, the album name by itself should be a unique identifier. Now, you might say, but Marilyn, thats not true in the real world. I have multiple albums that have the same title on them. Were just going to assume that for the purposes of what were doing here, and itll be okay. How do we build up the album? We have a notion of adding a song to an album and getting an iterator over the songs on the album. And so the way we do that is were going to have something called songs. Let me show you songs down here. Songs is just an ArrayList of songs. Okay. And so if I want to add a song to the album, I pass it in an actual song object and it adds it to its ArrayList. And if I want to list out all these songs that are on the album, I ask for an iterator over all the songs on the album. So what I actually get is an iterator over song objects. Okay. Two stings just returns the title and the band, it doesnt actually list out all the songs. It just says, hey, its just this name of this title and this band, and thats all thats in an album. Okay. Again, we think about whats mutable and whats not mutable. Now, to put the whole store together, this is where things get a little more interesting. To put the whole store together, you need to think about whats the store going to do. So let me show you a simple store running and this is the basic text interface for a store. Its kind of like online store circuit of 1995. Okay. So I can list out all the songs, I can list out all the albums, I can add a song, I can add an album. When the store starts, I have not songs or albums in the store. I need to add them all. I can list all the songs on a particular album and I can also update the price for a song. Okay. So if I list out all the songs. It says all songs carried by the store and says nothing, because theres no songs that the store currently has. And list out all the albums carried by the store and list out nothing here, because therere no albums. But I can go ahead and do something like add a song. And lets say the song I want to add is In Your Eyes, Peter Gabriel. Any Peter Gabriel fans out there? No? A little bit? Come on. Oh, man. I give up. Its all over. I just dont believe it. All right. Well say the song is, I say, okay, itll be 99 cents. Go get it. All right. So we add a song and if we list all the songs, now we have heres the string representation of a song, In Your Eyes, by Peter Gabriel, cost 99 cents. We still have

http://technicalsupportindia.blogspot.com/
no albums, rights, we just have a particular song that we can potentially sell by itself and we dont have any albums. So well come back to this. But this is the basic idea. We want to be able to list all the songs and albums, add songs, add albums, and then list the information for a particular album. Okay. So if we think about that, what we need is a bigger data structure to keep track of all this information about multiple songs and multiple albums. Okay. Now, if we want to manage an inventor, the two things we have to keep in mind are also what I mentioned before. A song can exist in our data that is not on any particular album. So as a result its not sufficient to just say what albums are carried by the store, because some songs may not be on any album, but we still sell them individually. So we need to have some notion of keeping track of a list of songs. Now theres different things we could think of for a data structure to keep track of songs. One thing is an ArrayList, right. Thats what were using in albums to keep track of a whole list of songs. Another thing we could consider is a HashMap of songs. And so if we think about a map versus an ArrayList, what question that you want to think about gets back to this identifier question, right. Because if you want to have a map, say for example, some string to song, and you want this string to uniquely identify a song, this string needs to be something that is a unique identifier. But a song doesnt have one string thats a unique identifier, its unique identifiers a combination of a name and a band. And so all kinds of funky things that are things that people consider. Oh, how can I connect those two strings together? People actually do that in real applications. Were not going to do that here. Were just gonna say, theres too much complexity in dealing with this, were going to go for a much simpler approach and just say were going to have an ArrayList of all of our songs and not worry about the unique identifier issue. So here we have an ArrayList of type song, and well just call this songs, thats all the songs in our database. And so here we create a new ArrayList of song and we call it constructor. Okay. Now, life in the album worlds a little bit different. Besides just keeping track of a list of songs, we also need to keep track of albums. But in the album world the name is actually a unique identifier. And if we want to be able to look up albums quickly, it might make sense to use a HashMap. So part of doing this whole example is to actually show you both ArrayList and HashMap in one application. So what we could do is have a HashMap that maps from stings to albums where the map, this string, is in some sense the name of the album and this is the actual album object. And well call this albums and we can do all the new, you know, la de da HashMap we actually created. Okay. So now we have these two big data structures that actually keep track of stuff for us. Now, heres where things get a little bit funky. And when things get funky, what youre going to need when you deal with big data structures, you need a guide. And youll see this in just a second because youre going to see some of the code that we write gets very

http://technicalsupportindia.blogspot.com/
long when we deal with big data structures. So Ill be your guide. All right. So in the days of yore, I almost bought the whole outfit. But its a little hot in here, under the lights. So in order to actually think about how you get the information and store the information when you have a large data structure, paper and pencil is your friend. Right If you spend all your time just staring at a computer screen it doesnt really allow you to internalize what is your data structure really look like and whats going on. So break out some pencil and paper, not right now but when youre working on data structures, and draw out, potentially, what things look like. So heres songs and songs, and songs is an ArrayList. And its going to have multiple, lets say at this point, three songs in it. And over here we have albums and albums is a HashMap, albums, that maps from names of an album to a particular album object. Now, the important thing to keep in mind in objects, and this is kind of the whole key to big data structure, is all objects, when you refer to them in Java, are references to objects. Remember when we talked about that. When you pass an object to a particular method in some application, youre passing a reference to the object. Youre passing where that object lives. Okay. Which means that when you have an ArrayList of songs, which what you really have here are a bunch of references, which we can think of as pointers that refer to the actual objects that contain the songs. Okay. So over here theres a In Your Eyes, by Peter Gabriel and it was 99 cents. And over here we might have say, Ramble On, tell me theres some Zeppelin fans out there. All right, good, good. We will not have to end lecture early. And Ramble On is such a great song, its like $12.99 by itself, single son. Thats probably why most people dont listen to it. And over here we have the master, Stairway to Heaven, Stairway to H, well just abbreviate it. Because its that good, well just have a moment of silence, also, by Led Zeppelin, and well just say that one should be like, 49 cents so everyone can listen to it. Its just kind of like the bonus tune. All right. And so thats what we have in a list of songs. Now, heres the interesting part, right. If Im going to have some albums, so I add some albums. So lets say add some album on here like Soul, by Peter Gabriel, and Soul actually has the song, In Your Eyes on it. Okay. Now theres two things that come up we will need to think about when we actually do this. We need to say, hey, this has got some ArrayList associate in there, and so I can create a new object that is a song for In Your Eyes and set my ArrayList to be a reference to that object. And thats a reasonable thing to do in some cases. The only problem is what happens if I go into my store and say, hey, I want to change my song In Your Eyes from being 99 cents, because no ones heard of it before, to 9 cents. Okay. So if I go thought my list of songs I say, oh, here it is, Ill change its cost to be 9 cents. Now, unless I go through all of my albums and find for every album go though every song thats listed on the album and see if I can find that same song duplicated, Im going to create an inconsistency in my data. What I really want to have is say, hey, theres only one object that is that song. And if that song happens to be a song thats sold individually, or its a song thats both in my list of songs and on some albums, theres only one object ever that I refer to for that song, which means, I never create the second object out here

http://technicalsupportindia.blogspot.com/
for that same song. What I do, is when Im creating the album Soul, and someone tells me, oh, its got the song, In Your Eyes, on it, I say, hey does that already exist in my store. If it does exist in my store, Im going to add that object to my ArrayList. Im not going to create a new object, which means each song only ever gets created once, but it can potentially get added to multiple ArrayLists. And its the same single underlying object that has multiple references to it. Why is that cool? Thats cool because now, when I come along and a whole bunch of people start listening to In Your Eyes, and Im like, Peter Gabriel, he just deserves a lot more money, were going to make this $9.99. Its $9.99 everywhere by changing it once. And thats the real key to large-scale software engineering. You think about not only reusing you remember for a long time we talked about having methods that you reuse and how you generalize your methods, this is about reusing your data. Thinking about your data, sort of, if its only one thing, exists in one place ,and everything refers to it. Okay. So any questions about that idea? This is what we refer to as a shallow copy, because what youre getting, after youve created that song once, when you want to add that song somewhere else, youre just setting a reference to it, youre creating a shallow copy, theres only one copy. The thing we did before, where we actually created a whole separate structure, is referred to as a deep copy. And sometimes, deep copies make sense in some particular cases. Most of the time they actually, well, I wont say mot of the time, they dont, it depends on the application, but most of the time what youll actually be using is your friend, the shallow copy. Okay. So what does that actually look like if we try to turn that into some code? Well, what does that mean in the application? Let me show you what that means in the application. So were going to add some songs. Were going to go through another example. All right, let me add the song and Ill just abbreviate, In Your Eyes, Peter Gabriel, $1.99. Then Im going to add Ramble On, oops, Ramble On, Led Zeppelin, and well make that, oh, I dont know, $2.99. Okay. Now, at this point I have two songs. Now, Im going to add an album. So I add a particular album and the album Im going to add is Soul, by Peter Gabriel and it says enter a song name. Its going to have In Your Eyes on it. And it asks me because the unique identifier is both the song and the band name, it still needs to ask me for the band name, and the band name I give it is Peter Gabriel. And it says, hey, that song is already in the store. Its just letting you know, hey, I found that song in my store, so when I add it to the album, Im adding that same object thats also in my store to the album. And then you could say, well, theres other stuff on there like there happens to be a tune called, Red Rain, which is also by Peter Gabriel, and you know its a fine tune, but lets just say its 1 cent, okay. And it says new song to add to the store. What did it do here? What it did in this case, it says, hey, you want to have a new song called Red Rain, by Peter Gabriel. That song costs 1 cent, you want to add it to your album. Well, if you want to add it to your album, its also a song that Im going to see in the store. So it actually adds it to the store and adds it to the album. And theres still only one copy of that object ever. It just needs to make sure that when it creates a new song to add to an album thats not already in the store, it adds it to the store, as well as to the album.

http://technicalsupportindia.blogspot.com/
If the song already exists in the store then it just adds a reference to the album. Okay. Thats the critical idea here. All right. So now, if we sort of list Ill hit enter quit and if we list all the songs, right, the song Red Rain has now been added to the store and costs 1 cent. And if I list all the albums that are sold by Peter Gabriel, and if I list all the songs on that album, it has the songs In Your Eyes and Red Rain so it matches the picture that I think. Thats why having a piece of paper, where you draw pictures, is useful. Because you look at what youre application is doing and you say, does it match what I actually think should be happening in my picture. And if doesnt, then you know one of two things is wrong. Either your pictures wrong or your code thats supposed to be dealing with that picture is wrong. But in either case, youve already figured out a bug, even though the program hasnt crashed or anything, you just know theres an inconsistency. Okay. And so now, if I update the price for a song, like I update the song, In Your Eyes, by Peter Gabriel, and I change its price to, I just go crazy, no ones going to buy the song anymore, the price is updated. Now, if I list all the songs, that song is $999.99 in the store, and if I also list the songs on any album five, so lower case, the price is also updated on each of the individual albums, because theres only one object. Okay. Thats where the consistency comes in. Thats why the consistencys key. Okay. So what does this actually look like in code? How do we do this? Let me show you what the actual application looks like for our little friend, the Flytune Store. Okay. So theres a bunch of stuff at the beginning that just asks for the user selection, basically print some stuff out to allow you to make a selection, and then gets youre selection for you. And then theres a big case statement that calls an appropriate method, depending what selection you made. So Ill go though some of the simple ones pretty quickly. You can list out all the songs carried in the store. In order to be able to do that, we need to keep track of how this informations actually stored, its exactly in these data structures I just showed you. Song is kept track of in an ArrayList of songs and albums is kept track of in a HashMap that maps from the name of the album to the actual album data structure, itself. Okay. Any questions about that, hopefully, thats all clear. I will take off the hat. So how do we print these things out? To list all the songs, we just go through our ArrayList up to its size, and this is why you want to think of data structure as your needed guide, because youre going a journey. At any given point, when youre dealing with a data structure, you want to think, what is the type Ill dealing with right now? What does that mean? It means, when I want to print something out, what I need is a string that prints out. How do I get a string? If I started at songs, songs is an ArrayList. I dont have a string I can print out. But from an ArrayList I can get an individual element. When I get an individual element of that ArrayList, what do I have? I still dont have a string I have a song. What can I ask the song for? I can ask to get the string version of the song and I have a string to print out. Okay. So you always want to think of it as youre going on a journey. Where do you start your journey? Youre journey starts at the data structures you have available to you. In this case, we have a data structure called songs, another data structure called albums, thats

http://technicalsupportindia.blogspot.com/
whats available to us. And what we want to do is go from that starting point through a series of steps to get to the thing that we actually care about at the end, hat little piece of data that we want to display or interact with somehow. So heres another example. If I want to list all the albums, how do I list all the albums? Well, to list all the albums, albums is a HashSet. So in order to do something with a HashSet I need to say, hey, I want an iterator over all the keys of that HashSet. So albums is the HashSet, I get the keys of the HashSet, which is a collection, and I get an iterator for that collection, which is an iterator over all the keys of the HashSet. And now, as long as my album iterator, which is just my iterator over the keys, has an element, what do I do? I start at albums. I need say I need to get a particular album. Okay. Get. Which album am I going to get? Im going to get the album whose name is associated with the next elements of the iterator. Right, because its an iterator over all the names of albums. So get, gives me a particular album. Then, when I have the particular album, I can call two strings on it to get the string form of the album. Okay, any questions about that? Because theyre going to get even longer, so if there are any questions about sort of the chain of things we call. If its making sense, the chain of things we call, nod your head. All right, and if its not making sense, shake your head. And if its kind of making sense, just keep looking and ask a question if a question comes to mind. All right. So how do I find a particular song? This is something where Im going to use the helper method, so its private to find a particular song. Songs, our unique identifier, is a combination of both the band name or the name of the song and the band name. So how do I check for that? Im going to go through all my songs, its an ArrayList so I can count through all the songs. Heres where things get long. How do I check to see if a song, thats actually in my data set, matches on its name with the name thats passed in? I start at songs, get the I song, and I have one particular song. For that particular object I get the song and name. Now, I have a string. I want to check to see if that string is equals to the name thats passed in. Okay. And I do the same thing with band names. Song, get the I song, get the band name of that song, and then check to see if thats equal to the band. And if both of these are equal, then, hey, I found the song, and so Im going to return an index, which is the index location of that song in my ArrayList, and I can just break out of the four-loop, here. Because once I find it, I say, hey, I found that, I dont need to keep looking, so actually this is one of the rare cases where youll see a break in a four-loop, is you dont need to finish the loop. You got to what you were looking for and get out of the loop. If you manage to get through this whole loop without ever finding something that matches on both, the name and the band, well, your index remains negative one. So you return negative one to indicate, hey, I didnt find it, because you know negative ones not a valid index for an ArrayList. So if you return it that means you didnt find a valid element. Okay. How do we use find song? Heres how add song works. Okay. When you want to thing about add song, you want to think about this property that were only ever going to create an object once, and everything else is going to be references to that object. So the way add song is going to work is its going to return a song object. Okay. And what its going to do, is its going to ask us for the name of a song, if the user enters blank line that means they want to stop adding songs so it just returns null to say, hey, you want to stop adding songs, I didnt create a new song, heres a null to indicate you

http://technicalsupportindia.blogspot.com/
are done. But if they dont impress enter quick, I also ask for a band name, and then I ask to find the song. Okay. I call that find song method I just wrote and I say, does that song exist. If the song exists, the song index is not going to be minus one. And that means, that song already exists in the store. So you told me to add a song that already existed in the store. So Im not going to create a new song because its already an object in the store that encapsulates all the information for that song, I will return to you a reference to that object, which means I just returned from the songs ArrayList whatever song happens to be at the index that that song actually lives at. Okay. So this just returns an actual object. It actually returns a reference. If you can, think of it as returning a pointer to the object. If I didnt find it in there, then, hey, I need to create the new song, right. Its sort of like Red Rain at the end. You wanted to add a song. It didnt exist in the store, let me get the price for that song. Ill create a new song object and now. Heres the funky thing, I will add that song to my ArrayList of songs for the whole store, write out to that the new song was added to the store, and Ill return that new song to you so you can do whatever you want with it. And so now, you might ask, okay, Marilyn, if I just added a song to the store I dont really care about doing anything with that song, why are you returning the song to me? And thats true. If I just add a song to the store, if thats all I care about, I ignore the return value. Thats actually what I do up here, which is very funky. Right. If you want to add a song, I just call the add song method, it goes ahead and adds the song to the store, if it doesnt already exist, and it returns reference that song object. If all Im doing is adding a song, I dont care I just ignore it. I dont assign it to anything, I just say, yeah, thanks for returning that object, that was fun, whatever, and just get rid of it. Okay. But the reason why Ive written it this was is if Im adding an album, what do I do? I ask for the name of the album, and I check to see if that albums already in the store. If the albums already in the store Im not going to do anything because the albums already in the store. If the albums not already in the store, then I ask for the band name and I create a new album. And then I put that album in the store. So album is my HashMap. I put in that HashMap the name of the album is going to be the key and the actual album object is the object. So I add, you know, the album Soul to my HashMap. Now, Im going to add all the songs. So I have a Y-loop that goes through and keeps adding songs until I get a null from add song to indicate that the user wanted to stop adding songs. But heres the funky part, every time the user adds a song, right, it comes along and says, hey, you want to create some new album? So lets say I actually want to create some new album over here when I create the album Soul, so none of this stuff exists yet. Okay. So to create a new album, I say, hey, I want to create the album Soul. It says, okay, thats fine, create an object for the album Soul. It has the name Soul, its by Peter Gabriel. And it says, okay, what songs are on going to be in there? And it starts asking me for songs, because its going to add them to my ArrayList in here. And so the first song I say is In Your Eyes in on that album. It goes and says, hey, find that song, it already exists. It returns a reference to that song, as a pointer that reference is what gets added to my ArrayList. Now, I go and ask for another song. Do you have any more songs? I say, yeah, theres another song. The song is called Red Rain. When I go

http://technicalsupportindia.blogspot.com/
to create Red Rain it comes up here to add song, add song comes along, asks for the name and the band, it tries to find the song and says, hey, that song isnt already there, so Im going to create a new song. It creates a new song called Red Rain, by Peter Gabriel, has some price associate with it, and adds it to the list of songs for the store. And then it returns this object, which means it returns a reference to this object, and that reference to the object oops, sorry this got blocked all right, this is where it is creating a new song, and it adds the song to the store. Right, it adds it as a song, which is that ArrayList up there, and then it returns the object. So when it returns the object, I went too far, the add object does not know I add that song to the album. So this album, were going to add a song, and the song were going to add is that same object. Its Red Rain. Okay. So thats the important thing to keep in mind. That object we only created once, and we passed around references to it or we can return references to it, and assign them other places. And thats how you get consistency in a much bigger date structure. Now, there are a bunch of other things we could do in here. I wont go through all the excruciating details down here. But we can list the songs on an album, we can update the songs price. And by updating a songs price, all we do is we ask for the song and the band, we find the song in the data set if it existed. If it doesnt exist we just say, hey, its not in the store, and if it does exist then we read in its price, and then for the songs in the store, we find the song at that index and set its price. And we know that whatever other albums contain that particular song, if we happen to update the price over here to you know, $6.99, we only update it once and all the places that refer to it automatically will see the updated version, because they point to the same object. Okay. Any questions about that? So I know its a lot of complexity to kind of deal with a big data structure like this. But now its one of those things like, now youre old enough to kind of see the big honking data structure. Because in the real world, when people think of software engineering the large, these are the kinds of things they need to worry about and thats where the complexity comes in. Its keeping track of all your objects and thinking about what objects you actually need to design and build, in order to actually build an application thats kind of successful to keep track of and makes thing consistent with all the data you have. So any other questions? Uh huh. Student: [Inaudible]. Instructor (Mehran Sahami): Oh, can you use the mic, please? Student: Sorry. So in this application, do all the songs and albums, theyre also, I guess, singles, or cause the albums are never priced, right? Its just the individual songs Instructor (Mehran Sahami): Right. So the albums dont have a price. You could imagine the cost for an album is the total of all the songs on the album. Or you could actually do something funky. Like this is one of those places you can make a policy decision and say, an album is 90 percent of the cost of all the songs on it. And then all the individual song prices can change and any time you just say, whats the price of the

http://technicalsupportindia.blogspot.com/
album, total up all the prices of the songs, and take 90 percent of that. So it also allows for very dynamic album pricing. Student: Thank you. Instructor (Mehran Sahami): All righty. If theres any more questions, come on up. Otherwise Ill see you on Wednesday. [End of Audio] Duration: 49 minutes

You might also like