start recording hello everyone and welcome to handmade hero the show where we C to complete game live on stream uh we were talking in the pre-chat and I was just not I wasn't feeling like it was a good night for it I was feeling chatty uh so I asked if people wanted to do a chat because sometimes we do a chat once in a while uh instead of programming and they said yes and so I feel like um this is going to be a chat stream and we'll file it under handmade chat uh and we'll just see what people want to chat about we'll just see what they want to chat about and we can open up um our Milton as well let's see here uh oh did I kill it we're using a by the way we're using an experimental version of Milton here um it's got a brand new GPU renderer which technically is not supposed to work on AMD yet it's only on Nvidia uh and we are running it on an AMD card because I want to scare Sergio I guess um uh so yeah expect uh expect Milton problems we're not really in a we're not in in the correct um uh we're not where we should be yeah it's looking like it's it's having some trouble let's put it that way uh but I'm not quite sure why uh it's sort of working it's see now it's in there I don't know what I really should do at some point is switch this machine over to an Nvidia card probably or at least a more modern AMD card uh all right so maybe we won't do that maybe that's not what we'll do I take it all back um I don't know if we can launch the old one I may have to download a copy of the old one uh I'll just see if we can load the old one up here oops there we go uh so yeah we'll have Milton up so I can draw if I want to on the chat um let's see here I'll scroll through some of the cues let's see so I guess what I should say is Allan since you definitively voted yes on the chat did you have a topic you wanted me to chat about out I will ask first and then if you don't I will go through the questions um on the uh on the um stream and select some that I think might be good uh and we'll go from there for all right so let's see oh and Alan asked is Milton successfully maintaining your data between versions um I guess I don't know but it looks like it is um I mean I I definitely wrote that on the new on the new one and now I've olded it loaded it in the old one um but I haven't really tested it thoroughly if that makes sense Mr fourth dimension oh shoot that first question wasn't my questions I think a lot of people are interested in this U business and I am too so let's start there and see what comes up after okay um well so so the UB thing um is I think uh I guess I I don't really know when this term got coined um the UB part of it but obviously in C and C++ uh there's the concept of undefined Behavior right uh and UB just stands for undefined behavior for those of you who don't know what that is uh but what undefined Behavior means in the C and C++ spec is just that if you put in a line of code that does something that is classified in the specification as undefined Behavior right um like say dfference a null pointer or something like this then the idea is that the specification does not say what has to happen at that point it instead simply says that it's undefined Behavior which means that if you write it Anything could happen right um and just to give you uh an example of this from uh the talk in fact even that uh touched on this if you D reference a null pointer on some machines it will fault because it has virtual memory and memory protection and D referencing a null pointer will cause an exception like the kind that we see on handmade hero if we write to a null pointer accidentally that will happen that is not in the c or C++ specification right the c or C++ specifications do not say that dereferencing a null pointer causes a memory exception or a segfault the reason for that is there are other platforms that do support CN C++ compilation which do not not have memory protection and so on those platforms the reference gel pointer just works you can read and write from the zero address all day long it's a valid memory location right uh and so what undefined Behavior essentially is in the C++ specification uh is just a way for the spec to say we don't know what's going to happen when you do this because the results can vary per platform and we don't find that it's a idea to be overly prescriptive here because if we were we may rule out C's ability to run on certain platforms or we may make it so that the compiler the code the compiler has to generate in order to um uh in order to fulfill the behavior definition specified in the spec is uh prohibitively expensive it's too slow right because you could imagine in that example I gave with the null pointer you could imagine if you did want that to always be the behavior avior that a fault would happen right you could still Implement that on all platforms but what you would have to do is everywhere in C before the first time that any particular pointer state is written to or read from you would have to manually insert an if check to see if it was zero first and if it was you would then have to manually call an exception raising routine right um hopefully that makes some sense so obviously the spec writers don't want to say that they don't want to say writing to a NP pointer is a Segal because now all of a sudden these other platforms uh it creates a a prohibitive cost if you will um for for C to run on those platforms okay so that's what undefined Behavior sort of initially was in terms of like how I would have described it to you before sort of the nonsense of the p last 10 20 years right uh now what has started happening unfortunately is as compiler vendors have been starting to get more aggressive with optimizing compilers and that's not a bad thing right um we want them to be aggressive with optimizing compilers because we want them to produce code that runs quickly so we want to be we want to to be spending time trying to think of new ways to optimize code certainly uh what they have been doing is they've been introducing optimizations which use the fact that undefined behaviors are specified in the spec as being literally undefined uh to essentially optimize code to the point of being what I would call intuitively incorrect right um but technically correct uh now I'll just write those two words down because I'm making them up right they we don't have real words for these things I'm just saying how I would view them uh so there's what I would say is there's a concept of what's intuitively correct in a piece of code right and then there's a concept of what's technically correct right um and just to give you an example of of what I mean there right I might expect uh in my code uh for example that if I defined an INT right let's say I have an INT uh and I and the int is currently set uh to like 2 billion or whatever right so the int is at 2 billion very close to the wrapping range of an integer right because you know we've got 4 billion values that a 32-bit number can take on and let's say this is a 32-bit integer as it would be by default on say you know the platforms we are currently programming on um if I were to start at zero and go up to 2 billion well when I get to like two billion in change I'm going to wrap around to negative2 billion and come back through right on the CPU okay so that's what's going to happen on the CPU if I were to issue an ad however my understanding and I am not a spec person because I you know I hate the C++ and c-pcs with a passion uh because I never seem to agree with how they decide to do things so I don't spend a lot of time reading them but my understanding is that an integer when it overflows is not specified uh so basically it is undefined Behavior what happens if you took 2 billion as an integer and added another 2 billion to it right now it's completely rigorously defined what will happen if you do this with at x64 addition right if you do a 32-bit ad and another if you do a 32-bit ad of a 32-bit register and a another 32bit register both containing two billion it will wrap back around right we know what's going to happen there right um but as far as the spec is concerned the compiler is under no um uh is under no obligation to do that right it does not have to actually obey that uh rule because it is technically undefined what happens when an integer wraps around if you or I shouldn't say wraps around if an integer overflows its value so if an integer Can Only Hold between negative -2 billion and positive2 billion and you go over positive2 billion the compiler does not have to make it wrap around all right uh now the technically correct thing here right is anything in that case the intuitively correct thing would be for it to wrap around right that's what I expect that's what the programmer expects to happen why because I understand the CPU I know what the CPU does if you add two numbers I Su that if I have 2 billion plus 2 billion that it will wrap around like the CPU would normally do do cuz in my mind what's going to happen is compil is going to turn that into an ad however the technically correct thing to have happen as per the spec is anything the compiler wants to have happen right because it is undefined Behavior now uh start to get into the reason why this was coming up in the internet uh Chandler carth the guy at Google who does work on the optimizer for llvm uh I don't know if he leads the optimization team there so something like this I think it's his position uh he's he's one of the main guys let's put it that way he gave a talk uh that touches on this and I'll explain why he was giving that talk in a second um so to give you a little more background on it um but uh just to sort of talk about something that he said in that talk it's important to understand as well and I agree with him on this point certainly that anything here it's not like uh if you if youve don't really understand this issue or haven't really seen it before a lot of times people may joke about it or whatever and say that the anything like literally means anything like it could format your hard drive or whatever and yes that is technically true because it is undefined Behavior so technically the compiler could literally do anything if it wanted to but obviously that's never going to be true right there's never going to be unless the compiler vendors were particularly malicious they're never going to do anything where they intentionally introduce a bug into your program for no reason at all right you're not like out to get you or something like this although sometimes it may seem that way um so what does it mean to say that anything could happen why would something else happen right like why would something other than the intuitively correct thing happen the reason is because once optimizations are considered there are plenty of opportunities for the compiler to do something more efficient than what it could have done if it did the intuitively correct thing for example in this case where I added 2 billion to 2 billion it assumes that I'm using a 32-bit ad right because if I had two integers and I do a different kind of AD let's say for example I'm on a processor architecture where a 64-bit ad is inexpensive but a 32-bit ad is not I'm sorry a 64-bit ad is inexpensive and a 32-bit ad is expensive so doing a 64-bit ad is much cheaper because it's it doesn't 32-bit is like an afterthought for this processor it's used to handling 64-bit native right well in that case if I want to do computations with this thing I add 2 billion to 2 billion the compiler knows that if it calls the 32-bit ad it's going to be particularly slow right so it wants to do the 64-bit ad because the 64-bit Adder is much faster but when it calls a 64-bit Adder it's going to get four billion right which is not the right answer in this case and it needs to wrap the value around right it needs to produce the wrap or something right now if it's going to do a subsequent arithmetic operation on this thing where something else is going to happen to it where it matters whether it has wrapped around or not or whether that top piece exists at all right whether that additional uh greater than assigned integer could be part exists it's going to have to truncate the value here manually right it's going to have to like and out the top part or do something you know it's going to have to do some work uh additional work and so since this is undefined behavior in the spec the compiler is free to do the technically correct thing which may give a completely unintuitive result because the programmer was expecting it to wrap since he knows it's a 32-bit value and he thinks 32-bit Valu is wrap right but lo and behold it does not wrap and it gives some very different result because the processor actually did it in 64 bit right uh now this is not even hypothetical even uh back uh if we take a look at something that was before any of this sort of stuff started happening if you just take the example of the x87 uh FPU right before we had x64 architecture we had the x86 architecture and on the x86 architecture there was a floating Point Unit called the x87 right uh and of course they were on one chip after a while but I think originally it might have been a co-processor I'm not sure I I was an Amiga person at that point so I don't really know EX actly what happened before then by the time I got to um the PC they were on one chip I don't know if they were ever separate right uh but the xa6 architecture has the notion of a floating Point Unit in it and it had a floating Point Unit uh which had high Precision uh it had greater much greater than 32-bit uh i e floating Point okay 32bit I floating point is real weak sauce compared to the x 87's FPU the x87 FPU when you did operations on it unless you did some special mode things later on in time if I remember correctly it added the ability to to you know make the Precision less it would do things to 80 bits this is my recollection forgive me if I'm getting this wrong it's been a long time but my recollection is like it did it had 80 bit floating Point numbers that's just what it did uh and so you can imagine what would happen if somebody was expecting to use i e 32-bit floats so in there C code they wrote float X or whatever right and they did some operations that they thought would be 32-bit and then they compared the results of those operations to doing it on an an x87 CPU and then compared the results to doing it on some other CPU that only had 32-bit floating point and what they would find is that their results would be totally different on the x87 because there were so many more bits of precision you might get totally totally different results drastically different in the case of things like catastrophic cancellation or other sorts of floating Point operations where the extra bits of precision make a huge difference right and so again you can imagine here if you were a compiler vendor and someone had told you that the spec says you have to have 32-bit floating points instead of saying something like floating points are whatever they are right however many bits they are if as long as it's at least 32 or something you would have to insert all this extra code to constantly truncate these Val vales to make sure that they weren't doing higher Precision calculations than you were expecting right um so that's what what uh you know how I would sort of start to frame this problem in terms of of how to start thinking about the undefined Behavior issue and now I'll start talking about what kind of came up recently and sort of has been brewing over time but that's kind of the way I start thinking about it is there's an intuitively correct thing and a technically correct thing uh we might want to use uh the drop the correct here if you're uncomfortable but say there's an intuitive thing and then there's a the technically correct thing right um or spec correct let's say if you're uncomfortable with that uh just to use very generic terminology but point being there's your intuition of what should happen and then there's what's actually going to happen according to the spec uh and sometimes the spec just doesn't guarantee anything so the compiler can do whatever is efficient and that may have unint Ed consequences uh because again it's undefined Behavior you can't count on it all right so recently what's been happening is we've been getting a lot of pretty damaging versions of this thing right so compiler vendors are you know looking through the spec they're looking for optimization opportunities they see oh okay you know I've got this sort of wraparound thing that's happening here I don't have to guarantee that wraparound so hey guess what I can fold these operations in and make a much more efficient um execution uh a much more efficient set of instructions to do this operation but hey if the programmer was actually counting on wraparound you introduced a bug in their code right now it's not technically a bug as per the spec because the spec did not guarantee the programmer they could wrap the integer right but it is definitely intuitively a bug because the programmer was thinking they were using 32 bit integers and it turns out that they really kind of aren't because the C compiler isn't guaranteeing them that right okay so that is what's been happening a lot lately and one of the big sort of uh the reasons why this is kind of getting aggravating to a lot of people is this can introduce a lot of Fairly serious bugs this can introduce say very bad security bugs because somebody thinks they wrote code that if the wrapping works properly will never overflow a buffer but hey guess what the compiler compiled it so that it can actually far exceed the buffer when it does the offset because hey the spec didn't say that you had to guarantee the wp would happen right so you're starting to see more and more of these things where the compiler vendors are technically within their rights to do a particular optimization they do it it introduces fairly severe bugs uh in some form or another into well-known software packages those people get set obvious reasons uh and you get this like he said she said of you know okay well just why are you doing that you're breaking this code it's like well that's the spec that's our job is implement the spec you should have written code that conform to the spec you know your code is buggy technically and then they're like my code's not buggy you guys just don't understand the need what you're supposed to be doing here why are you even doing this no one cared about this appstation so you can see like there'd be a lot of acrimony there right you could understand the acrimony there uh and to give a personal example of this uh from quite a bit further back in my programming career when I wasn't even aware of this stuff it's the first time I really became aware of this I actually had a bug exactly this way um and I'll tell you what it was cuz I remember it uh in all the versions of Granny that I shipped uh there was a tree that would do some kind of tracking of pointers and I don't remember exactly what it did it for uh but it tracked basically some addresses of things and the reason it was doing that is because the address was just a convenient thing that I knew would be unique right because I knew that each of these data structures existed somewhere in memory and I know that no two of them could exist the same place in memory because hey that's how allocation works you can't store the same stuff in the same place and so what I would do inside this tree is I would just say oh well I've got two pointers you know a um pointer to Fu a and a pointer to Fu B right and I just was like oh well I just need to return the sort CR criteria for these two things so it's fine just do like you know um I've got an INT or something that I need to return so I'll just do like a minus b or something right uh so if a subtracted from B if it turns out that a produces a negative result uh that's what'll happen for my sort criteria if it returns a positive result that's what it is if it's zero then they're equal it's exactly what I want and I don't care about overflowing or wrapping or anything here because I just as long as it always produce is the same comparison between these two things in terms of positive negative or zero I can use a standard binary tree algorithm which is what I was using and it will all just work right okay problem is that technically according to the C++ specification if you have two nonc care Stars so if these are not 8 bit uh pointers to 8 bit values right so let's let say struct FU has you know an INT in it so it's it's four wide or something like this um then pointer arithmetic is only guaranteed to work if the two things came from the same allocation right so in other words if I have F star um array equals uh you know Malo uh I don't know uh 12 time size of Fu right if I do an allocation and I say that you know these things have to uh these things all came from here and then I want to say okay I've got my a is you know array + three and my B is array + 7 then technically this is required to work in the spec it is not undefined Behavior if I remember correctly it is is technically defined to be the distance between these two things in the array however if they came this way so allocated from two different pools now they no longer have to uh have any kind of this doesn't have to work at all so for example these could be two totally different pointers totally different this is still allowed to evaluate to zero right uh now you may say why on Earth uh would anyone ever want that to be uh allowed in the specification right uh and I would like you to push that thought on the stack for a minute because this I I guess this sort of example will be perfect for me to illustrate my opinion of this whole subject matter in a moment uh when we get to talking about how I feel about it right which I don't know you know since it is my stream presumably someone asks this question because they want to know what I think about it um so I'll tell you my opinion not that anyone else cares what that is uh certainly not the C++ committee um but so hold that thought for a minute but hopefully that's pretty clear uh and then just save your bewilderment of why that would be for later because there's a perfectly good explanation of why it would be and also I think a perfectly good explanation of why it's ridiculous so I'll give those two to you and you can kind of chew on them anyway um so now we get to the the latest sort of thing about this uh Chandler carth the aforementioned Google uh optimization guy in llvm uh gave a lecture where he was sort of talking about this because uh and this I don't know very much about so you know I don't know if James Whitman's on the stream I know he's a c CPP uh spec kind of person he might be able to tell you more about this this aspect of it um I don't follow CP P stuff anymore obviously um but I believe there is now push on the C++ standards committee uh called UB or unsigned Behavior kind of um uh they they have introduced this as kind of the the abbreviation for it there's kind of this push to start dealing with undefined Behavior a little bit more to start like nailing down uh you know uh how you sanitize undefined behavior from programs and I don't know I mean I'll be honest with you I don't really know I don't know why it became more of an issue maybe because people have been complaining about it and they want to see people take more action about this I don't really know uh so you'll have to ask somebody else about that um but he gave a lecture at cbon um about undefined Behavior Uh and he kind of uh gave you know examples of various undefined behaviors uh why you know the compiler why they implemented certain things compiler that you know why those un beh fine behaviors are happening uh and that sort of thing uh and he also gave you know the I think the part if you saw my [ __ ] about the part I was very uncomfortable about was for some reason he called out a lot of game developers people I respect and kind of the tone was very odd it was like he was sort of suggesting they were stupid or didn't know what they were talking about or were you know arguing about stupid things I don't know it was just it felt very dismissive and I didn't understand why he was doing that cuz like it was like Fabian you know who's I mean one of the most meticulous programmers I know so to suggest that he would be you know un aware of the important aspects of this issue was odd you know uh or John Blow who like obviously you know I mean he's writing his own compiler that's like better than C++ so why you wouldn't you know take what he says very serious I don't know so there was kind of like the it was a tonally weird and um that made me kind of uncomfortable uh to call them out you know it just felt really odd and dismissive um however Chandler later said on Twitter after I kind of posted I was cover about it he said on Twitter that wasn't his intention so I think that may have you know that that may well have been a case of of him you know trying to be more humorous about it than it came across or something for all I know you know so so I don't really know uh so maybe that part just was you can just forget about you know don't worry about that part of the talk maybe it didn't come across as he intended uh and it wasn't meant as sort of a a dismissal of some of these other points of view I'm not sure um but anyway so he gave uh his talk about that and uh you know I guess what I would say is the talk if anything I I think maybe the talk was meant to to persuade people like me um who you know maybe were uh not zealots about this issue or hadn't been involved in the discussion you know very uh strongly or something in the past would maybe to persuade them to uh not be so much against the way that the C++ people do it or I I I don't really know but you know again because I already know that the tone of the lecture I misinterpreted perhaps from what he had expected I couldn't really say what he was necessarily trying to accomplish with lecture because maybe I'm mising that too but it felt to me like something where he was kind of trying to put forward a model of undefined Behavior or a way of looking at the problem that is in line with the way the C++ spec and compiler vendors want to look at it as being a productive or useful way of looking at the problem and to not be um upset about it or something like this but I did not find it at all persuasive in that uh way um and uh I guess now I can pingpong back to my example and maybe give uh a clearer explanation of why I am not persuaded by that uh based B on how I like to program and what I expect my compiler to do and so on uh so let's ping pong back here and see if now I can sort of tie those things together so we have a case where we're trying to do a minus B and we would like the compiler to give us back a positive or A negative or a zero and we're expecting zero to mean that they're equivalent positive to mean sort One Direction negative to mean sort other direction all we care about is it always evaluates to the same thing and it only evaluates the zero if the pointers the same right that's what we wanted uh now the reason why you do not get this Behavior necessarily when you do Point arithmetic and the reason why it's undefined in the c-spec if they come from two different memory pools is because there's ways of doing more efficient determination of the arithmetic here in the case where you know that the uh pointers must be an even multiple of the struct size away from each other okay so if I know that this is four elements long and I know that A and B are of from the same pool I know that they must have the same alignment relative to each other so if this guy starts on a certain boundary I know that this guy will be on the same boundary mod four right you you you will never have them uh be separated by some other amount I can't say Make A and B Be seven bytes away from each other I could make them be eight or I can make them be four but never seven right and what that means is that remember c++'s specification of the distance between two things is technically uh going to be the distance between the two pointers divided by the size of the structure right now A an integer divide could be very expensive so if this was for example not a power of two let's say this was something like this all right so it's 12 bytes long well I don't want to issue an integer divide by 12 what I want to do instead is do some shorthand Shifty multiplication weirdness or something uh to do a divide by 12 more efficiently than issuing the integer divide and so that's exactly what they'll do and they'll use the compiler will use the fact that these uh things are unknown alignment relative to each other to make it happen right it can do more efficient um arithmetic that way and avoid the Divide all all together okay so you can see that sounds pretty reasonable right that's a very reasonable thing for a compiler to want to do and the reason that they are enabled to do that was specifically because this is not allowed in the spec it is undefined the compiler could emit working code for a subtraction between a and b or it can emit completely broken code for that and in this case what it decided to do was emit completely broken code for that because it made the code for the correct thing the correct bpec thing to be much more efficient right so hopefully you can see why that makes some sense and I certainly don't want to throw compiler uh writers under a bus for doing something like that because you can see why that would speed up a bunch of cases in your program which are based on doing exactly this right but now let me try to explain why I think this sort of thing is really ridiculous and why I don't like it and why I wish uh there was a different attitude taken and more specifically what that attitude actually is so the reason that I think this is ridiculous is because I think it comes at the problem from a fundamentally misguided perspective not necessarily from the perspective of the compiler writer but from the uh perspective of the specification authors right and here's why if I am going to Define subtraction between two pointers A and B I think if I am a a diligent spec writer then I have to choose to do one of two things either I have to specify an inefficient but complete option for doing so or I have to specify multiple uh efficient options that are complete this is the way I tend to look at this sort of problem right and what I mean by this is if I'm only going to specify one a minus B operation right which is sort of what we're saying because this is the only way I can take the difference between a and b right there are not multiple subtraction operators if if I have a pointer a and I have a pointer B and I say that a minus B is my grammar for subtracting these two pointers if I'm only specifying one it must be the one that always works if on the other hand I do not want to handcuff the compiler writers or the optimizers into a situation where they cannot write efficient code for things then that's fine but you then have to give me options for doing the correct thing right um or you could say multiple uh you know options that are complete some of which are efficient some of which aren't right and that would look like this um a minus B right yields you know is always correct um a you know uh minus SLB right is correct only in the same pool right that makes very good logical sense to me because there what you're doing is you're saying okay we know that this may be an operation where people need to uh write it in a way that could produce incorrect results in certain cases so what we're going to do is provide two different operators uh with that awareness and this way when the programmer knows that they are well within the specified usage of this uh faster version of subtract they can use it and when they are not they can use the regular subtract and ensure that there will be no gotes right in my mind there is no other sensible way of approaching this problem and the reason that I don't think there's any other sensible way of approaching this problem is because approaching it the other way requires every programmer that ever uses your language at any time to be completely and fully aware of every corner case in your entire specification and is that really realistic I would argue that there is no way that that will ever be realistic most CC programmers don't even know all of the things in the spec let alone their Corner cases I would say 100% of all C++ programmers don't know everything that's in the spec like there's probably one thing in the spec somewhere that you could pick for any C++ programmer and they wouldn't know it they would be wrong a little bit or they just didn't know it was there right that's just the nature of a 500 page document or whatever it is at this point nobody's going to remember every last little bit and maybe there is someone somewhere who just proves that but they're like you know the autistic programmer guy in the basement who just recites the spec well like you know going like this okay so from my perspective none of these things should have ever come up right there shouldn't be this idea that the specification can leave all these undefined things happening where the undefined thing can be very non-intuitive right you need to be able to say that here here is a very reliable intuitive way that this thing works and when we are going to depart from that in some way we must make it very clear to the programmer kind of like a buyin if you will that you are about to do something where you understand the cost of what you're doing uh in terms of these minutia right because no one's ever going to write this if they don't understand the difference between those two things and especially nowadays when computers are incredibly fast introducing a bug that could be very problematic that only happens on certain compilers with certain optimization settings because you take advantage of this automatically when someone just wrote this that's really really hurtful to programmer productivity confidence in your compiler ability for programmers to know that they've written things that are correct right and possibly the security of whole systems okay so in my mind 90% of the undefined Behavior things in C and C++ that I've seen fall into this Camp there are problems with the specification the specification should not say undefined Behavior instead what it should do is specify a reasonable intuitive behavior that can be implemented on most machines uh relatively well but perhaps more costly and then they should say if we have all these optimizations in mind for this thing there will be a faster way of doing it that you can opt into right that you can opt into and you can opt into this in various ways maybe one way of doing it is changing the type of your variables maybe you have an INT and then you have an INT opt or something that opts into a few things maybe it's an operator based thing I don't know obviously for different circumstances we could imagine different things being more efficient but the point is simply you should have to buy into these kinds of optimizations that are extremely non-intuitive uh based on what you might expect to happen in to a reasonal programmer the the principle of of least surprise if you will okay here's thing number two so let's talk about the other case that I brought up uh over here uh so the thing that I brought up over here uh was the wraparound integer case right uh so let me talk about that okay so fundamentally uh all of there's a whole family of things in this whole like you know uh wrapping uh sort of family of things or like you know uh uh casting you know whatever implicit conversion I don't know whatever you want to talk about here uh but but for example in the C++ spec you know int unsigned uh has welldefined wrapping wow writing well defined wrapping so you do know that if you have an unsigned integer it will wrap properly whereas int does not right so this may not wrap properly but this I think is guaranteed too that's my recollection okay so the interesting thing about these uh is this gets at another fundamental weakness in the c-spec and C++ spec in my opinion uh which is that types the generally the way that we talk about types in C doesn't correspond to what the processor does uh in a way that is useful to the programmer right and this comes from a fundamentally uh I think misapplication or mis a fundamentally uh uh um how should I put this it comes from a fundamentally differing worldview between the people who work on the specification from C++ and people like me uh now that doesn't mean that that I'm right and they're wrong uh but what it does mean is I don't want to use C and C++ right what it means is it's not a good language for me and I can say that pretty definitively right uh and here's why when you talk about things in these sorts of terms and you say that int unsigned has well-defined wrapping behavior and int does not just the fact that you have stated things in this way completely ignores the fact that programmers such as myself are not trying to write an abstract program like I am not trying to write a mathematical proof in a book I am not trying to write an abstract mathematical program that can be preserved for Generations on a single piece of paper for future people to look at and understand how it worked what I am trying to do is program a PlayStation 4 right or I'm trying to program a PC I'm trying to program an Xbox 360 right right that's what I actually need to have happen and uh this is sort of the essence of Mike Acton's cppcon talk for example where he talked about this very specifically to the C++ audience I don't know that anyone found it convincing but fundamentally speaking the kind of programming that I do during the day and that most people in the gamer during today is not about an abstract programming to a language it's about making something that runs on actual hardware and in a way that is specified very completely by that actual Hardware so when I type mm you know store PS I'm not trying to say some abstract thing about how I might want to write some values to memory at some point what I'm trying to say is like put a move APS in there right I'm trying to tell the processor what to do and that's how I generally view what I do during the day now you might say well why don't you program in Assembly Language then right uh and the answer is because well it's vastly more complicated to program an Assembly Language I would have to do things like register allocation I'd have to do things like uh think about how to like put things on the stock and where they should go right and I don't want to do those things I appreciate the fact that I have a higher level language to do those things for me but what I don't want my higher level language to to do at all is pretend that I'm on some kind of an IL defined system that doesn't really know whether it uses two's compliment or not or whether or not the size of a register is 64 bits I have no interest in that because I know exactly what I'm shipping on and what I want to be able to do is write code confidently that translates into well-defined behavior on the processor I am targeting right so I don't ever want you to say something like well-defined wrapping or non-w well-defined wrapping or honestly ever int or unsigned in because as far as I'm concerned these things are actually the same thing they're a 64-bit register that's going to get operated on and it doesn't matter whether it's an INT or an unsigned int they get the exact same instructions generated right for a huge number of the operations you might do on them there isn't like going to be two different types of add for example there's only going to be one and it doesn't matter whether it's an unsigned in or a signed in when it starts to matter which one is which is when I do something like you know uh comparison operator or something like this when I do something that actually takes the two's complement and has to do a different thing for the two's compliment version than the nonto compliment version so it's actually operator based signed in unsign is actually not about this value it's not storing anything different it's storing the exact same thing right the difference is just how I'm operating on it and you can see this brought into very sharp relief when we start switching to the mm uh sort of stuff where we just have like you know mm 128 or whatever it is right or sorry m128 you know these m128 types right uh well if you've ever programmed anything with simd even very basic stuff with simd what you realize is that if you start to do do anything besides floating point if you do anything with uh with non-f floating point you are quickly writing code that treats these registers as a whole host of different things right so the problem was kind of like simple back here and you could sort of say well there's just two ways to treat it oh it's an in or it's an unsigned in so we'll just have those two things or right but when you start writing this code you realize no no no no no registers and processors nowadays there can be like 50 different ways to think about them like oh this thing was like eight sliced signed values for this operation and then the operation immediately after it I was thinking of it as four slice 16s and blah blah blah right and so what becomes very very clear very very quickly is the view of the data in memory is entirely based on the operation that you're doing with it and this in signed unsigned nonsense is really just some convenience features that people thought were were cool back when there were very simplistic ways of looking at values and registers where hey if I just specify that these two types then the compiler can just always infer which of the operations I wanted to do in the case where the CPU does have differing operations for signed versus unsigned right and so I this is just a complete waste of time like the fact that anyone's even spending time looking at that just kind of underscores to me that I don't think about these the way way they do I would like a different language I would like it to be based on how processors work not on how language designers think about things or types what I'd rather have is a language that just lets me say here are a bunch of Primitives they're based on what processors can do right I can execute any of them that I want if I try to use one that doesn't isn't supported on a particular platform that's just an error so it's up to me to make different compilation units or versions of whatever to support the processors that I want and then I never want you to talk to me about spec Behavior I don't want you to specify anything the specification for the behavior should be whatever the processor does that's it right and furthermore if I want some extra Behavior I'll tell you what it is right if I don't want want you to care about values rapping I'll tell you right I'll make my own type and I'll mark it don't care if it wraps and then the optimizer knows it can do anything it wants and doesn't have to worry about rapping right I'll tell you but don't arbitrarily have a bunch of people in a room decide that ins don't need to be specified as too compliment and it doesn't matter how they wrap so under fine don't do that that doesn't help me at all that just hurts me right it slows me down it has bugs to my program and it doesn't get me any closer to what I need which is a more efficient version of Assembly Language for programming actual computers that's what C is supposed to be right and so I find this whole discussion to be incredibly misguided I don't think it addresses the actual problems it certainly doesn't help me out and it's going nowhere good as far as I can tell right uh and so you know my preference as how things would go forwards programming wise is to try and go back to the original basic idea that platforms specify their behaviors what their CPU actually does the idea behind the language is to allow the programmer to write portable code when they don't care by using things which are specified very uh concisely and anytime there's any question about what's going to happen the programmer makes it explicit it is not the optimizer job or the Spec's job to leave things up in the air so that random stuff can start happening to make the code faster like that just doesn't help me I don't like that uh I think that hurts me a lot more than it helps right um so I feel like that is a fairly full description of the problem I hope that uh I hope that sort of um clarifies my position on that subject for anyone who actually cares so I think that took basically the entire hour Allan says he's quite satisfied and since he asked the question originally that is good glad to have new stuff to think about uh yeah I mean I feel like fundamentally uh I really would just like to see some work being done in the area of let's now that we know what Assembly Language actually looks like on Modern CPUs in terms of their instruction set can we make a better version of like the original idea behind SE which was like make it easier to write low-level code because right now it's not really that way you know so uh so yeah that's what I would say uh that's what I would say and I don't know like um I don't know that would be that difficult to do you know uh but you know it just seems like it just seems to me like fundamentally um not that hard of a problem to do this part of it and I feel like with all of the effort that's expended on C plus plus spec and compilers and they never improve the core programming experience you know they they never get they never actually get down and fix the fact that it's just I it's this huge nightmare to program and do basic stuff like work with you know Vector registers but that's you know that's why people program in these languages still you know people people writing web apps aren't writing them in C and C++ you know uh the people still using C and C++ are the people who care about these sort of things to a certain degree right I mean I feel like the people who care simply about using high level languages I got to think they would be leaving C++ and droves right because C++ is not a very good highle language you know it's not I feel like you could easily find a better one somewhere else if you're just if you don't care about the the low-level aspects of it and pointers and things like that why are are you using it you know why aren't you using python or rust or go or I don't know right there's all these other options out there and the C++ spec is filled with you know tons of these crazy things and none of them are particularly you know um easy to use or complete or anything else right so um let me see maybe I'll take one more if it's quick otherwise I think I'm going to go Andrew JDR has one that I can answer I think pretty quickly but not in a very satisfying way curious if you've experimented with different 3D polygon triangulation approaches and settled on something that you use under most circumstances uh and the answer there is no because I have not really had to do that um I probably will have to do that in future incarnations of our engines at Molly uh so maybe I'll have more of opinion about that 5 years from now not that it'll be relevant to you at that point I'm sure but uh mostly I just have not had to do um con like concave you know a 3D polygon is pretty trivial to triangulate if it's if it's convex right um so there's really no need to get fancy uh if it's convex but if it's concave which is I assume what you're talking about um you know uh oh well maybe you're not maybe you just mean if you want to upres it because you said constrain Don Delon triangulation um so I don't know I guess I should clarify first are you just talking about adding triangles to inside of a triangle that's already um uh that's that's already con concave I'm convex are you talking about concave yeah talk about concave so yeah I've never really had to deal with that um and uh I I think it sounds like a fun problem you know I would like to uh play with something like that it sounds fun uh but uh I haven't I do know that sha Barrett uh did [Music] um some work with this because uh he wrote a uh flash emulator for rad it's called Iggy um uh I don't know if I can why are you not working stream's still working DNS Server doesn't look like it's working that's not a good sign uhoh all right not great not great uh anyway there's a a thing called Iggy um uh I can't get to the site to show you uh but there's a thing called Iggy that is a a flash emulator for um uh for games like uh if people want um you know to playback flash animations and he you know cuz flash let you do uh arbitrarily um you know arbitrary uh triangle things or whatever arbitrary polygons and animate them around or whatever and so uh he wrote a thing that will triangulate those and I don't actually know what he settled on using uh but you know uh you could ask him sometime when he's streaming to talk about because he probably has an opinion I mean he probably looked at a couple different things and picked an algorithm for some particular reason uh so I don't know uh yeah so I guess only the DNS server got got whacked right um because it looks like it just can't resolve the DNS server but it's not a problem resolving uh traffic so traffic's still going through but the the uh but the DNS server is just out right uh and let's see uh yeah I think that's it for short questions I think that is it so probably going to wind it down for all right all right I think we've got everything uh all the rest of the questions look like kind of involved ones so I'm going to go and of course I can't get to the Internet so that's always good um anyway uh thank you for joining me for a special edition of handmade hero for a chat which we don't often do but we did tonight uh I'll be back next week I think it depends on whether uh I think I will probably be moving the handmade hero machine to the office at some point uh because we had to get a static IP address for the office for various contractual reasons um so although it won't be plugged into anything I could just sit the handmade hero thing there and use that and stream from the office uh which I might decide to do if I do do that we'll probably take a week off at the point where I move it um and get it back up and running there uh but I don't know if and when that's going to happen so as always check out the schedule bot um and uh and figure out whether uh whether that happens by the the handy schedule bot anyway thank you very for joining for your hat chat I think I will be back next week if not we'll be back the week after that one way or the other um and that is about it for now have fun programming this weekend and uh yeah try not to hit any undefined behavior in your compiler until uh next time uh take it easy everyone and I'll see you on the internet Back To Top