Settings

Theme

Named element IDs can be referenced as JavaScript globals

css-tricks.com

177 points by mmazzarolo 3 years ago · 112 comments (111 loaded)

Reader

esprehn 3 years ago

The global scope polluter has pretty bad performance and interop surprises, you shouldn't depend on it and instead use getElementById even if it's a bit more verbose.

It uses a property interceptor which is fairly slow in v8:

https://source.chromium.org/chromium/chromium/src/+/main:out...

to call this mess of security checks:

https://source.chromium.org/chromium/chromium/src/+/main:thi...

which has this interop surprise:

https://source.chromium.org/chromium/chromium/src/+/main:thi...

which in the end scans the document one element at a time looking for a match here:

https://source.chromium.org/chromium/chromium/src/+/main:thi...

In contrast getElementById is just a HashMap lookup, only does scanning if there's duplicates for that id, and never surprisingly returns a list!

  • toddmorey 3 years ago

    I really wish in the source code it was actually named globalScopePolluter()

  • goatlover 3 years ago

    Is there a reason to not use querySelector, since it’s a lot more flexible? One reason jQuery became so popular is because the DOM was painful to use. Things like querySelector fix that.

    • dspillett 3 years ago

      > Is there a reason to not use querySelector

      getElement is slightly faster, but not by enough to care IIRC so I use querySelector for consistency and it's flexibility.

      > One reason jQuery became so popular is because the DOM was painful

      I would say that is the key reason, with everything else being collateral benefits. Assuming you combine element selection, dealing with legacy incompatibilities, and function chaining to reduce boilerplate code, under the same banner of "making the DOM less painful".

    • throwaway0asd 3 years ago

      QuerySelectors are slow. Epic slow. I would also argue that querySelectors are far less flexible and became popular because they are instead easy.

      https://jsbench.github.io/#b39045cacae8d8c4a3ec044e538533dc

      ProTip: Without numbers performance opinions are wrong by several orders of magnitude 80% of the time.

      • wruza 3 years ago

        I’m getting 70mops byId and 23mops qs-#id. This looks like making a huge difference until I add these cases:

          3: "abc”.replace("a", "b")
          4: "1" * 2
        
        Which result in 15mops and 74mops respectively. This test measures diameters of neutrinos so to say.
        • throwaway0asd 3 years ago

          My experience is that most developers tend to guess at performance and throw away numbers they disagree with. As a result performance testing is only something product owners care about.

    • masswerk 3 years ago

      BTW, in my experience getElementById() is still fastest.

      • eyelidlessness 3 years ago

        In isolation definitely, but in real world code it might be faster to use querySelector for branchy code if it doesn’t always use an id. As with everything, if it’s not performance-sensitive write the code that’s easier for humans to read, and if it is measure first.

        • olliej 3 years ago

          I'm not sure what you're trying to say here, as it's tautologically correct that getElementById can't be used in cases where you want to select on more than just the id. Do you mean a use case where you have branchy code that produces a selector string that has some id only paths?

          • eyelidlessness 3 years ago

            Yes. Branchy code which could sometimes use getElementById and other times use querySelector may be faster if it always uses querySelector, even if that call itself is slower. The reason for this is that the JITs sometimes deoptimize on branchy logic with inconsistent property access between branches. They also deoptimize on branchy logic defining intermediate values, but much less often when the value is a consistent type like a string (selector).

            • olliej 3 years ago

              This would only be relevant if you're doing something like

                  var theFunction = condition ? "querySelector" : "getElementById";
                  ...
                  document[theFunction](...)
              
              it won't apply to

                  if (condition)
                    document.querySelector(...)
                  else 
                    document.getElementById(...)
              
              As from the point of view of the runtime the latter has two call sites, and each one is monomorphic and will very quickly (first layer of the JIT tower generally) become a Structure/Shape/HiddenClass check on `document` followed by a direct call to the host environment's implementation function (or more likely the argument checking and marshaling function before the actual internal implementation).

              It is possible that the higher level JITs pay attention to the branch counts on conditions or use other side channels for the deopt, but for host functions it's generally not something that will happen as the JITs see natively implemented functions as largely opaque barriers - they only have a few internal (to the runtime itself) cases where they make any assumptions about the behaviour of host functions.

              • eyelidlessness 3 years ago

                > As from the point of view of the runtime the latter has two call sites, and each one is monomorphic

                I expected that to be the case but I’ve actually measured it and it’s not always. It is, when the object being accessed has a consistent shape/hidden class, as you mention, but a lot of times they don’t. A weird case is native interfaces because while the host functions are opaque and you’d expect they have a stable shape the interfaces themselves are often mutable either for historical reasons or shortcuts taken in newer proposals/implementations. Accessing document.foo isn’t and can’t be monomorphic in many cases, even if it can be treated that way speculatively. But branchy code can throw out all sorts of speculation of that sort. I don’t know which level of the JIT this occurs at, I’m just speaking from having measured it as a user of the APIs.

                • olliej 3 years ago

                  Hmmm, how did you measure?

                  This isn't me disagreeing, just me being surprised and trying to think of why the optimizer falls off.

                  JSC at least has flags on the structure that track which ones will bollocks up caching (e.g. the misery that is looking up things in the prototype chain if the object in question has magic properties that don't influence the structure).

                  One thought I have is if your test case was something like

                     if (a)
                       obj.doTheNativeThing()
                     else
                       obj.doTheOtherNativeThing()
                  
                  (or whatever)

                  and you primed the caches by having a being true/false be a 50/50 split, vs all one way. My thinking (I have not done any of the debugging or logging) is that the branch that isn't taken won't insert any information about the call target. I can see that resulting in the generated code in the optimizing layers of the JITs being something along the lines of

                      if (a)
                         call _actualNativeFunction
                      else
                         deopt
                  
                  The deopt terminates the execution flow so then in principle the VM gets to make assumptions about the code state after the whole if/else block, but more importantly the actual size of the code for the function is smaller, and so if you were close to the inlining limit dropping the content of the else branch _could_ result in your test function getting inlined, and then follow on optimizations can happen in the context of the function that you use to run your test with. Even if there aren't magic follow on optimizations removing the intermediate call can itself be a significant perf win.

                  Testing the performance of engines was super annoying back when I worked on JSC, as you have to try and construct real test cases, but that means competing with your test functions being inlined. JSC (and presumably other engines) have things you can do (outside of the browser context) to explicitly prevent inlining of a function, but then that is also not necessarily realistic. But it's super easy to accidentally make useless test cases, e.g.

                      function runTest(f) {
                        let start = new Date;
                        for (let j = 0; j < 10000; j++)
                          f()
                        let end = new Date;
                        console.log(end - start)
                      }
                  
                      function test1() {
                        ...
                      }
                  
                      function test2() {
                        ...
                      }
                  
                      runTest(test1)
                      runTest(test2)
                  
                  In the first run with test1, f (in runTest) is obviously monomorphic, so the JIT happily inlines it (for the sake of the example assume both functions are below the max inlining size). The next run with test2 makes f polymorphic so runTest gets recompiled and doesn't inline. Now if test1 and test2 are both small the overhead of the call can dominate the cpu time taken which means that if you simply force no inlining of the function you may no longer be getting any useful information, which is obviously annoying :D
      • devmor 3 years ago

        The performance difference is negligible. Both methods can return 70k-100k selections in 10ms.

      • olliej 3 years ago

        That’s surprising, webkit+blink and I’m guessing gecko all optimize the query selector cases. I assume it’s the cost of the NodeList (because NodeLists are live :-/)

        • masswerk 3 years ago

          May be worth testing it against getElementsByClassName(), which also returns a live collection.

          • olliej 3 years ago

            I actually just went and tested and in webkit at least my 100% perfect test case I had querySelector taking 2x longer than getElementById. I tried understanding what the current webkit code does but the selector matching code is now excitingly complex due to the CSS JIT.

            Many many years ago I recall querySelector starting out with a check for #someCSSIdentifier and shortcutting to the getElementById path, but maybe my memory is playing tricks on me.

            • esprehn 3 years ago

              Yup that's what it did, and Chrome still does. After much research and prototyping the CSS JIT didn't improve real world content (especially given the complexity) so it was never added to Chrome.

dfabulich 3 years ago

I'm surprised to find that this trick still works even in the new backwards-incompatible JavaScript Modules (using <script type="module">), which enables "strict" mode and a number of other strictness improvements by default.

I believe it works because the global object ("globalThis") is the Window in either case; this is why JavaScript Modules can refer to "window" in the global scope without explicitly importing it.

    <!DOCTYPE html><body>
        <div id="cool">cool</div>
        <script>
            console.log(this); // Window
            console.log(globalThis); // Window
            console.log("script", cool.innerHTML); // script cool
        </script>
        <script type="module">
            console.log(this); // undefined
            console.log(globalThis); // Window
            console.log("module", cool.innerHTML); // module cool
        </script>
    </body></html>
This seems like a missed opportunity. JavaScript Modules should have been required to "import {window} from 'dom'" or something, clearing out its global namespace.
stonewareslord 3 years ago

I don't think this article is complete. It mentions no pollution, which is true of window and most HTML elements, but not always. Check this out, you can set an img name to getElementById and now document.getElementById is the image element!

Here's a minimal example (https://jsfiddle.net/wc5dn9x2/):

    <img id="asdf" name="getElementById" />
    <script>
        // The img object
        console.log(document.getElementById);

        // TypeError: document.getElementById is not a function :D
        console.log(document.getElementById('asdf'));
    </script>
I tried poking around for security vulnerabilities with this but couldn't find any :(

It seems that the names overwrite properties on document with themselves only for these elements: embed form iframe img object

Edit: Here's how I found this: https://jsfiddle.net/wc5dn9x2/1/

  • jefftk 3 years ago

    Note that this is with the name attribute, not the id attribute the article is discussing.

  • dwild 3 years ago

    Curiously the article doesn't mentions it, but theses kinds of vulnerabilities are named DOM clobbering if you want to know more about it!

    It's weirdly not that discussed on the web, most probably because it require a pretty specific situation.

    • stonewareslord 3 years ago

      Thank you for this! I had a feeling it wasn't a security issue. I closed my ticket saying it might be one due to finding websites mentioning Dom clobbering

FrontAid 3 years ago

Another similar gotcha is that the global-scoped `name` variable must be a string. See https://developer.mozilla.org/en-US/docs/Web/API/Window/name for details.

    var name = true;
    typeof name; // "string", not "boolean"
Luckily, this is not true within ES modules which you probably use most of the time anymway.
  • esprehn 3 years ago

    That's not magic, it's just how property getter and setters work on the global:

        <script>
            var _value = "test value";
            Object.defineProperty(window, "testName", {
                get: () => _value,
                set: (value) => { _value = String(value) },
            });
        </script>
        <script>
            var testName = {};
            // prints [object Object] string
            console.log(testName, typeof testName);
            var name = {};
            // prints [object Object] string
            console.log(name, typeof name);
        </script>
    
    the `var` doesn't create a new property since the getter and setter already exist.

    Other properties have the same behavior, for example `status`.

    Note: there's also LegacyUnforgeable which has similar behavior: https://webidl.spec.whatwg.org/#LegacyUnforgeable

    Even if you're not using modules, using an IIFE avoids all this by making your variables local instead of having them define/update properties on the global.

  • efdee 3 years ago

    It takes a special kind of human to name variable "name" but not have it be a string.

    • orangecat 3 years ago

      Something like

        name = {first: "Jane", last: "Doe"}
      
      isn't obviously unreasonable. Which actually sets name to the string "[object Object]".
    • sanitycheck 3 years ago

      I work with such humans! I was looking at that exact situation a few moments ago.

    • Minor49er 3 years ago

      I can imagine someone doing this if they were using "name" as a verb

  • simlevesque 3 years ago

    I've been doing JS for like fifteen years, this one I never knew. Wow.

    I must have never used "name" as a name for a global variable or just for ones that were strings.

genezeta 3 years ago

This is one of those things that pops up every year or two years. Unfortunately, the person writing about the new discovered weird trick almost always fails to precede the article with a big, red, bold "Please don't ever do this".

  • svnpenn 3 years ago

    and then someone always follows up with "Please don't ever do this", without explaining WHY you should never do this:

    https://wikipedia.org/wiki/Wikipedia:Chesterton's_fence

    • FrontAid 3 years ago

      The article already explains that thoroughly.

    • pmoleri 3 years ago

      Nice article, thanks for sharing it.

    • genezeta 3 years ago

      It's has been explained enough times. It's just that looking things up for yourself seems to have gone out of fashion.

      • spookthesunset 3 years ago

        That doesn’t help people who stumble upon this when searching for the problem. All the “look it up” response does is make sure the search results are a bunch of content saying “look it up”, which isn’t really that helpful.

        • llanowarelves 3 years ago

          That's a classic.

          Get my hopes up finding an old forum post asking my question, hoping to find answers. All the answers are "use Google/etc", which is how I got there.

        • nkozyra 3 years ago

          It is explained fairly early in the article.

          This used to be done quite a lot in the early JS days when scope was kind of thrown out the window (no pun) and you just did whatever dirty thing you needed to in order to make a page work.

      • pierrec 3 years ago

        lol, I just searched "problem with referencing named element ids as javascript globals": the first result is the linked article and the second result is, you guessed it, this thread with your comment on top.

      • scratcheee 3 years ago

        >the person writing about the new discovered weird trick almost always fails to precede the article with a big, red, bold "Please don't ever do this"

        > It's has been explained enough times. It's just that looking things up for yourself seems to have gone out of fashion.

        It appears you've countered your own complaint.

  • LelouBil 3 years ago

    I discovered that with a couple of friends while in JavaScript class. Every one of us was like "this is actually horrible".

mmastrac 3 years ago

This has been a thing since the 90s. I really wish we'd done away with it for any document that specifies itself as HTML5.

It's great for hacking a tiny script together, however.

  • esprehn 3 years ago

    Yeah, HTML5 explicitly documented the compatible behaviors between browsers to reach uniformity, which meant standardizing a lot of weird stuff instead of trying to fix it.

    See for example this thread where Mozilla tried to not do this: https://bugzilla.mozilla.org/show_bug.cgi?id=622491

  • russellbeattie 3 years ago

    Yep, same here. The only time I use this bit of knowledge nowadays is in the console. If I see a tag has an ID, I save myself a few characters by just referring to it as a variable since I know it's already there anyways.

    IDs were the only way to get a reference to an element early on if I'm remembering correctly. Or maybe the DOM API just wasn't well known. All the examples and docs just used IDs, that I can remember for sure.

samtho 3 years ago

This always reminded me of PHP’s infamous register_globals. For those unfamiliar, anything in the ‘$_REQUEST’ array (which itself comprises of $_POST, $_GET, and $_COOKIE merged together) is added to the global scope. So if you made a request to index.php?username=root, $username would contain “root” unless it was explicitly initialized it before it was used.

twicetwice 3 years ago

iirc this doesn't work in Firefox? or at least it doesn't work the same way as in Chrome. I developed a tiny home-cooked app[0] that depended on this behavior using desktop Chrome which then broke when I tried to use it on mobile Firefox. I then switched it to using

  document.getElementById
like I should have and everything worked fine. Like others in this thread, I recommend not relying on this behavior.

[0]: https://www.robinsloan.com/notes/home-cooked-app/

shadowgovt 3 years ago

> It is implemented differently in browsers

In 2022, that alone is enough to wipe it from my toolbox as a web developer. Ain't nobody got time for that.

(... there are lots of other reasons it'd be bad practice to rely on this as well, although it's nice for debugging when available).

kiawe_fire 3 years ago

Seems like something that could have been made safer just by name spacing it a bit better.

Something like “window.elements.myDiv”? I wonder why the decision to go straight to the root.

  • dphnx 3 years ago

    `document.all` can be used in this way:

      <div id="foo"></div>
      <script>
        const { foo } = document.all
        // do something with foo
      </script>
    
    Don't use it though, it's deprecated as well[1].

    [1]: https://developer.mozilla.org/en-US/docs/Web/API/Document/al...

  • akira2501 3 years ago

    You can make this yourself with Proxy. I get a lot of mileage out of this:

        // proxy to simplify loading and caching of getElementById calls
        const $id = new Proxy({}, {
            // get element from cache, or from DOM
            get: (tgt, k, r) => (tgt[k] || ((r = document.getElementById(k)) && (tgt[k] = r))),
            
            // prevent overwriting
            set: () => $throw(`Attempt to overwrite id cache key!`)
        });
    
    
    Now if you have

        <div id="something></div>
    
    You can just do

        $id.something.innerHTML = 'inside!';
  • bobince 3 years ago

    The Netscape of the 90s wasn't interested in making features ‘safe’. They were about throwing out features as quickly as possible to see what would stick.

    The simplest possible syntax is to make named elements available globally, and if that clashes with future additions to the DOM API then well that's a problem for some future idiots to worry about.

    as a strategy it worked pretty well, unfortunately

    • WorldMaker 3 years ago

      As the article points out, this initiative was an 90s IE one and the Gecko team (Firefox, post-Netscape) were against it.

bjkchoy 3 years ago

I saw this "shortcut" used in code snippets, on online JS/CSS/HTML editors like JSFiddle. It did not even occur to me this was part of JS spec, I thought the editor was generating code behind my back!

  • seba_dos1 3 years ago

    > It did not even occur to me this was part of JS spec,

    It has nothing to do with JS spec; it's part of the DOM as defined by the HTML spec.

angelmm 3 years ago

Now I'm worried of using IDs and finding issues with globals in JavaScript. Seems to be a curious issue to be debugged.

  • mmastrac 3 years ago

    Avoid globals at all costs - use IIFE [1] instead, wrapping your function in parenthesis and invoking it right away.

    [1] https://developer.mozilla.org/en-US/docs/Glossary/IIFE

    • recursive 3 years ago

      If you have access to `let`, you can just put `let` declarations into a block. No need for a function to establish scope.

    • an1sotropy 3 years ago

      When, today, does it make more sense to organize things around IIFEs and not ES6 modules?

    • WorldMaker 3 years ago

      It's 2022, you can use ES2015 modules now. We can leave IIFE to the dustbin of the past.

    • jbverschoor 3 years ago

      And then get coworkers to remove it because they don’t understand that you can create scopes like that

    • throw_m239339 3 years ago

          {
              let foo = 1
          };
          // foo is undefined here
  • Slackwise 3 years ago

    If you read the article and the spec, you'll see that any explicitly created variables will always take precedence over automatic IDs, so any globals will always override these IDs.

  • bhhaskin 3 years ago

    You shouldn't be using IDs anyways. They are just bad for a lot of reasons. You can only have one on a page and it reduces your reusability. Use classes instead.

    • err4nt 3 years ago

      ID's aren't bad, they're unique identifiers, and useful for (deep) linking to specific pieces of content within documents. Please use ID's as liberally as you please, and use them for their proper use.

    • goatlover 3 years ago

      Use ids when JS needs to reference unique elements. Use classes for styling and accessing groups.

      • isleyaardvark 3 years ago

        JS can do just as well with unique classnames, which avoids issues with ids like those given in the article.

        • sgc 3 years ago

          I always presumed this would usually entail a performance hit, since you are accessing something that is not defined as unique.

grenoire 3 years ago

I discovered this the hard way and I am still really torn. The entire window global object is just a minefield.

eithed 3 years ago

>To add insult to the injury, named elements are accessible as global variables only if the names contain nothing but letter.

This doesn't seem to be true as shown within this fiddle: https://jsfiddle.net/L785cpdo/1/

Bear in mind that only undefined elements will be declared this way

  • mmazzaroloOP 3 years ago

    Author here. That was a mistake on my part, it shouldn't have slipped in :) I removed that section, thanks for reporting!

roberttod 3 years ago

For me, the disadvantage above any listed on the blog is that if I saw this global variable referenced in some code (especially old code, where some parts might be defunct), I would have absolutely no idea where it came from, and I bet a lot of others would struggle too.

croes 3 years ago

Isn't the opposite of

>So, if a DOM element has an id that is already defined as a global, it won’t override the existing one.

So, if a global has name of the id of a DOM element, it won’t override the existing one?

Wouldn't it be clearer to say globals always before DOM ids?

beebeepka 3 years ago

This was mostly useful back in the days when we had to manually query dom during development and debugging. I've seen some pretty horrible things but never have I seen this in a codebase, not even in a commit

  • 7952 3 years ago

    I remember using it on the first Javascript I ever used around 20 years ago. I naively assumed that the DOM was like state in a more procedural language and this variable trick played into that.

bsimpson 3 years ago

I've definitely used this shortcut to make CodePens less verbose…

I wouldn't use it in production, but it's handy for banging together a proof-of-concept.

monkpit 3 years ago

It hurts

  • SpaceL10n 3 years ago

    I think it would hurt less with TypeScript global types. Just need to know what IDs you'd expect to find in the DOM.

    • err4nt 3 years ago

      the ID's in DOM will never conflict or cause an issue with your own JS code. You can't reliably use 'named access on the window object' (the name of this feature) because of this, so it's never a problem, and also largely useless.

codedokode 3 years ago

It would make sense to disable this with new release of HTML, for example if the author uses an HTML6 doctype.

graderjs 3 years ago

And named form controls are accessible as properties of the same name on their form element.

tambourine_man 3 years ago

*rigamorale

Should read “rigamarole”

pkrumins 3 years ago

This is my favorite HTML and JS feature!

debacle 3 years ago

I don't want to sound like I have an axe to grind (but I do), but this is the kind of feature/wart that shows the age of the HTML/CSS/JS stack.

The whole thing is ripe for a redo. I know they get a lot of hate, but of all the big players in this space I think FB is the best equipped to do this in a way that doesn't ruin everything. I just wonder if they have an incentive (maybe trying to break the Google/MS hegemony on search?).

  • eptcyka 3 years ago

    The best equipped to do this are Google/MS/Apple because they actually control the source code of relevant contemporary browsers.

    • debacle 3 years ago

      I think that this is the case (right now) because of Apple's stranglehold on the browser on iOS and the complex relationship between Google/Apple.

      If FB could launch a browser on iOS that was in their walled garden, not only would it quickly receive wide adoption but it might become people's primary browser.

      Not that I necessarily think that's a good thing, mind you.

      • eptcyka 3 years ago

        Why would it quickly receive any adoption? Of all of the behemoths, I would trust FB the least here. Not that I trust any of the other big players enough not to use Firefox everywhere I can.

        • debacle 3 years ago

          The word "trust" doesn't factor into ~90% of users' decisions.

          If FB says "hey install this app," they will install it.

  • WallyFunk 3 years ago

    > The whole thing is ripe for a redo

    Web developers have worked around quirks for as long as I can remember. The stack has many warts, but we learn to adapt to them. Like 90% of a web developer's job is working around gotchas, and will continue that way. A 'redo' might not be needed. Developers need something to moan about and need something to keep them employed :)

  • doliveira 3 years ago

    I find it pretty funny that we humans have invented all these transpilers and bundlers, invested probably billions of dollars in JITs, just to keep writing JS

  • tfsh 3 years ago

    Could you explain how rewriting one of the worlds most complex and critical specifications would break of the Google/MS hegemony on search?

    • debacle 3 years ago

      Sorry, what I meant was:

      "If FB decided to try and break into search, then they might decide to attack the HTML/CSS/JS stack."

      Not the other way around.

  • goatlover 3 years ago

    There’s always WASM, and I think Zuck is more interested in VR than trying to push a new web standard.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection